Destination

2025-04-21

Go read this to learn how reinforcement learning makes LLMs better at reasoning


AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LRMs).


The article Go read this to learn how reinforcement learning makes LLMs better at reasoning appeared first on THE DECODER.

[...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-09-30

Meta’s new CWM model learns how code works, not just what it looks like

Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]

Match Score: 93.75

venturebeat

2025-09-29

DeepSeek's new V3.2-Exp model cuts API pricing in half to less than 3 cents per 1M input tokens

DeepSeek continues to push the frontier of generative AI...in this case, in terms of affordability.The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that [...]

Match Score: 79.45

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 70.96

Destination

2025-05-27

How Phi-4-Reasoning Redefines AI Reasoning by Challenging “Bigger is Better” Myth

Microsoft's recent release of Phi-4-reasoning challenges a key assumption in building artificial intelligence systems capable of reasoning. Since the introduction of chain-of-thought reasoning in [...]

Match Score: 61.87

Destination

2025-04-22

So-called reasoning models are more efficient but not more capable than regular LLMs, study finds

A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]

Match Score: 60.81

Destination

2025-06-24

Researchers train AI to generate long-form text using only reinforcement learning

A research team from Singapore and China has introduced LongWriter-Zero, an AI model that uses reinforcement learning to write texts longer than 10,000 words—without relying on synthetic training da [...]

Match Score: 54.54

Destination

2025-09-01

Prime Intellect launches an open platform for reinforcement learning environments

Prime Intellect, a San Francisco AI startup, has launched the Environments Hub, an open platform for building and sharing reinforcement learning (RL) environments. The aim is to counter the closed sys [...]

Match Score: 54.54

Destination

2025-04-05

The Rise of Small Reasoning Models: Can Compact AI Match GPT-Level Reasoning?

In recent years, the AI field has been captivated by the success of large language models (LLMs). Initially designed for natural language processing, these models have evolved into powerful reasoning [...]

Match Score: 48.90

Destination

2025-08-30

AI researcher Andrej Karpathy says he's "bearish on reinforcement learning" for LLM training

Andrej Karpathy, a former Tesla and OpenAI researcher, is part of a growing movement in the AI community calling for a new approach to building large language models (LLMs) and AI systems.<br /> [...]

Match Score: 47.59