2025-04-21
AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LRMs).
The article Go read this to learn how reinforcement learning makes LLMs better at reasoning appeared first on THE DECODER.
[...]2025-09-30
Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]
2025-09-29
DeepSeek continues to push the frontier of generative AI...in this case, in terms of affordability.The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that [...]
2025-10-02
IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]
2025-05-27
Microsoft's recent release of Phi-4-reasoning challenges a key assumption in building artificial intelligence systems capable of reasoning. Since the introduction of chain-of-thought reasoning in [...]
2025-04-22
A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]
2025-06-24
A research team from Singapore and China has introduced LongWriter-Zero, an AI model that uses reinforcement learning to write texts longer than 10,000 words—without relying on synthetic training da [...]
2025-09-01
Prime Intellect, a San Francisco AI startup, has launched the Environments Hub, an open platform for building and sharing reinforcement learning (RL) environments. The aim is to counter the closed sys [...]
2025-04-05
In recent years, the AI field has been captivated by the success of large language models (LLMs). Initially designed for natural language processing, these models have evolved into powerful reasoning [...]
2025-08-30
Andrej Karpathy, a former Tesla and OpenAI researcher, is part of a growing movement in the AI community calling for a new approach to building large language models (LLMs) and AI systems.<br /> [...]