Destination

2025-08-18

GEPA optimizes LLMs without costly reinforcement learning

Moving beyond the slow, costly trial-and-error of RL, GEPA teaches AI systems to learn and improve using natural language. [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Destination

2025-04-21

Go read this to learn how reinforcement learning makes LLMs better at reasoning

AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LRMs).<br /> The article Go read this [...]

Match Score: 70.38

venturebeat

2025-09-30

Meta’s new CWM model learns how code works, not just what it looks like

Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]

Match Score: 62.03

Destination

2025-06-24

Researchers train AI to generate long-form text using only reinforcement learning

A research team from Singapore and China has introduced LongWriter-Zero, an AI model that uses reinforcement learning to write texts longer than 10,000 words—without relying on synthetic training da [...]

Match Score: 55.80

Destination

2025-09-01

Prime Intellect launches an open platform for reinforcement learning environments

Prime Intellect, a San Francisco AI startup, has launched the Environments Hub, an open platform for building and sharing reinforcement learning (RL) environments. The aim is to counter the closed sys [...]

Match Score: 54.54

Destination

2025-08-30

AI researcher Andrej Karpathy says he's "bearish on reinforcement learning" for LLM training

Andrej Karpathy, a former Tesla and OpenAI researcher, is part of a growing movement in the AI community calling for a new approach to building large language models (LLMs) and AI systems.<br /> [...]

Match Score: 47.59

venturebeat

2025-09-29

DeepSeek's new V3.2-Exp model cuts API pricing in half to less than 3 cents per 1M input tokens

DeepSeek continues to push the frontier of generative AI...in this case, in terms of affordability.The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that [...]

Match Score: 44.89

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 40.16

Destination

2025-04-22

So-called reasoning models are more efficient but not more capable than regular LLMs, study finds

A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]

Match Score: 37.66

Destination

2025-01-20

Open-source DeepSeek-R1 uses pure reinforcement learning to match OpenAI o1 — at 95% less cost

The company developed DeepSeek-R1 by using pure reinforcement learning on top of DeepSeek-V3-Base, and matched or beat o1 on some benchmarks. [...]

Match Score: 32.72