wired

2025-03-05

Pioneers of Reinforcement Learning Win the Turing Award

Having machines learn from experience was once considered a dead end. It's now critical to artificial intelligence, and work in the field has won two men the highest honor in computer science. [...]

venturebeat

2025-10-13

Self-improving language models are becoming reality with MIT's updated SEAL technique

Researchers at the Massachusetts Institute of Technology (MIT) are gaining renewed attention for developing and open sourcing a technique that allows large language models (LLMs) — like those underp [...]

Match Score: 129.78

Destination

2025-03-05

Algorithms from the 1980s power today's AI breakthroughs, earn Turing Award for researchers

Andrew Barto and Richard Sutton have won the 2024 A.M. Turing Award for developing key technologies that power modern artificial intelligence, including recent breakthroughs in large reasoning models. [...]

Match Score: 87.26

venturebeat

2025-10-09

Nvidia researchers boost LLMs reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 86.52

fastcompany

2025-03-05

AI pioneers win the Turing Award, tech’s top prize

Andrew Barto and Richard Sutton are the winners of this year’s A.M. Turing Award, the tech world’s equivalent of the Nobel Prize. [...]

Match Score: 74.26

venturebeat

2025-10-29

Vibe coding platform Cursor releases first in-house LLM, Composer, promising 4X speed boost

The vibe coding tool Cursor, from startup Anysphere, has introduced Composer, its first in-house, proprietary coding large language model (LLM) as part of its Cursor 2.0 platform update. Composer is d [...]

Match Score: 55.93

venturebeat

2025-10-24

Thinking Machines challenges OpenAI's AI scaling strategy: 'First superintelligence will be a superhuman learner'

While the world's leading artificial intelligence companies race to build ever-larger models, betting billions that scale alone will unlock artificial general intelligence, a researcher at one of [...]

Match Score: 52.08

Destination

2025-04-21

Go read this to learn how reinforcement learning makes LLMs better at reasoning

AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LLMs). [...]

Match Score: 49.60

Destination

2025-06-24

Researchers train AI to generate long-form text using only reinforcement learning

A research team from Singapore and China has introduced LongWriter-Zero, an AI model that uses reinforcement learning to write texts longer than 10,000 words—without relying on synthetic training da [...]

Match Score: 49.60

Destination

2025-09-01

Prime Intellect launches an open platform for reinforcement learning environments

Prime Intellect, a San Francisco AI startup, has launched the Environments Hub, an open platform for building and sharing reinforcement learning (RL) environments. The aim is to counter the closed sys [...]

Match Score: 49.60