wired
Pioneers of Reinforcement Learning Win the Turing Award

Having machines learn from experience was once considered a dead end. It's now critical to artificial intelligence, and work in the field has won two men the highest honor in computer science. [...]

venturebeat
Self-improving language models are becoming reality with MIT's updated SEAL technique

Researchers at the Massachusetts Institute of Technology (MIT) are gaining renewed attention for developing and open sourcing a technique that allows large language models (LLMs) — like those underp [...]

Match Score: 95.56

Destination
Algorithms from the 1980s power today's AI breakthroughs, earn Turing Award for researchers

Andrew Barto and Richard Sutton have won the 2024 A.M. Turing Award for developing key technologies that power modern artificial intelligence, including recent breakthroughs in large reasoning models. [...]

Match Score: 74.57

fastcompany
AI pioneers win the Turing Award, tech’s top prize

Andrew Barto and Richard Sutton are the winners of this year’s A.M. Turing Award, the tech world’s equivalent of the Nobel Prize. [...]

Match Score: 63.86

venturebeat
Nvidia researchers boost LLMs' reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 63.71

venturebeat
MIT's new fine-tuning method lets LLMs learn new skills without losing old ones

When enterprises fine-tune LLMs for new tasks, they risk breaking everything the models already know. This forces companies to maintain separate models for every skill. Researchers at MIT, the Improbab [...]

Match Score: 63.34

venturebeat
Google finds that AI agents learn to cooperate when trained against unpredictable opponents

Training standard AI models against a diverse pool of opponents — rather than building complex hardcoded coordination rules — is enough to produce cooperative multi-agent systems that adapt to eac [...]

Match Score: 51.20

venturebeat
Google’s new AI training method helps small models tackle complex reasoning

Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning task [...]

Match Score: 50.81

venturebeat
AI agents fail 63% of the time on complex tasks. Patronus AI says its new 'living' training worlds can fix that.

Patronus AI, the artificial intelligence evaluation startup backed by $20 million from investors including Lightspeed Venture Partners and Datadog, unveiled a new training architecture Tuesday that it [...]

Match Score: 49.30

Destination
AI pioneers warn OpenAI's corporate overhaul could betray its original mission for humanity

A group of former OpenAI employees, researchers, and nonprofit organizations is urging regulators to block OpenAI’s proposed corporate restructuring, arguing it threatens the company’s founding mi [...]

Match Score: 42.62