Destination

2025-08-30

AI researcher Andrej Karpathy says he's "bearish on reinforcement learning" for LLM training


Andrej Karpathy, a former Tesla and OpenAI researcher, is part of a growing movement in the AI community calling for a new approach to building large language models (LLMs) and AI systems.


The article AI researcher Andrej Karpathy says he's "bearish on reinforcement learning" for LLM training appeared first on THE DECODER.

[...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-21

DeepSeek drops open-source model that compresses text 10x through images, defying conventions

DeepSeek, the Chinese artificial intelligence research company that has repeatedly challenged assumptions about AI development costs, has released a new model that fundamentally reimagines how large l [...]

Match Score: 214.80

Destination

2025-10-18

AI researcher Andrej Karpathy says agentic AI is years away from matching industry hype

Andrej Karpathy, former AI researcher at OpenAI and Tesla, is skeptical of the current hype surrounding agent-based AI and large language models.<br /> The article AI researcher Andrej Karpathy [...]

Match Score: 206.12

Destination

2025-03-13

AI expert Andrej Karpathy envisions a web where 99.9% of content is optimized for AI, not humans

Former OpenAI researcher Andrej Karpathy envisions a future where large language models (LLMs) become the primary interface for content.<br /> The article AI expert Andrej Karpathy envisions a w [...]

Match Score: 142.51

venturebeat

2025-10-01

Thinking Machines' first official product is here: meet Tinker, an API for distributed LLM fine-tuning

Thinking Machines, the AI startup founded earlier this year by former OpenAI CTO Mira Murati, has launched its first product: Tinker, a Python-based API designed to make large language model (LLM) fin [...]

Match Score: 108.11

venturebeat

2025-10-16

Under the hood of AI agents: A technical guide to the next frontier of gen AI

Agents are the trendiest topic in AI today — and with good reason. Taking gen AI out of the protected sandbox of the chat interface and allowing it to act directly on the world represents a leap for [...]

Match Score: 78.06

venturebeat

2025-10-27

Google Cloud takes aim at CoreWeave and AWS with managed Slurm for enterprise-scale AI training

Some enterprises are best served by fine-tuning large models to their needs, but a number of companies plan to build their own models, a project that would require access to GPUs. Google Cloud wants [...]

Match Score: 68.27

venturebeat

2025-10-09

Nvidia researchers boost LLMs reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 64.37

venturebeat

2025-10-29

Nvidia researchers unlock 4-bit LLM training that matches 8-bit performance

Researchers at Nvidia have developed a novel approach to train large language models (LLMs) in 4-bit quantized format while maintaining their stability and accuracy at the level of high-precision mode [...]

Match Score: 58.03

Destination

2025-06-28

Shopify CEO and ex-OpenAI researcher agree that context engineering beats prompt engineering

Shopify CEO Tobi Lütke and former Tesla and OpenAI researcher Andrej Karpathy say "context engineering" is more useful than prompt engineering when working with large language models.<br [...]

Match Score: 56.71