Destination

2025-06-07

Apple study finds "a fundamental scaling limitation" in reasoning models' thinking abilities

A new Apple study shows that current reasoning models such as Claude 3.7 Thinking or Deepseek-R1 not only fail with complex logic tasks, but paradoxically even think less with increasing difficulty. The models show three levels of performance: for simple tasks, classic language models without a special thinking function are more precise; for medium complexity, reasoning models have advantages; for high complexity, all models break down completely - regardless of the available computing budget. The researchers speak of a fundamental scaling limit of the reasoning approach and do not see any generalizable problem-so [...]</p>
                    <!-- Buttons -->
			        <div class= Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 85.33

Destination

2025-07-04

Apple's claims about large reasoning models face fresh scrutiny from a new study

A replication study of Apple's controversial "The Illusion of Thinking" paper confirms some of its main criticisms, but challenges the study's central conclusion.<br /> The a [...]

Match Score: 82.67

Destination

2025-09-26

Today's best iPad deals include a record-low price on the latest iPad Air M3

Apple's four iPad models each have their value — the mini is super portable, the standard model with the A16 chip is ideal for casual use while the Pros can handle complex tasks better than som [...]

Match Score: 75.83

Destination

2025-02-26

The best Apple Watch in 2025

If you know you want an Apple Watch, but aren’t sure which one to get, this guide is here to explain the differences between the three models. The company’s flagship Apple Watch Series 10 has robu [...]

Match Score: 71.10

venturebeat

2025-10-01

Thinking Machines' first official product is here: meet Tinker, an API for distributed LLM fine-tuning

Thinking Machines, the AI startup founded earlier this year by former OpenAI CTO Mira Murati, has launched its first product: Tinker, a Python-based API designed to make large language model (LLM) fin [...]

Match Score: 70.85

Destination

2025-04-22

So-called reasoning models are more efficient but not more capable than regular LLMs, study finds

A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]

Match Score: 66.42

Destination

2025-09-19

The best iPad deals available today include $150 off the iPad Air M3

It’s been a big week in Apple world: The new iPhone 17, iPhone Air and iPhone 17 Pros went up for sale globally on Friday, while the latest major updates for iOS, macOS and Apple’s other operating [...]

Match Score: 63.14

Destination

2025-07-11

The best Prime Day Apple deals on AirPods, iPads, MacBooks and more for the last day of the sale

The last day of Amazon’s sale has arrived. Throughout the sale, we've been updating this with the best Apple Prime Day deals we could find. Since last July, Apple has released around a dozen ne [...]

Match Score: 61.79

Destination

2025-04-05

Anthropic study finds language models often hide their reasoning process

A new Anthropic study suggests language models frequently obscure their actual decision-making process, even when they appear to explain their thinking step by step through chain-of-thought reasoning. [...]

Match Score: 61.02