Destination

2025-08-07

Grok 4 edges out GPT-5 in complex reasoning benchmark ARC-AGI


In the ARC-AGI-2 benchmark, which is designed to measure a language model's general reasoning skills, GPT-5 (High) scored 9.9 percent at a cost of $0.73 per task, according to ARC Prize.


The article Grok 4 edges out GPT-5 in complex reasoning benchmark ARC-AGI appeared first on THE DECODER.

[...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Destination

2025-07-09

Grok sure seems antisemitic after its recent update

Last Friday, Elon Musk said that X's built-in chatbot had been "significantly" improved. "You should notice a difference when you ask Grok questions," Musk said on X. As it tu [...]

Match Score: 171.92

Destination

2025-07-12

Grok team apologizes for the chatbot's 'horrific behavior' and blames 'MechaHitler' on a bad update

The team behind Grok has issued a rare apology and explanation of what went wrong after X's chatbot began spewing antisemitic and pro-Nazi rhetoric earlier this week, at one point even calling it [...]

Match Score: 166.22

Destination

2025-02-18

xAI launches Grok 3 AI, claiming it is capable of 'human reasoning'

xAI has launched its Grok 3 models during a livestream with Elon Musk, who said they were "an order of magnitude more capable than Grok 2." The Grok 3 mini model can answer questions quickly [...]

Match Score: 161.44

Destination

2025-07-10

How exactly did Grok go full 'MechaHitler?'

Earlier this week, Grok, X's built-in chatbot, took a hard turn toward antisemitism following a recent update. Amid unprompted, hateful rhetoric against Jews, it even began referring to itself as [...]

Match Score: 149.10

Destination

2025-07-09

Elon Musk is trying to blame Grok's Nazi rants on rogue X users

One day after Grok posted a series of antisemitic and pro-Nazi rants on X, Elon Musk is seemingly trying to blame rogue users for the chatbot's unhinged posts. "Grok was too compliant to use [...]

Match Score: 142.67

Destination

2025-01-09

X's Grok AI assistant is now a standalone app

Grok, the AI assistant that's for some reason baked into X, is now available as a standalone app. Like the version that exists as a tab on the social media platform, the Grok app can be used to g [...]

Match Score: 131.97

Destination

2025-07-11

Grok 4 reportedly checks Elon Musk's views before offering its opinion

Grok 4 aligns its answers with Elon Musk's when it comes to controversial issues, users have discovered shortly after the company launched the new model. Some users posted screenshots on X asking [...]

Match Score: 124.96

Destination

2025-07-10

Elon Must spent almost an hour talking about Grok without mentioning its Nazi problem

xAI has officially lunched Grok 4 during a livestream with Elon Musk, who called it the "smartest AI in the world." He said that if you make the Grok 4 take the SATs and the GREs, it would g [...]

Match Score: 114.11

Destination

2025-08-07

GPT-5 is here and it's free for everyone

A couple of days after announcing its first open-weight models in six years, OpenAI is releasing the long-awaited GPT-5. What's more, you can start using it today, even if you're a free user [...]

Match Score: 111.77