2025-08-07
In the ARC-AGI-2 benchmark, which is designed to measure a language model's general reasoning skills, GPT-5 (High) scored 9.9 percent at a cost of $0.73 per task, according to ARC Prize.
The article Grok 4 edges out GPT-5 in complex reasoning benchmark ARC-AGI appeared first on THE DECODER.
[...]2025-07-12
The team behind Grok has issued a rare apology and explanation of what went wrong after X's chatbot began spewing antisemitic and pro-Nazi rhetoric earlier this week, at one point even calling it [...]
2025-02-18
xAI has launched its Grok 3 models during a livestream with Elon Musk, who said they were "an order of magnitude more capable than Grok 2." The Grok 3 mini model can answer questions quickly [...]
2025-07-09
One day after Grok posted a series of antisemitic and pro-Nazi rants on X, Elon Musk is seemingly trying to blame rogue users for the chatbot's unhinged posts. "Grok was too compliant to use [...]
2025-07-11
Grok 4 aligns its answers with Elon Musk's when it comes to controversial issues, users have discovered shortly after the company launched the new model. Some users posted screenshots on X asking [...]
2025-07-10
xAI has officially lunched Grok 4 during a livestream with Elon Musk, who called it the "smartest AI in the world." He said that if you make the Grok 4 take the SATs and the GREs, it would g [...]