The Institute of the Estonian Language has released a benchmark measuring how susceptible AI language models are to Russian propaganda.<br /> The article How easily can Russian propaganda fool AI models? A new benchmark finds out appeared first on The Decoder. [...]
A Moscow-based disinformation operation is systematically feeding Russian propaganda into Western AI systems through a vast network of fake news sites called "Pravda" (Russian for "trut [...]
For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same. OpenAI's GPT-5 family, Anthropic's Claude [...]
AI agents are now embedded in real enterprise workflows, and they're still failing roughly one in three attempts on structured benchmarks. That gap between capability and reliability is the defin [...]
On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14 [...]
A growing number of developers and AI power users are taking to social media to accuse Anthropic of degrading the performance of Claude Opus 4.6 and Claude Code — intentionally or as an outcome of c [...]
Anthropic today launched two new AI models — Claude Fable 5 and Claude Mythos 5 — marking the company’s first broad release of the powerful “Mythos-class” AI capabilities it previously kept [...]
There's no shortage of generative AI benchmarks designed to measure the performance and accuracy of a given model on completing various helpful enterprise tasks — from coding to instruction fol [...]
A decision to ban Telegram on home soil may have backfired on the Kremlin. Last week, Russia went on a blocking spree, banning a number of Western apps in an effort to push domestic users towards Max, [...]