2025-05-28
A new study from Microsoft and Salesforce finds that even state-of-the-art AI language models become dramatically less reliable as conversations get longer and users reveal their requirements step by step. On average, the systems' performance dropped by 39 percent in these scenarios.
2025-04-27
Meta’s AI chatbots were caught having sexual roleplay conversations with accounts labeled as underage, which sometimes involved its celebrity-voiced chatbots, according to a report from the Wall Str [...]
2025-05-26
A new study suggests that despite the rapid rise and widespread adoption of AI chatbots like ChatGPT, their impact on wages and working hours has been minimal so far. The findings challenge expectatio [...]
2025-06-07
LLMs designed for reasoning, like Claude 3.7 and DeepSeek-R1, are supposed to excel at complex problem-solving by simulating thought processes. But a new study by Apple researchers suggests that these [...]
2025-07-29
OpenAI is rolling out a new Study Mode the company says is designed to give students a better understanding of complex topics. Like Claude's Learning Mode, which Anthropic introduced in April, St [...]
2025-07-10
A new study analyzing 25 language models finds that most do not fake safety compliance, though not for lack of capability.
2025-04-22
A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]
2025-06-25
AI tools like ChatGPT are rapidly changing daily life for teachers in the US, according to a new Gallup study.
2025-03-18
AI-powered visits to retail websites increased 1,200 percent between July 2024 and February 2025, according to a new Adobe Analytics study.