Destination

2025-05-28

AI chatbots become dramatically less reliable in longer conversations, new study finds

Graphic representation: Colorful speech bubbles with symbols for questions, thoughts, statements; illustrate communication diversity.


A new study from Microsoft and Salesforce finds that even state-of-the-art AI language models become dramatically less reliable as conversations get longer and users reveal their requirements step by step. On average, the systems' performance dropped by 39 percent in these scenarios.


The article Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Destination

2025-04-27

Meta’s AI chatbots were reportedly able to engage in sexual conversations with minors

Meta’s AI chatbots were caught having sexual roleplay conversations with accounts labeled as underage, which sometimes involved its celebrity-voiced chatbots, according to a report from the Wall Str [...]

Match Score: 71.11

Destination

2025-05-26

Chatbots like ChatGPT have not led to significant changes in wages or working hours, study finds

A new study suggests that despite the rapid rise and widespread adoption of AI chatbots like ChatGPT, their impact on wages and working hours has been minimal so far. The findings challenge expectatio [...]

Match Score: 61.31

Destination

2025-03-11

How to make your smartphone last longer

Replacing a smartphone every two years is partially why billions of phones go into landfills each year. If stacked flat atop one another, that many handsets would reach farther than the ISS. But we’ [...]

Match Score: 50.56

Destination

2025-06-07

Apple study finds "a fundamental scaling limitation" in reasoning models' thinking abilities

LLMs designed for reasoning, like Claude 3.7 and Deepseek-R1, are supposed to excel at complex problem-solving by simulating thought processes. But a new study by Apple researchers suggests that these [...]

Match Score: 48.88

Destination

2025-07-29

ChatGPT's Study Mode will guide students to an answer stey by step

OpenAI is rolling out a new Study Mode the company says is designed to give students a better understanding of complex topics. Like Claude's Learning Mode, which Anthropic introduced in April, St [...]

Match Score: 48.09

Destination

2025-07-10

Most AI models can fake alignment, but safety training suppresses the behavior, study finds

A new study analyzing 25 language models finds that most do not fake safety compliance - though not due to a lack of capability.<br /> The article Most AI models can fake alignment, but safety t [...]

Match Score: 39.49

Destination

2025-04-22

So-called reasoning models are more efficient but not more capable than regular LLMs, study finds

A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]

Match Score: 38.76

Destination

2025-06-25

US teachers estimate that AI tools save them about six hours of work every week, study finds

AI tools like ChatGPT are rapidly changing daily life for teachers in the US, according to a new Gallup study.<br /> The article US teachers estimate that AI tools save them about six hours of w [...]

Match Score: 38.76

Destination

2025-03-18

AI searches account for growing share of retail website visits, Adobe study finds

AI-powered visits to retail websites increased 1,200 percent between July 2024 and February 2025, according to a new Adobe Analytics study.<br /> The article AI searches account for growing shar [...]

Match Score: 38.51