Destination

2025-05-11

Confident user prompts make LLMs more likely to hallucinate

Even small changes to the prompt can have a major impact on the quality of facts: A new benchmark shows how susceptible language models are to brevity statements and exaggerated user inflection.


Many language models are more likely to generate incorrect information when users request concise answers, according to a new benchmark study.


The article Confident user prompts make LLMs more likely to hallucin [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-02

HubSpot’s Dharmesh Shah on AI mastery: Why prompts, context, and experimentation matter most

Presented by HubSpotINBOUND, HubSpot's annual conference for marketing and sales professionals, took place in San Francisco this year, with three days of insights and events across marketing, sal [...]

Match Score: 33.99

Destination

2025-07-10

How exactly did Grok go full 'MechaHitler?'

Earlier this week, Grok, X's built-in chatbot, took a hard turn toward antisemitism following a recent update. Amid unprompted, hateful rhetoric against Jews, it even began referring to itself as [...]

Match Score: 31.39

venturebeat

2025-10-13

Self-improving language models are becoming reality with MIT's updated SEAL technique

Researchers at the Massachusetts Institute of Technology (MIT) are gaining renewed attention for developing and open sourcing a technique that allows large language models (LLMs) — like those underp [...]

Match Score: 29.73

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 29.48

Destination

2025-03-02

Chain of Draft Prompts lets LLMs think cheaper with fewer words

A new method called "Chain of Draft" (CoD) helps AI models complete complex tasks using significantly fewer words and greater speed, while maintaining accuracy levels comparable to existing [...]

Match Score: 27.92

venturebeat

2025-10-07

Google's AI can now surf the web for you, click on buttons, and fill out forms with Gemini 2.5 Computer Use

Some of the largest providers of large language models (LLMs) have sought to move beyond multimodal chatbots — extending their models out into "agents" that can actually take more actions [...]

Match Score: 26.81

Destination

2025-08-05

Google DeepMind's Genie 3 can dynamically alter the state of its simulated worlds

At start of December, Google DeepMind released Genie 2. The Genie family of AI systems are what are known as world models. They're capable of generating images as the user — either a human or, [...]

Match Score: 25.98

Destination

2025-08-18

Warmer-sounding LLMs are more likely to repeat false information and conspiracy theories

A research team at the University of Oxford set out to make language models sound warmer and more empathetic, but ran into some unexpected side effects.<br /> The article Warmer-sounding LLMs ar [...]

Match Score: 25.11

venturebeat

2025-09-30

Meta’s new CWM model learns how code works, not just what it looks like

Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]

Match Score: 24.93