Destination

2025-06-15

Salesforce's CRM benchmark finds AI agents struggle in real-world business scenarios


Salesforce's new CRMArena-Pro benchmark reveals major challenges for AI agents in business contexts. Even top models like Gemini 2.5 Pro achieve only a 58 percent success rate on single-turn tasks. In longer, multi-turn dialogs, performance drops to 35 percent.


The article "Salesforce's CRM benchmark finds AI agents struggle in real-world business scenarios" appeared first on Discover Copy.

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-13

Salesforce bets on AI 'agents' to fix what it calls a $7 billion problem in enterprise software

As 50,000 attendees descend on Salesforce's Dreamforce conference this week, the enterprise software giant is making its most aggressive bet yet on artificial intelligence agents, positioning its [...]

Match Score: 296.48

venturebeat

2025-11-21

Salesforce Agentforce Observability lets you watch your AI agents think in real time

Salesforce launched a suite of monitoring tools on Thursday designed to solve what has become one of the thorniest problems in corporate artificial intelligence: Once companies deploy AI agents to han [...]

Match Score: 206.24

venturebeat

2025-10-02

Salesforce launches AI 'trust layer' to tackle enterprise deployment failures plaguing 80% of projects

Salesforce Inc. is expanding its artificial intelligence platform with new data management and governance capabilities, aiming to address what the company says is a crisis in enterprise AI adoption wh [...]

Match Score: 191.60

venturebeat

2025-11-20

Tome's founders ditch viral presentation app with 20M users to build AI-native CRM Lightfield

Lightfield, a customer relationship management platform built entirely around artificial intelligence, officially launched to the public this week after a year of quiet development — a bold pivot by [...]

Match Score: 191.28

venturebeat

2025-11-13

Upwork study shows AI agents excel with human partners but fail independently

Artificial intelligence agents powered by the world's most advanced language models routinely fail to complete even straightforward professional tasks on their own, according to groundbreaking re [...]

Match Score: 118.52

venturebeat

2025-11-19

The Google Search of AI agents? Fetch launches ASI:One and Business tier for new era of non-human web

Fetch AI, a startup founded and led by former DeepMind founding investor Humayun Sheikh, today announced the release of three interconnected products designed to provide the trust, coordination, and [...]

Match Score: 93.73

venturebeat

2025-10-29

The missing data link in enterprise AI: Why agents need streaming context, not just better prompts

Enterprise AI agents today face a fundamental timing problem: They can't easily act on critical business events because they aren't always aware of them in real time. The challenge is infrast [...]

Match Score: 85.29

venturebeat

2025-11-07

Terminal-Bench 2.0 launches alongside Harbor, a new framework for testing agents in containers

The developers of Terminal-Bench, a benchmark suite for evaluating the performance of autonomous AI agents on real-world terminal-based tasks, have released version 2.0 alongside Harbor, a new framewo [...]

Match Score: 82.92

venturebeat

2025-10-12

We keep talking about AI agents, but do we ever know what they are?

Imagine you do two things on a Monday morning. First, you ask a chatbot to summarize your new emails. Next, you ask an AI tool to figure out why your top competitor grew so fast last quarter. The AI si [...]

Match Score: 79.76