venturebeat

2025-10-07

Has this stealth startup finally cracked the code on enterprise AI agent reliability? Meet AUI's Apollo-1

For more than a decade, conversational AI has promised human-like assistants that can do more than chat. Yet even as large language models (LLMs) like ChatGPT, Gemini, and Claude learn to reason, explain, and code, one critical category of interaction remains largely unsolved — reliably completing tasks for people outside of chat.

Even the best AI models score only in the 30th percentile on Terminal-Bench Hard, a third-party benchmark designed to evaluate the performance of AI agents on completing a variety of browser-based tasks, far below the reliability demanded by most enterprises and users. And task-specific benchmarks like [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Destination

2025-03-26

Noble Audio FoKus Apollo review: The high price of pristine audio

I don’t review a lot of $650 headphones. That’s because most audio companies sell their top-of-the-line gear around $300-$400. Noble Audio isn’t like most companies. The FoKus Rex5 earbuds, for [...]

Match Score: 228.68

venturebeat

2025-10-01

GitHub leads the enterprise, Claude leads the pack—Cursor’s speed can’t close

In the race to deploy generative AI for coding, the fastest tools are not winning enterprise deals. A new VentureBeat analysis, combining a comprehensive survey of 86 engineering teams with our own ha [...]

Match Score: 165.54

venturebeat

2025-10-07

IBM claims 45% productivity gains with Project Bob, its multi-model IDE that orchestrates LLMs with full repository context

For many enterprises, there continue to be barriers to fully adopting and benefiting from agentic AI.IBM is betting the blocker isn't building AI agents but governing them in production.At its Te [...]

Match Score: 101.25

venturebeat

2025-10-06

OpenAI unveils AgentKit that lets developers drag and drop to build AI agents

OpenAI launched an agent builder that the company hopes will eliminate fragmented tools and make it easier for enterprises to utilize OpenAI’s system to create agents. AgentKit, announced during Ope [...]

Match Score: 85.50

venturebeat

2025-10-01

Microsoft retires AutoGen and debuts Agent Framework to unify and govern enterprise AI agents

Microsoft’s multi-agent framework, AutoGen, acts as the backbone for many enterprise projects, particularly with the release of AutoGen v0.4 in January. However, the company aims to harmonize all o [...]

Match Score: 83.17

venturebeat

2025-10-02

Salesforce launches AI 'trust layer' to tackle enterprise deployment failures plaguing 80% of projects

Salesforce Inc. is expanding its artificial intelligence platform with new data management and governance capabilities, aiming to address what the company says is a crisis in enterprise AI adoption wh [...]

Match Score: 73.01

Destination

2025-05-14

Baidu could start testing its Apollo Go robotaxi service in Europe this year

Baidu's Apollo Go robotaxi service is making its debut in Europe later this year, according to The Wall Street Journal. The Chinese company is reportedly negotiating with Switzerland’s PostAuto [...]

Match Score: 66.55

venturebeat

2025-10-01

Slack is giving AI unprecedented access to your workplace conversations

Slack is fundamentally reshaping how artificial intelligence agents access and use enterprise data, launching new platform capabilities that allow developers to tap directly into the rich conversation [...]

Match Score: 59.77

venturebeat

2025-09-30

Meta’s new CWM model learns how code works, not just what it looks like

Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]

Match Score: 54.79