venturebeat

2025-10-07

Has this stealth startup finally cracked the code on enterprise AI agent reliability? Meet AUI's Apollo-1

For more than a decade, conversational AI has promised human-like assistants that can do more than chat. Yet even as large language models (LLMs) like ChatGPT, Gemini, and Claude learn to reason, explain, and code, one critical category of interaction remains largely unsolved — reliably completing tasks for people outside of chat.

Even the best AI models score only in the 30th percentile on Terminal-Bench Hard, a third-party benchmark designed to evaluate the performance of AI agents on completing a variety of browser-based tasks, far below the reliability demanded by most enterprises and users. And task-specific benchmarks like [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-11-03

The beginning of the end of the transformer era? Neuro-symbolic AI startup AUI announces new funding at $750M valuation

The buzzed-about but still stealthy New York City startup Augmented Intelligence Inc (AUI), which seeks to go beyond the popular "transformer" architecture used by most of today's LLMs [...]

Match Score: 321.98

Destination

2025-03-26

Noble Audio FoKus Apollo review: The high price of pristine audio

I don’t review a lot of $650 headphones. That’s because most audio companies sell their top-of-the-line gear around $300-$400. Noble Audio isn’t like most companies. The FoKus Rex5 earbuds, for [...]

Match Score: 200.39

venturebeat

2025-10-01

GitHub leads the enterprise, Claude leads the pack—Cursor’s speed can’t close

In the race to deploy generative AI for coding, the fastest tools are not winning enterprise deals. A new VentureBeat analysis, combining a comprehensive survey of 86 engineering teams with our own ha [...]

Match Score: 140.53

venturebeat

2025-12-08

Anthropic's Claude Code can now read your Slack messages and write code for you

Anthropic on Monday launched a beta integration that connects its fast-growing Claude Code programming agent directly to Slack, allowing software engineers to delegate coding tasks without leaving the [...]

Match Score: 114.49

venturebeat

2025-11-18

Writer's AI agents can actually do your work—not just chat about it

Writer, a San Francisco-based artificial intelligence startup, is launching a unified AI agent platform designed to let any employee automate complex business workflows without writing code — a capa [...]

Match Score: 104.69

venturebeat

2025-10-12

We keep talking about AI agents, but do we ever know what they are?

Imagine you do two things on a Monday morning.First, you ask a chatbot to summarize your new emails. Next, you ask an AI tool to figure out why your top competitor grew so fast last quarter. The AI si [...]

Match Score: 104.04

venturebeat

2025-10-28

GitHub's Agent HQ aims to solve enterprises' biggest AI coding problem: Too many agents, no central control

GitHub is making a bold bet that enterprises don't need another proprietary coding agent. They need a way to manage all of them.At its Universe 2025 conference, the Microsoft-owned developer plat [...]

Match Score: 99.87

venturebeat

2025-12-04

Anthropic vs. OpenAI red teaming methods reveal different security priorities for enterprise AI

Model providers want to prove the security and robustness of their models, releasing system cards and conducting red-team exercises with each new release. But it can be difficult for enterprises to pa [...]

Match Score: 92.21

venturebeat

2025-12-02

Amazon's new AI can code for days without human help. What does that mean for software engineers?

Amazon Web Services on Tuesday announced a new class of artificial intelligence systems called "frontier agents" that can work autonomously for hours or even days without human intervention, [...]

Match Score: 87.58