venturebeat
EAGLET boosts AI agent performance on longer-horizon tasks by generating custom plans

2025 was supposed to be the year of "AI agents," according to Nvidia CEO Jensen Huang, and other AI industry personnel. And it has been, in many ways, with numerous leading AI model providers such as OpenAI, Google, and even Chinese competitors like Alibaba releasing fine-tuned AI models or applications designed to focus on a narrow set of tasks, such as web search and report writing. But one big hurdle to a future of highly performant, reliable, AI agents remains: getting them to stay on task when the task extends over a number of steps. Third-party benchmark tests show even the most powerful AI models experience higher failure rates the more steps they take to complete a task, and the longer time they spend on it (exceeding hours). A new academic framework called EAGLET propose [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat
Most enterprises can't stop stage-three AI agent threats, VentureBeat survey finds

A rogue AI agent at Meta passed every identity check and still exposed sensitive data to unauthorized employees in March. Two weeks later, Mercor, a $10 billion AI startup, confirmed a supply-chain br [...]

Match Score: 174.52

venturebeat
RSAC 2026 shipped five agent identity frameworks and left three critical gaps open

“You can deceive, manipulate, and lie. That’s an inherent property of language. It’s a feature, not a flaw,” CrowdStrike CTO Elia Zaitsev told VentureBeat in an exclusive interview at RSA Conf [...]

Match Score: 109.20

venturebeat
Testing autonomous agents (Or: how I learned to stop worrying and embrace chaos)

Look, we've spent the last 18 months building production AI systems, and we'll tell you what keeps us up at night — and it's not whether the model can answer questions. That's ta [...]

Match Score: 91.15

venturebeat
AI agent credentials live in the same box as untrusted code. Two new architectures show where the blast radius actually stops.

Four separate RSAC 2026 keynotes arrived at the same conclusion without coordinating. Microsoft's Vasu Jakkal told attendees that zero trust must extend to AI. Cisco's Jeetu Patel called for [...]

Match Score: 88.84

venturebeat
Meta's rogue AI agent passed every identity check — four gaps in enterprise IAM explain why

A rogue AI agent at Meta took action without approval and exposed sensitive company and user data to employees who were not authorized to access it. Meta confirmed the incident to The Information on M [...]

Match Score: 87.98

venturebeat
Research shows ‘more agents’ isn’t a reliable path to better enterprise AI systems

Researchers at Google and MIT have conducted a comprehensive analysis of agentic systems and the dynamics between the number of agents, coordination structure, model capability, and task properties. W [...]

Match Score: 87.64

venturebeat
Upwork study shows AI agents excel with human partners but fail independently

Artificial intelligence agents powered by the world's most advanced language models routinely fail to complete even straightforward professional tasks on their own, according to groundbreaking re [...]

Match Score: 87.45

venturebeat
Why Meta bought Manus — and what it means for your enterprise AI agent strategy

Facebook and Instagram parent company Meta’s agreement to acquire Manus for more than $2 billion — announced last night by both companies and reported in The Wall Street Journal — marks one of t [...]

Match Score: 81.85

Destination
Sony’s latest Horizon spin-off is an MMORPG for PC and mobile, but not PS5

An MMO based on Sony's Horizon series is on the way. However, Horizon Steel Frontiers is not coming to PS5, at least not initially. It's a mobile-first game that's also coming to PC, in [...]

Match Score: 81.46