venturebeat
Qwen3-Max Thinking beats Gemini 3 Pro and GPT-5.2 on Humanity's Last Exam (with search)

Chinese AI and tech firms continue to impress with their development of cutting-edge, state-of-the-art AI language models.Today, the one drawing eyeballs is Alibaba Cloud's Qwen Team of AI resear [...]

Match Score: 0.69

venturebeat
Gemini 3 Pro scores 69% trust in blinded testing up from 16% for Gemini 2.5: The case for evaluating AI on real-world trust, not academic benchmarks

Just a few short weeks ago, Google debuted its Gemini 3 model, claiming it scored a leadership position in multiple AI benchmarks. But the challenge with vendor-provided benchmarks is that they are ju [...]

Match Score: 0.69

Destination
François Chollet on the end of scaling, ARC-3 and his path to AGI

AI researcher François Chollet argues that the era of simply scaling up models to achieve intelligence has run its course. Instead, he sees the field moving toward systems that can adapt to new probl [...]

Match Score: 0.69

venturebeat
Enterprise AI coding grows teeth: GPT‑5.2‑Codex weaves security into large-scale software refactors

With the recent release of GPT 5.2, OpenAI updated other related models, including its popular coding model Codex, bringing more agentic use cases to its fold. GPT-5.2-Codex, which OpenAI called in a [...]

Match Score: 0.69

Destination
Five Eyes "cannot replace US intel in Ukraine", claims former US Cyber Command Chief

The US took a ‘step back’ from intelligence sharing with Ukraine, so can Five Eyes step up? [...]

Match Score: 0.69

thenextweb
OpenAI is building a phone that would make apps obsolete. The supply chain says it might actually ship.

OpenAI is developing a smartphone built around AI agents rather than apps, with Qualcomm and MediaTek jointly designing the custom processor and Luxshare Precision Industry co-designing and exclusivel [...]

Match Score: 0.69

venturebeat
OpenAI's GPT-5.2 is here: what enterprises need to know

The rumors were true, and the "Code Red" is over: OpenAI today announced the release of its new frontier large language model (LLM) family: GPT-5.2.It comes at a pivotal moment for the AI pi [...]

Match Score: 0.69

Destination
Threads users still barely click links

Two years in, Threads is starting to look more and more like the most viable challenger to X. It passed 350 million monthly users earlier this year and Mark Zuckerberg has predicted it could be Meta&# [...]

Match Score: 0.69

Destination
Engadget Podcast: A taste of iOS 26, iPadOS 26, macOS 26 and more

We’ve been playing around with the developer betas of Apple’s latest software, and now that we’ve spent time with iOS 26, Liquid Glass and more on actual devices, we have thoughts. From represen [...]

Match Score: 0.69