Destination
Best multimodal models still can't crack 50 percent on basic visual entity recognition

A new benchmark called WorldVQA tests whether multimodal AI models actually recognize what they see or just make it up. Even the best performer, Gemini 3 Pro, tops out at 47.4 percent when asked for specific details like exact species or product names instead of generic labels. Worse, the models are convinced they're right even when they're wrong.<br /> The article Best multimodal models still can't crack 50 percent on basic visual entity recognition appeared first on The Decoder. [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat
Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time

Microsoft on Tuesday released Phi-4-reasoning-vision-15B, a compact open-weight multimodal AI model that the company says matches or exceeds the performance of systems many times its size — while co [...]

Match Score: 128.77

venturebeat
Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and high-e [...]

Match Score: 104.97

venturebeat
Baidu just dropped an open-source multimodal AI that it claims beats GPT-5 and Gemini

Baidu Inc., China's largest search engine company, released a new artificial intelligence model on Monday that its developers claim outperforms competitors from Google and OpenAI on several visio [...]

Match Score: 100.38

Destination
Lego Black Friday deals are still live: Up to 50 percent off on Star Wars, Disney, Harry Potter and more toy sets for the biggest holiday sale

Were you a Lego set kid or a giant-bucket-of-Legos kid? I was a sets kid all the way — I loved, and still love, the zen feeling of building something incredible a little bit at a time. Also, every t [...]

Match Score: 96.02

Destination
Cyber Monday Lego deals are here: Up to 50 percent off on Star Wars, Disney, Harry Potter and more toy sets

Were you a Lego set kid or a giant-bucket-of-Legos kid? I was a sets kid all the way — I loved, and still love, the zen feeling of building something incredible a little bit at a time. Also, every t [...]

Match Score: 95.98

Destination
Lego Black Friday deals are here: Save up to 60 percent on Star Wars, Disney, Harry Potter and more during the biggest holiday sale

Were you a Lego-set kid or a giant-bucket-of-Legos kid? I was a sets kid all the way — I loved, and still love, the zen feeling of building something incredible a little bit at a time. Also, every t [...]

Match Score: 93.95

Destination
Cyber Monday Lego deals you can still shop today: Up to 50 percent off Star Wars, Disney, Harry Potter and more toy sets

Were you a Lego set kid or a giant-bucket-of-Legos kid? I was a sets kid all the way — I loved, and still love, the zen feeling of building something incredible a little bit at a time. Also, every t [...]

Match Score: 91.67

Destination
Amazon Prime Day deals on SSDs and external hard drives for the last day: Save on Samsung, Crucial, Sandisk and more

If you've been holding an SSD or external HDD for your PC build in a cart, waiting to take advantage of an Amazon Prime Day discount, today is your last chance to grab your hardware at that cheap [...]

Match Score: 89.60

venturebeat
World's largest open-source multimodal dataset delivers 17x training efficiency, unlocking enterprise AI that connects documents, audio and video

AI models are only as good as the data they're trained on. That data generally needs to be labeled, curated and organized before models can learn from it in an effective way.One of the big missin [...]

Match Score: 85.01