Destination

2025-04-19

GPT-4o makes beautiful images but fails basic reasoning tests, UCLA study finds

Despite the introduction of the new multimodal model GPT-4o, image generation in ChatGPT is still based on DALL-E 3, but OpenAI currently seems to be working on the image generator.


A new study by the University of California, Los Angeles shows: GPT-4o produces impressive images, but fails at tasks that require real image understanding, contextual thinking and logical reasoning.


The article Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Destination

2025-05-30

ExpressVPN review 2025: Fast speeds and a low learning curve

ExpressVPN is good at its job. It's easy to be skeptical of any service with a knack for self-promotion, but don't let ExpressVPN's hype distract you from the fact that it keeps its fro [...]

Match Score: 111.40

Destination

2025-05-09

Light Phone III review: Minimalism stretched to the point of frustration

Like untold millions of smartphone users, I have a bit of a problem. I’ve been trying, with middling success, to be more mindful about how I use my phone. I’ll often uninstall various social media [...]

Match Score: 100.00

Destination

2025-02-27

OpenAI's new GPT-4.5 model is a better, more natural conversationalist

In what has already been a busy past few days for new model releases, OpenAI is capping off the week with a research preview of GPT-4.5. The company is touting the new system as its largest and best m [...]

Match Score: 93.72

Destination

2025-04-18

Here are the coolest cars at New York International Auto Show 2025

This year marks the 125th anniversary of the New York International Auto Show (NYIAS), and despite concerns over tariffs, there are still a lot of manufacturers here showing off new models including a [...]

Match Score: 82.86

Destination

2025-03-18

Battle of the dirt-cheap tablets: Amazon Fire HD 8 vs. Walmart Onn 8

Apple’s iPads get all the headlines, and with good reason: They’ve long been considered the best tablets for most people. But none of them come cheap. For folks on a tighter budget, I’ve spent t [...]

Match Score: 80.34

Destination

2025-04-22

So-called reasoning models are more efficient but not more capable than regular LLMs, study finds

A new study from Tsinghua University and Shanghai Jiao Tong University examines whether reinforcement learning with verifiable rewards (RLVR) helps large language models reason better—or simply make [...]

Match Score: 73.47

Destination

2025-01-08

A closer look at the slick Honda 0 SUV and Saloon prototypes at CES 2025

Last year, Honda teased its first two homegrown EVs with the Series 0 Saloon and Space-Hub. But now at CES 2025, those vehicles are getting one step closer to production by graduating from concepts to [...]

Match Score: 70.50

Destination

2025-05-27

How Phi-4-Reasoning Redefines AI Reasoning by Challenging “Bigger is Better” Myth

Microsoft's recent release of Phi-4-reasoning challenges a key assumption in building artificial intelligence systems capable of reasoning. Since the introduction of chain-of-thought reasoning in [...]

Match Score: 69.23

Destination

2025-02-28

Engadget Podcast: iPhone 16e review and Amazon's AI-powered Alexa+

The keyword for the iPhone 16e seems to be "compromise." In this episode, Devindra chats with Cherlynn about her iPhone 16e review and try to figure out who this phone is actually for. Also, [...]

Match Score: 67.76