2025-05-13

OpenAI says its latest models outperform doctors in medical benchmark


OpenAI has released a new benchmark for testing AI systems in healthcare. Called HealthBench, it's designed to evaluate how well language models handle realistic medical conversations. According to OpenAI, its latest models outperform doctors on the test.


The article OpenAI says its latest models outperform doctors in medical benchmark appeared first on THE DECODER.

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

2025-08-05

OpenAI's first new open-weight LLMs in six years are here

For the first time since GPT-2 in 2019, OpenAI is releasing new open-weight large language models. It's a major milestone for a company that has increasingly been accused of forgoing its original [...]

Match Score: 71.78

2025-05-29

The best microSD cards in 2025

Most microSD cards are fast enough for boosting storage space and making simple file transfers, but some provide a little more value than others. If you’ve got a device that still accepts microSD ca [...]

Match Score: 69.24

2025-08-07

GPT-5 is here and it's free for everyone

A couple of days after announcing its first open-weight models in six years, OpenAI is releasing the long-awaited GPT-5. What's more, you can start using it today, even if you're a free user [...]

Match Score: 68.46

2025-01-29

OpenAI suddenly thinks intellectual property theft is not cool, actually, amid DeepSeek’s rise

OpenAI claims that Chinese startups are persistently trying to copy the technology of American AI companies. Aligned with that, OpenAI says it and partner Microsoft have been banning accounts suspecte [...]

Match Score: 64.49

2025-01-31

OpenAI's o3-mini is here and available to all users

OpenAI’s latest machine learning model has arrived. On Friday, the company released o3-mini and it's available to try now. What's more, for the first time OpenAI is making one of its " [...]

Match Score: 64.31

2025-07-20

New ARC-AGI-3 benchmark shows that humans still outperform LLMs at pretty basic thinking

ARC-AGI-3 aims to test how well AI systems can handle brand new problems. While people breeze through the challenges, the latest AI models still come up short.

Match Score: 64.30

2025-02-27

OpenAI's new GPT-4.5 model is a better, more natural conversationalist

In what has already been a busy past few days for new model releases, OpenAI is capping off the week with a research preview of GPT-4.5. The company is touting the new system as its largest and best m [...]

Match Score: 60.65

2025-05-01

Microsoft's Phi-4-reasoning models outperform larger models and run on your laptop or phone

Microsoft is expanding its Phi series of compact language models with three new variants designed for advanced reasoning tasks.

Match Score: 60.39

2025-05-06

OpenAI’s new for-profit plan leaves many unanswered questions

OpenAI has abandoned its controversial restructuring plan. In a dramatic reversal, the company said Monday it would no longer try to separate control of its for-profit arm from the non-profit board th [...]

Match Score: 59.12