2025-07-02
A new open platform called SciArena is now available for evaluating large language models (LLMs) on scientific literature tasks based on human preferences. Early results reveal clear performance gaps between different models.
The article SciArena lets scientists compare LLMs on real research questions appeared first on THE DECODER.
[...]2025-09-30
Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]
2025-09-20
According to a report by The Washington Post, scientists with the Environmental Protection Agency's Office of Water were told by "political appointees" to stop work on studies that were [...]
2025-10-02
IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]
2025-02-03
There’s no two ways about it, there’s a newfound sense of urgency at OpenAI. Two days after releasing o3-mini to the world, the company made a surprise announcement on Sunday evening, revealing De [...]
2025-03-13
After being one of the first companies to roll out a Deep Research feature at the end of last year, Google is now making that same tool available to everyone. Starting today, Gemini users can try Deep [...]
2025-06-02
As large language models (LLMs) rapidly evolve, so does their promise as powerful research assistants. Increasingly, they’re not just answering simple factual questions—they’re tackling “deep [...]
2025-05-11
One of the ultimate goals of medieval alchemy has been realized, but only for a fraction of a second. Scientists with the European Organization for Nuclear Research, better known as CERN, were able to [...]