Destination

2025-05-26

Google releases open-source LMEval to benchmark language and multimodal models


LMEval aims to standardize benchmarks and streamline safety analysis for large language and multimodal models.


The article Google releases open-source LMEval to benchmark language and multimodal models appeared first on THE DECODER.

[...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 130.87

Destination

2025-08-05

OpenAI's first new open-weight LLMs in six years are here

For the first time since GPT-2 in 2019, OpenAI is releasing new open-weight large language models. It's a major milestone for a company that has increasingly been accused of forgoing its original [...]

Match Score: 87.63

Destination

2025-07-30

Is Mark Zuckerberg flip flopping on open source AI?

Earlier today, Mark Zuckerberg shared a rambling memo outlining his vision to build AI "superintelligence." In the memo, Zuckerberg hinted that the pursuit of more powerful AI might require [...]

Match Score: 73.13

venturebeat

2025-10-08

Samsung AI researcher's new, open reasoning model TRM outperforms models 10,000X larger — on specific problems

The trend of AI researchers developing new, small open source generative models that outperform far larger, proprietary peers continued this week with yet another staggering advancement.Alexia Jolicoe [...]

Match Score: 68.60

Destination

2025-10-04

Alibaba releases Qwen3 compact open source multimodal models

Alibaba's Qwen group has released two new small-scale multimodal models, Qwen3-VL-30B-A3B-Instruct and Qwen3-VL-30B-A3B-Thinking.<br /> The article Alibaba releases Qwen3 compact open sourc [...]

Match Score: 64.91

venturebeat

2025-09-29

DeepSeek's new V3.2-Exp model cuts API pricing in half to less than 3 cents per 1M input tokens

DeepSeek continues to push the frontier of generative AI...in this case, in terms of affordability.The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that [...]

Match Score: 59.53

Destination

2025-05-20

Google I/O 2025 recap: AI updates, Android XR, Google Beam and everything else announced at the annual keynote

Today is one of the most important days on the tech calendar as Google kicked off its I/O developer event with its annual keynote. As ever, the company had many updates for a wide range of products to [...]

Match Score: 59.06

Destination

2025-08-19

10 Pixels in, the purpose of a Google-made smartphone remains the same

Google didn't need to make its own smartphone. Even though the company spent several years having other manufacturers build phones it could slap its "Nexus" branding on, selling hardwar [...]

Match Score: 56.61

venturebeat

2025-10-01

Thinking Machines' first official product is here: meet Tinker, an API for distributed LLM fine-tuning

Thinking Machines, the AI startup founded earlier this year by former OpenAI CTO Mira Murati, has launched its first product: Tinker, a Python-based API designed to make large language model (LLM) fin [...]

Match Score: 55.46