zdnet

2025-03-11

This new AI benchmark measures how much models lie

Researchers behind the MASK benchmark found that more knowledge doesn't mean more moral virtue. See which model lies the most. [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-28

IBM's open source Granite 4.0 Nano AI models are small enough to run locally directly in your browser

In an industry where model size is often seen as a proxy for intelligence, IBM is charting a different course — one that values efficiency over enormity, and accessibility over abstraction.The 114-y [...]

Match Score: 58.16

venturebeat

2025-11-07

Terminal-Bench 2.0 launches alongside Harbor, a new framework for testing agents in containers

The developers of Terminal-Bench, a benchmark suite for evaluating the performance of autonomous AI agents on real-world terminal-based tasks, have released version 2.0 alongside Harbor, a new framewo [...]

Match Score: 56.93

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 50.90

Destination

2025-05-29

The best microSD cards in 2025

Most microSD cards are fast enough for boosting storage space and making simple file transfers, but some provide a little more value than others. If you’ve got a device that still accepts microSD ca [...]

Match Score: 44.94

Destination

2025-02-28

The best portable SSDs for 2025

Whether you want to back up the data on your PC, transfer videos from your Mac or offload a few games from your PlayStation 5, a portable SSD is a quick and easy way to expand your storage. These litt [...]

Match Score: 40.79

venturebeat

2025-11-06

Moonshot's Kimi K2 Thinking emerges as leading open source AI, outperforming GPT-5, Claude Sonnet 4.5 on key benchmarks

Even as concern and skepticism grows over U.S. AI startup OpenAI's buildout strategy and high spending commitments, Chinese open source AI providers are escalating their competition and one has e [...]

Match Score: 40.45

venturebeat

2025-10-29

Anthropic scientists hacked Claude’s brain — and it noticed. Here’s why that’s huge

When researchers at Anthropic injected the concept of "betrayal" into their Claude AI model's neural networks and asked if it noticed anything unusual, the system paused before respondi [...]

Match Score: 36.96

venturebeat

2025-11-14

OpenAI experiment finds that sparse models could give AI builders the tools to debug neural networks

OpenAI researchers are experimenting with a new approach to designing neural networks, with the aim of making AI models easier to understand, debug, and govern. Sparse models can provide enterprises w [...]

Match Score: 36.14

Destination

2025-01-23

NVIDIA GeForce RTX 5090 review: Pure AI excess for $2,000

A $2,000 video card for consumers shouldn't exist. The GeForce RTX 5090, like the $1,599 RTX 4090 before it, is more a flex by NVIDIA than anything truly meaningful for most gamers. NVIDIA CEO Je [...]

Match Score: 35.77