Destination

2025-09-18

DeepSeek says training its R1 model cost just $294,000


According to a study in Nature, Chinese AI company DeepSeek trained its R1 language model for only $294,000.


This article first appeared on THE DECODER.

[...]


venturebeat

2025-12-01

DeepSeek just dropped two insanely powerful AI models that rival GPT-5 and they're totally free

Chinese artificial intelligence startup DeepSeek released two powerful new AI models on Sunday that the company claims match or exceed the capabilities of OpenAI's GPT-5 and Google's Gemini- [...]


venturebeat

2025-10-21

DeepSeek drops open-source model that compresses text 10x through images, defying conventions

DeepSeek, the Chinese artificial intelligence research company that has repeatedly challenged assumptions about AI development costs, has released a new model that fundamentally reimagines how large l [...]


venturebeat

2025-09-29

DeepSeek's new V3.2-Exp model cuts API pricing in half to less than 3 cents per 1M input tokens

DeepSeek continues to push the frontier of generative AI... in this case, in terms of affordability. The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that [...]


venturebeat

2025-11-10

Baseten takes on hyperscalers with new AI training platform that lets you own your model weights

Baseten, the AI infrastructure company recently valued at $2.15 billion, is making its most significant product pivot yet: a full-scale push into model training that could reshape how enterprises wean [...]


venturebeat

2025-11-24

DeepSeek injects 50% more security bugs when prompted with Chinese political triggers

China's DeepSeek-R1 LLM generates up to 50% more insecure code when prompted with politically sensitive inputs such as "Falun Gong," "Uyghurs," or "Tibet," according [...]


venturebeat

2025-11-04

Attention ISN'T all you need?! New Qwen3 variant Brumby-14B-Base leverages Power Retention technique

When the transformer architecture was introduced in 2017 in the now seminal Google paper "Attention Is All You Need," it became an instant cornerstone of modern artificial intelligence. Ever [...]


Destination

2025-01-27

China’s DeepSeek AI assistant becomes top free iPhone app as US tech stocks take a hit

Chinese AI assistant DeepSeek has become the top-rated free app on Apple's App Store in the US and elsewhere, beating out ChatGPT and other rivals. It's powered by the open-source DeepSeek V [...]


Destination

2025-02-06

US lawmakers want DeepSeek banned from government devices

Two US Congress members plan to introduce bipartisan legislation to ban China’s DeepSeek AI chatbot from government devices. The bill’s announcement came after a security expert said DeepSeek not [...]


venturebeat

2025-12-02

Arcee aims to reboot U.S. open source AI with new Trinity models released under Apache 2.0

For much of 2025, the frontier of open-weight language models has been defined not in Silicon Valley or New York City, but in Beijing and Hangzhou. Chinese research labs including Alibaba's Qwen, [...]
