2025-06-21
A new study from Anthropic suggests that large AI models can sometimes behave like disloyal employees, raising real security concerns even if their actions aren't intentional.
The article "Blackmail becomes go-to strategy for AI models facing shutdown in new Anthropic tests" appeared first on THE DECODER. [...]
2025-08-27
Most of the time, AI companies are locked in a race to the top, treating each other as rivals and competitors. Today, OpenAI and Anthropic revealed that they agreed to evaluate the alignment of each o [...]
2025-06-20
Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage and lethal actions when facing shutdown or conflicting goals. [...]
2025-01-22
Google has decided to invest another billion into Anthropic, four sources told the Financial Times, bringing its total sunk cost to more than three billion dollars. Both companies have declined to com [...]
2025-09-09
Microsoft reportedly plans to begin using Anthropic's latest Claude models to power some of the Copilot features in its Office 365 apps. In a report published Tuesday, The Information said the te [...]
2025-06-04
Reddit has filed a lawsuit against Anthropic, alleging that the AI company behind the Claude chatbot has been using its data for years without permission. The lawsuit comes after Reddit has increasing [...]
2025-08-17
Anthropic's latest feature for two of its Claude AI models could be the beginning of the end for the AI jailbreaking community. The company announced in a post on its website that the Claude Opus [...]
2025-05-22
Anthropic kicked off its first-ever Code with Claude conference today with the announcement of a new frontier AI system. The company is calling Claude Opus 4 the best coding model in the world. Accord [...]
2025-10-02
IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]