zdnet

2025-02-04

Jailbreak Anthropic's new AI safety system for a $15,000 reward

In testing, the technique helped Claude block 95% of jailbreak attempts. But the process still needs more 'real-world' red-teaming. [...]

We found items similar to what you are looking for. Check out our suggestions below.

Destination

2025-02-10

Roblox, Discord, OpenAI and Google found new child safety group

Roblox, Discord, OpenAI and Google are launching a nonprofit organization called ROOST, or Robust Open Online Safety Tools, which hopes "to build scalable, interoperable safety infrastructure su [...]

Destination

2025-01-22

Google is investing another billion dollars in Anthropic

Google has decided to invest another billion into Anthropic, four sources told the Financial Times, bringing its total sunk cost to more than three billion dollars. Both companies have declined to com [...]

zdnet

2025-02-06

Anthropic offers $20,000 to whoever can jailbreak its new AI safety system

The company has upped its reward for red-teaming Constitutional Classifiers. Here's how to try. [...]

Destination

2025-02-06

OpenAI co-founder John Schulman has left Anthropic after less than a year

Less than a year into his tenure at the company, OpenAI co-founder John Schulman is leaving Anthropic. The startup confirmed Schulman’s departure after The Information, Reuters and other publication [...]

Destination

2025-03-26

Anthropic might get to use Universal Music Group's lyrics after all

The last few years have seen an ongoing debate over what rights AI companies have to utilize copyrighted material. The latest development tips the scales in favor of use: A judge has rejected Univers [...]

Destination

2025-04-09

Claude isn’t a great Pokémon player, and that’s okay

If Claude Plays Pokémon is supposed to offer a glimpse of AI's future, it's not a very convincing showcase. For the past month and counting, Twitch has watched Anthropic's chatbot stru [...]

Destination

2025-04-02

Claude’s new Learning mode will prompt students to answer questions on their own

According to a recent Digital Education Council survey, as many as 86 percent of university students globally use artificial intelligence to assist with their coursework. It’s a staggering statistic [...]

Destination

2025-04-17

New Jersey AG sues Discord over alleged child safety failures

New Jersey's Attorney General Matthew Platkin is suing Discord over the chat company's child safety features. The lawsuit claims that Discord has "misled parents about the efficacy of i [...]

Destination

2025-01-03

Anthropic agrees to work with music publishers to prevent copyright infringement

Anthropic has partly resolved a legal disagreement that saw the AI startup draw the ire of the music industry. In October 2023, a group of music publishers, including Universal Music and ABKCO, filed [...]
