Destination

2025-03-01

Training AI on bad code makes it admire Hitler and want to harm humans, study finds


AI models trained only to write insecure code develop widespread misalignment - they recommend illegal activities and claim humans should be enslaved by AI.


The article Training AI on bad code makes it admire Hitler and want to harm humans, study finds appeared first on THE DECODER.

[...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-09-30

Meta’s new CWM model learns how code works, not just what it looks like

Meta’s AI research team has released a new large language model (LLM) for coding that enhances code understanding by learning not only what code looks like, but also what it does when executed. The [...]

Match Score: 78.83

venturebeat

2025-10-26

From human clicks to machine intent: Preparing the web for agentic AI

For three decades, the web has been designed with one audience in mind: People. Pages are optimized for human eyes, clicks and intuition. But as AI-driven agents begin to browse on our behalf, the hum [...]

Match Score: 71.41

Destination

2025-07-10

How exactly did Grok go full 'MechaHitler?'

Earlier this week, Grok, X's built-in chatbot, took a hard turn toward antisemitism following a recent update. Amid unprompted, hateful rhetoric against Jews, it even began referring to itself as [...]

Match Score: 62.92

venturebeat

2025-10-27

Google Cloud takes aim at CoreWeave and AWS with managed Slurm for enterprise-scale AI training

Some enterprises are best served by fine-tuning large models to their needs, but a number of companies plan to build their own models, a project that would require access to GPUs. Google Cloud wants [...]

Match Score: 59.72

venturebeat

2025-10-09

Nvidia researchers boost LLMs reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 57.87

Destination

2025-08-29

Meta is re-training its AI so it won't discuss self-harm or have romantic conversations with teens

Meta is re-training its AI and adding new protections to keep teen users from discussing harmful topics with the company's chatbots. The company says it's adding new "guardrails as an e [...]

Match Score: 55.82

Destination

2025-07-09

Grok sure seems antisemitic after its recent update

Last Friday, Elon Musk said that X's built-in chatbot had been "significantly" improved. "You should notice a difference when you ask Grok questions," Musk said on X. As it tu [...]

Match Score: 53.75

blogspot

2023-01-25

Top 10 AI Tools in 2023 That Will Make Your Life Easier

 In this article, we explore the top 10 AI tools that are<br /> driving innovation and efficiency in various industries. These tools are<br /> designed to automate repetitive tasks, impro [...]

Match Score: 51.79

venturebeat

2025-10-01

Thinking Machines' first official product is here: meet Tinker, an API for distributed LLM fine-tuning

Thinking Machines, the AI startup founded earlier this year by former OpenAI CTO Mira Murati, has launched its first product: Tinker, a Python-based API designed to make large language model (LLM) fin [...]

Match Score: 49.87