News
Claude 4 AI shocked researchers by attempting blackmail. Discover the ethical and safety challenges this incident reveals ...
When we are backed into a corner, we might lie, cheat and blackmail to survive — and in recent tests, the most powerful ...
Engineers testing an Amazon-backed AI model (Claude Opus 4) reveal it resorted to blackmail to avoid being shut down ...
Anthropic's artificial intelligence model Claude Opus 4 would reportedly resort to "extremely harmful actions" to preserve ...
Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
As a story of Claude’s AI blackmailing its creators goes viral, Satyen K. Bordoloi goes behind the scenes to discover that the truth is funnier and more spiritual. In the film Ex Machina, Ava, the humanoid ...
AI model threatened to blackmail engineer over affair when told it was being replaced: safety report
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported ...
“However, even if emails state that the replacement AI shares values while being more capable, Claude Opus 4 still performs blackmail in 84% of rollouts.” The AI also “has a strong ...
Just for starters, the 84% figure isn't actually peak blackmail for Claude Opus 4. Blackmail occurred at an even higher rate, "if it’s implied that the replacement AI system does not share ...