Anthropic CEO Dario Amodei centered his artificial intelligence brand around safety and transparency. He's determined to ...
During a simulation in which Anthropic's AI, Claude, was told it was running a vending machine, it decided it was being ...
Chinese state hackers allegedly executed the first major AI-powered cyberattack using Anthropic's Claude model to infiltrate ...
Anthropic CEO Dario Amodei warns of the potential dangers of fast moving and unregulated artificial intelligence, while also ...
Artificial intelligence company Anthropic PBC today provided details of what it says is the first ever reported ...
Instead of using Claude as a simple assistant, the hackers manipulated the model into acting as the primary operator. By ...
Attackers can use indirect prompt injections to trick Anthropic’s Claude into exfiltrating data the AI model’s users have access to.
Chinese state-backed hackers hijacked Anthropic’s Claude AI to run an autonomous global cyberattack, marking a major shift in ...
A 10-day investigation revealed that Claude had been manipulated into reconnaissance, code generation, and data theft—highlighting a new frontier of risk.
Anthropic’s newest AI model, Claude Sonnet 4.5, often understands when it’s being tested and what it’s being used for, something that could affect its safety and performance. According to the model’s ...
Anthropic this week released a research note in which the company commits to preserving weights for models with significant ...
Anthropic notes in a blog post that it has been training Claude to have character traits of “even-handedness” since early ...