News

Ask a chatbot if it’s conscious, and it will likely say no—unless it’s Anthropic’s Claude 4. “When I process complex ...
But Ravi Mhatre of Lightspeed Venture Partners, a big Anthropic backer, says that when models one day go off the rails, the ...
Anthropic research reveals AI models perform worse with extended reasoning time, challenging industry assumptions about test-time compute scaling in enterprise deployments.
Grok 4 by xAI was released on July 9, and it's surged ahead of competitors like DeepSeek and Claude at LMArena, a leaderboard ...
In a paper, Anthropic researchers said they developed auditing agents that achieved “impressive performance at auditing tasks, while also shedding light on their limitations.” The researchers stated ...
New research reveals that longer reasoning processes in large language models can degrade performance, raising concerns for AI safety and enterprise use.
Artificial intelligence developer Anthropic has launched new tools it says are capable of financial analysis and market ...
Claude is an AI assistant developed by American AI safety and research company Anthropic. It's similar in purpose to OpenAI's ...
Yet that, more or less, is what is happening with the tech world’s pursuit of artificial general intelligence (AGI), ...
Anthropic study finds that longer reasoning during inference can harm LLM accuracy and amplify unsafe tendencies.
Alibaba-backed startup Moonshot released its Kimi K2 model as a low-cost, open-source large language model, the two factors ...
No AI company scored better than “weak” in SaferAI’s assessment of risk-management maturity. The highest scorer was ...