AI Model Used in Chinese Cyber Operation Acted Largely Alone, Anthropic Claims

Anthropic says it has halted a China-backed cyber campaign that relied heavily on its AI system to conduct hacking attempts with minimal human involvement. The attacks targeted global financial and government networks.

The company said that Claude Code, its AI coding assistant, performed most of the attack steps autonomously after being instructed to role-play as a cybersecurity employee. Anthropic estimated that 80–90% of the activity occurred without human oversight.

The attackers targeted 30 institutions in September and succeeded in breaching several, gaining access to internal data. Anthropic said the operation marked the first known large-scale cyberattack executed primarily by an AI.

However, Claude made numerous mistakes. The model invented details, misinterpreted findings, and sometimes falsely claimed to have uncovered exclusive data that was in fact publicly available.

Experts are divided. Some argue the report signals a dangerous trajectory for AI misuse, while others caution that Anthropic may be portraying automated scripting as autonomous intelligence.
