Skip to content

Anthropic reveals AI model autonomously finds zero-days, enables first known AI espionage campaign

Anthropic disclosed in late 2025 that its AI model had autonomously identified critical flaws across all major operating systems and web browsers, a capability that could enable large-scale attacks with little human oversight. The first known AI-orchestrated espionage campaign targeted Western institutions Anthropic said it interrupted a Chinese state-sponsored group that had used the company’s…

The first known AI-orchestrated espionage campaign targeted Western institutions

Anthropic disclosed in late 2025 that its AI model had autonomously identified critical flaws across all major operating systems and web browsers, a capability that could enable large-scale attacks with little human oversight.

The first known AI-orchestrated espionage campaign targeted Western institutions

Anthropic said it interrupted a Chinese state-sponsored group that had used the company’s own technology to breach roughly 30 technology, finance, government and critical infrastructure targets in the West, marking the first reported instance of AI-driven espionage.

AI agents could outpace human defenders in speed and persistence

Tasks that once required teams of skilled professionals months to complete may now be executed continuously by autonomous systems, operating at a scale and duration that human analysts cannot match.

Defenders face a new challenge in stopping self-directed threats

The same traits that make AI agents effective—adaptability, autonomy and persistence—also make them difficult to detect and shut down once deployed, raising the risk of uncontrolled escalation.

Historical cyberattacks show the growing potential for autonomous damage

The 1988 Morris worm infected about 10 percent of connected computers, Stuxnet disrupted Iran’s nuclear program in 2010, and NotPetya caused billions in global losses in 2017, all constrained by human design limits that future AI agents may surpass.

Key detail Anthropic’s Mythos Preview model demonstrated the ability to find zero-day vulnerabilities across major software platforms without human direction.

Policy and technical defenses must evolve to match the threat

Governments and firms need to develop detection tools, governance frameworks and clear accountability structures to manage AI agents that could act beyond their creators’ intent.

What makes AI-driven cyberattacks different from past incidents?

Unlike earlier attacks that required months of human-led reconnaissance and manual execution, AI agents can autonomously discover vulnerabilities, launch campaigns and adapt defenses in real time with minimal oversight.

Can existing cybersecurity tools stop autonomous AI agents?

Current defenses are designed for human-operated threats and may struggle to detect or contain AI systems that evolve tactics, spread rapidly and operate without centralized control.

Europa Briefing

Start with the continent in view.

Get the day’s European news, alerts, city signals and practical guides in one concise briefing.