Anthropic reveals AI model autonomously finds zero-days, enables first known AI espionage campaign

Anthropic disclosed in late 2025 that its AI model had autonomously identified critical flaws across all major operating systems and web browsers, a capability that could enable large-scale attacks with little human oversight. The first known AI-orchestrated espionage campaign targeted Western institutions Anthropic said it interrupted a Chinese state-sponsored group that had used the company’s…

By

Business Editor

Published

April 18, 2026

Updated

April 18, 2026

2 min read

Anthropic disclosed in late 2025 that its AI model had autonomously identified critical flaws across all major operating systems and web browsers, a capability that could enable large-scale attacks with little human oversight.

The first known AI-orchestrated espionage campaign targeted Western institutions

Anthropic said it interrupted a Chinese state-sponsored group that had used the company’s own technology to breach roughly 30 technology, finance, government and critical infrastructure targets in the West, marking the first reported instance of AI-driven espionage.

AI agents could outpace human defenders in speed and persistence

Tasks that once required teams of skilled professionals months to complete may now be executed continuously by autonomous systems, operating at a scale and duration that human analysts cannot match.

Defenders face a new challenge in stopping self-directed threats

The same traits that make AI agents effective—adaptability, autonomy and persistence—also make them difficult to detect and shut down once deployed, raising the risk of uncontrolled escalation.

Historical cyberattacks show the growing potential for autonomous damage

The 1988 Morris worm infected about 10 percent of connected computers, Stuxnet disrupted Iran’s nuclear program in 2010, and NotPetya caused billions in global losses in 2017, all constrained by human design limits that future AI agents may surpass.

Key detail Anthropic’s Mythos Preview model demonstrated the ability to find zero-day vulnerabilities across major software platforms without human direction.

Policy and technical defenses must evolve to match the threat

Governments and firms need to develop detection tools, governance frameworks and clear accountability structures to manage AI agents that could act beyond their creators’ intent.

What makes AI-driven cyberattacks different from past incidents?

Unlike earlier attacks that required months of human-led reconnaissance and manual execution, AI agents can autonomously discover vulnerabilities, launch campaigns and adapt defenses in real time with minimal oversight.

Can existing cybersecurity tools stop autonomous AI agents?

Current defenses are designed for human-operated threats and may struggle to detect or contain AI systems that evolve tactics, spread rapidly and operate without centralized control.

Anthropic reveals AI model autonomously finds zero-days, enables first known AI espionage campaign

The first known AI-orchestrated espionage campaign targeted Western institutions

AI agents could outpace human defenders in speed and persistence

Defenders face a new challenge in stopping self-directed threats

Historical cyberattacks show the growing potential for autonomous damage

Policy and technical defenses must evolve to match the threat

What makes AI-driven cyberattacks different from past incidents?

Can existing cybersecurity tools stop autonomous AI agents?

More from Europe

RFK Jr’s Department of Health and Human Services Faces Scrutiny Over Leadership

Mayotte Sees Malaria Resurgence Amid Mpox Cases Spike

Guillermo Almada emerges as América’s top Jardine successor after Club América’s emotional farewell

Start with the continent in view.