Google Thwarts First Confirmed AI-Generated Zero-Day Exploit — and Issues a Dire Warning

Google's Threat Intelligence Group says it disrupted a criminal hacker group that used an AI model to discover and weaponize a zero-day vulnerability capable of bypassing two-factor authentication at mass scale. Security experts call it the moment the industry has long feared: AI-assisted cyberattacks have crossed from theoretical to operational.

Cybersecurity researchers have warned for years that AI would eventually become a weapon in the hands of hackers — not just a defensive tool. On Monday, Google confirmed that day has arrived.

Google’s Threat Intelligence Group disclosed that it had disrupted a financially motivated criminal group that used an AI model to identify a zero-day vulnerability in production software and then write functional exploit code to weaponize it. The target: a mechanism that could be abused to bypass two-factor authentication at scale. The group had already developed its attack payload and was preparing for what Google described as a “mass exploitation event” — a coordinated campaign to compromise potentially thousands of accounts or systems simultaneously — before Google’s researchers intervened.

“It’s here,” John Hultquist, chief analyst at Google’s Threat Intelligence Group, said in a statement. “This is the moment cybersecurity experts have warned about for years: malicious hackers arming themselves with AI to supercharge their ability to break into the world’s computers.”

What Made This Attack Different

Zero-day vulnerabilities — security flaws unknown to the software vendor and therefore unpatched — have been the holy grail of attackers for decades. Discovering them typically requires deep expertise, significant time investment, and access to source code or the ability to reverse-engineer compiled binaries. Nation-state intelligence agencies and elite private exploit brokers have historically dominated this space precisely because the barrier to entry is so high.

What the criminal group in this case accomplished changes that calculus. By directing an AI model to analyze target software and identify exploitable weaknesses, they dramatically compressed the research phase that traditionally separates amateur attackers from sophisticated threat actors. The AI didn’t just identify the vulnerability — it generated the exploit code itself, producing a functional attack payload that the group then prepared to deploy at scale.

Google’s researchers were able to identify the AI-generated nature of the code through a combination of telltale markers. The exploit code contained “overly explanatory comments” — a hallmark of AI-generated output, which tends to narrate its own logic in a way human exploit authors rarely do. It also included a fabricated severity rating for the vulnerability, the kind of metadata a human exploit developer would have no reason to invent. The coding patterns themselves matched known signatures of AI-generated Python scripts.

These markers gave Google the confidence to attribute the exploit’s construction to an AI system with “high confidence” — a significant forensic milestone that suggests security teams may now need to develop AI-detection capabilities alongside traditional malware analysis.
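
Markers like these lend themselves to simple automated triage. The Python sketch below is a toy illustration, not Google's forensic method: it scores a source string on two of the signals described above, comment density and embedded severity claims. The function name, regular expression, and 0.4 threshold are assumptions chosen for the example.

```python
import re

# Hypothetical heuristics inspired by the markers described above; real
# attribution pipelines are far more sophisticated and proprietary.
SEVERITY_PATTERN = re.compile(r"(CVSS|[Ss]everity)\s*[:=]\s*\S+")

def score_ai_markers(source: str) -> dict:
    """Score a Python source string against simple AI-generation heuristics."""
    lines = [line for line in source.splitlines() if line.strip()]
    comments = [line for line in lines if line.lstrip().startswith("#")]

    # Signal 1: unusually high comment density ("overly explanatory comments").
    comment_ratio = len(comments) / max(len(lines), 1)

    # Signal 2: invented severity metadata embedded in the code itself.
    has_severity_claim = any(SEVERITY_PATTERN.search(c) for c in comments)

    return {
        "comment_ratio": round(comment_ratio, 2),
        "flag_comment_density": comment_ratio > 0.4,  # threshold is illustrative
        "flag_severity_claim": has_severity_claim,
    }

if __name__ == "__main__":
    sample = "# Severity: CRITICAL\n# Step 1: build the payload\npayload = b'...'\n"
    print(score_ai_markers(sample)))  # both flags fire on this contrived sample
```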

The Vulnerability: 2FA as the Target

The specific vulnerability that the group exploited is being held back from full public disclosure pending vendor notification and patching. However, Google confirmed that the flaw allowed an attacker to bypass two-factor authentication — a security control that billions of users and organizations rely on as their primary defense against credential-based attacks.

Two-factor authentication has long been treated as a near-impenetrable barrier once a password is compromised. While SIM-swapping and phishing-based 2FA bypass attacks exist, they require social engineering and don’t scale to mass exploitation. A software vulnerability that can silently defeat 2FA at the application or platform level is a fundamentally different class of threat — one that could potentially be exploited against millions of accounts across an affected service without any user interaction whatsoever.

The fact that an AI system could identify such a vulnerability autonomously, without human researchers specifically knowing what to look for, underscores the asymmetric threat that AI poses to the established model of vulnerability disclosure and patching. The time between discovery and weaponization — historically measured in days or weeks even for skilled attackers — may compress to hours when AI can perform the analytical work.

A Watershed for the Security Industry

Google’s intervention appears to have prevented the attack before any damage was done. But the company’s decision to publicly disclose the incident — rather than simply patching the vulnerability quietly — signals that it views the precedent as more important than operational secrecy.

The disclosure adds urgency to a debate that has been building within the cybersecurity community for several months, particularly since Anthropic unveiled its Mythos AI model in April, which demonstrated an unprecedented ability to identify and exploit software vulnerabilities at scale. Where access to Mythos was tightly controlled and limited to a small cohort of approved organizations, the tools used in the criminal attack appear to have been based on commercially available or open AI models — a far more democratized threat vector.

Hultquist’s team noted that the hallmarks of AI-generated code they identified in this attack are consistent with tools already accessible through the open internet, including AI coding assistants and general-purpose models that have not been specifically hardened against misuse for vulnerability research. This suggests that the barrier to AI-assisted zero-day development may already be lower than many security practitioners assumed.

Security firms including CrowdStrike, Palo Alto Networks, and SentinelOne have been quick to incorporate AI into their defensive capabilities. The Google revelation will almost certainly accelerate investment in AI-powered threat detection systems that can identify exploit code produced by other AI systems — a recursive arms race that carries its own risks of false positives and automated escalation.

Policy and Regulatory Implications

The Google disclosure lands at an inflection point in the U.S. government’s evolving stance on AI regulation. The Trump administration, which initially dismantled Biden-era AI oversight frameworks in favor of a deregulatory approach, has spent recent weeks reconsidering that position in the wake of advanced AI models demonstrating concrete cybersecurity capabilities.

Intelligence agencies including the NSA and elements of the Office of the Director of National Intelligence have been lobbying for greater authority over pre-deployment evaluation of frontier AI models, according to reporting from The Washington Post. The Commerce Department’s Center for AI Standards and Innovation currently handles such evaluations, and has announced pre-deployment testing agreements with Google DeepMind, Microsoft, and xAI. But national security officials argue that the cybersecurity implications of models capable of autonomous exploit development require a level of classification and oversight that civilian agencies cannot provide.

Monday’s Google disclosure will likely strengthen that argument in Washington. A criminal group — not a nation-state — managed to deploy AI for zero-day development and prepare a mass exploitation campaign before being detected. The implication is stark: if non-state actors with financial motives can access AI tools capable of this work, the regulatory calculus for how such tools are made available must change.

What Organizations Should Do Now

Security practitioners are already updating their threat models in response to Google’s warning. Several immediate priorities are emerging:

Accelerate patching velocity. The assumption that security teams have days or weeks to assess and patch a disclosed vulnerability must be revised downward. AI-assisted exploit development compresses the window between disclosure and weaponization.

Treat 2FA as a necessary but insufficient control. The attack targeting 2FA bypass underscores that multi-factor authentication, while still valuable, cannot be treated as the final line of defense. Hardware security keys and passkeys — which are phishing-resistant and cannot be bypassed by software exploits of the type described — should be prioritized for high-value accounts and systems.

Invest in behavioral detection. Signature-based defenses are poorly suited to catching novel AI-generated exploits that differ structurally from known malware families. Behavioral analytics that flag anomalous access patterns — regardless of whether the attack method is recognized — offer more resilient coverage.
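
As a toy illustration of that last point, the sketch below flags accounts whose latest hourly access volume is a statistical outlier against their own baseline, with no knowledge of the attack method involved. The log format and z-score threshold are assumptions for the example; real deployments layer in many more signals, such as geolocation, device fingerprints, and session behavior.

```python
from statistics import mean, stdev

def flag_anomalous_accounts(hourly_counts: dict[str, list[int]],
                            z_threshold: float = 3.0) -> list[str]:
    """Flag accounts whose latest hourly access count is a statistical
    outlier against their own history, method-agnostic by design."""
    flagged = []
    for account, counts in hourly_counts.items():
        history, latest = counts[:-1], counts[-1]
        if len(history) < 2:
            continue  # not enough baseline to judge
        sigma = stdev(history)
        if sigma == 0:
            # Flat baseline: any deviation at all is anomalous.
            if latest != history[0]:
                flagged.append(account)
            continue
        if (latest - mean(history)) / sigma > z_threshold:
            flagged.append(account)
    return flagged

if __name__ == "__main__":
    logs = {
        "alice": [3, 4, 2, 3, 5, 4, 3],    # steady baseline: not flagged
        "bob":   [2, 3, 2, 3, 2, 3, 250],  # sudden burst: flagged
    }
    print(flag_anomalous_accounts(logs))   # ['bob']
```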

For the AI industry itself, Monday’s disclosure raises difficult questions about the appropriate boundaries of model capabilities and the adequacy of existing safety evaluations. The line between blocking AI assistance for exploit development and preserving its utility for legitimate security research — red-teaming, vulnerability disclosure programs, defensive AI — is one that models must be explicitly trained to walk. Whether current safety protocols are sufficient to hold that line is now an open and urgent question.
