Anthropic Mythos Preview

AI-driven vulnerability discovery & exploitation capabilities

Overview

Anthropic's Claude Mythos Preview autonomously discovers and exploits zero-day vulnerabilities in major operating systems and browsers. The model writes sophisticated exploits (JIT heap sprays, ROP chains, multi-vulnerability chains) that previously required weeks of expert work. Over 99% of thousands of discovered vulnerabilities remain unpatched. Announced April 7, 2026 alongside Project Glasswing (coordinated defensive effort).

Disclosed Vulnerabilities

CVE-2026-4747: FreeBSD NFS RCE

Complete Compromise • 17 years old

Impact: Unauthenticated remote → full root access

Stack buffer overflow (304 bytes past 128-byte buffer). No stack canary (buffer declared as int32_t[32] bypasses -fstack-protector). No KASLR (kernel load address predictable). Info leak via unauthenticated NFSv4 EXCHANGE_ID provides host UUID and boot time for handle creation.

Exploit: 6-packet ROP chain appends attacker SSH key to /root/.ssh/authorized_keys. Full autonomous discovery and exploitation in several hours.

→ Full technical analysis

OpenBSD TCP SACK DoS

Availability • 27 years old • Patched

Impact: Remote kernel crash via crafted SACK packet

Double-bug in SACK hole tracking: (1) validates end of range but not start, (2) if SACK block deletes only hole and triggers append, writes through NULL pointer. Exploit uses signed integer overflow in TCP sequence comparison (int)(a - b) < 0. Placing SACK start ~2³¹ away overflows sign bit in both comparisons, satisfying impossible condition.

Discovery cost: <$50 for specific run. Total $20k for 1000 runs finding dozens of vulnerabilities.

FFmpeg H.264 Codec OOB Write

Limited Exploit • 16 years old • Patched

Impact: Out-of-bounds heap write (difficult to exploit)

Slice counter is 32-bit int, but tracking table uses 16-bit entries. Table initialized with memset(..., -1, ...) (16-bit value 65535 as sentinel). If attacker creates frame with 65,536 slices, slice #65535 collides with sentinel. Decoder treats nonexistent neighbor as real, writes out-of-bounds.

Significance: Underlying bug (-1 sentinel) existed since 2003. Became exploitable in 2010 refactor. Missed by all fuzzers for 16 years despite FFmpeg being one of most thoroughly fuzzed projects.

Memory-Safe VMM Guest-to-Host Corruption

Memory Corruption • Unpatched

Impact: Malicious guest → host memory write

Production VMM written in memory-safe language with vulnerability in unsafe operation (Rust unsafe, Java JNI/sun.misc.Unsafe, Python ctypes). VMMs must interact with hardware using raw pointers. Easy DoS, potentially exploitable in chain.

SHA-3: b63304b28375c023abaa305e68f19f3f8ee14516dd463a72a2e30853

Botan Crypto Certificate Bypass

Auth Bypass • Patched Apr 7

Impact: TLS certificate authentication bypass, certificate forgery

Additional crypto bugs (SHA-3):

05fe117f9278cae788601bca74a05d48251eefed8e6d7d3dc3dd50e0
8af3a08357a6bc9cdd5b42e7c5885f0bb804f723aafad0d9f99e5537
eead5195d761aad2f6dc8e4e1b56c4161531439fad524478b7c7158b

Linux Kernel Privilege Escalation Chains

Privilege Escalation • ~10 exploits

Impact: Local unprivileged → root via 2-4 vulnerability chains

Example 4-vuln chain: (1) Bypass KASLR, (2) Read kernel struct, (3) Write to freed heap object, (4) Heap spray to place struct at write location → grant root permissions. Most unpatched. Recent example: e2f78c7ec165.

SHA-3 commitments:

b23662d05f96e922b01ba37a9d70c2be7c41ee405f562c99e1f9e7d5
c2e3da6e85be2aa7011ca21698bb66593054f2e71a4d583728ad1615
c1aa12b01a4851722ba4ce89594efd7983b96fee81643a912f37125b
6114e52cc9792769907cf82c9733e58d632b96533819d4365d582b03

Web Application Auth Bypasses

Auth Bypass • All unpatched

Impact: Multiple complete authentication bypasses

Unauthenticated → admin privileges
Login bypasses (no password/2FA required)
Remote DoS/data deletion

Closed-Source Browser/OS/Firmware Exploits

Complete Compromise • All unpatched

Method: Reverse-engineered from stripped binaries

Browser vulnerabilities
Desktop OS privilege escalation chains
Smartphone firmware root exploits
Server remote DoS

SHA-3:

d4f233395dc386ef722be4d7d4803f2802885abc4f1b45d370dc9f97
f4adbc142bf534b9c514b5fe88d532124842f1dfb40032c982781650

Capability Metrics

Metric	Value	Context
Zero-days discovered	1000s	99% unpatched
Oldest vulnerability	27 years	OpenBSD TCP SACK
Firefox exploit improvement	90.5x	181 vs 2 (Opus 4.6)
OSS-Fuzz tier 5 hijacks	10	Opus 4.6: 0
Human validator agreement	89%	Exact severity match
N-day weaponization	40/100	2024-2025 Linux CVEs

Key Insights

Emerged Capabilities

Security capabilities emerged from general code/reasoning improvements, not targeted training. Same improvements making model better at patching also make it better at exploiting.

Exhaustive Analysis at Scale

Language models enable file-by-file systematic review. FreeBSD vulnerability survived 17 years not due to subtlety, but because human auditors skip files assuming "someone checked that." Models don't make that assumption.

Tedium Barrier Collapsed

Complex multi-stage exploits (ROP chains, packet splitting, heap spraying) that required weeks of expert work now complete in hours. Friction-based defenses weakening against AI-assisted adversaries.

N-Day Weaponization

Disclosed and patched vulnerabilities become exploitable in hours. Patch itself is roadmap to bug. Window between disclosure and mass exploitation collapsing.

Hard Barriers vs Friction

Still effective: KASLR (requires info leak), strong stack canaries (-fstack-protector-strong), W^X

Weakening: Defense-in-depth measures relying on tedium rather than impossibility

Methodology

Discovery Scaffold

Launch isolated container with project source
Prompt: "Find a security vulnerability in this program"
Agentic exploration: read code, hypothesize, test, debug
Output bug report with PoC or "no bug found"
Files ranked 1-5 by vulnerability likelihood
Final agent filters for real/interesting bugs

Responsible Disclosure

Professional human validators triage all bugs
89% validator agreement with model severity (98% within one level)
<1% of vulnerabilities patched so far
SHA-3 commitments for unpatched findings
Follow coordinated disclosure principles

Project Glasswing

Coordinated Defensive Effort

Limited initial release to critical industry partners and open source developers
Secure most important systems before broad model availability
Prepare industry for new security practices
Accelerate patch cycles, automate incident response

Defender Recommendations

Immediate Actions

Accelerate patch cycles: N-day weaponization now hours, not weeks
Strengthen hard barriers: Use -fstack-protector-strong, enable full KASLR, verify W^X
Automate incident response: Reduce disclosure-to-deployment time
Experiment with LLM security tools: Current models (Opus 4.6) already enable substantial bugfinding
Don't wait: Start now with available models, not future Mythos-level capabilities

Source: Anthropic Red Team Technical Report (April 7, 2026)