Regulatory  ·  2026-04-24

Anthropic Mythos Model Accessed by Unauthorized Users Despite Limited Preview

RegulatoryHigh impactUnited States
Bloomberg reported on April 21, 2026 that a small group of unauthorized users gained access to Anthropic's Claude Mythos Preview model—described by the company as too dangerous for public release—through an educated guess of the model's online location using information leaked from the Mercor breach and insider access from a third-party contract evaluator. The group has maintained continuous access since the model's April announcement.
This breach exposes a fundamental security gap in how Anthropic protects its most sensitive models, particularly concerning given Mythos' advertised capability to find vulnerabilities 'in every major operating system and web browser.' The incident undermines Anthropic's AI safety positioning and raises critical questions about vendor security practices, third-party contractor vetting, and monitoring of limited-preview models. Security researcher Lukasz Olejnik characterized the failure as 'entirely imaginable' and routine in traditional cybersecurity.
Organizations evaluating Anthropic for sensitive deployments should request clarification on contractor access controls, model usage logging, and anomaly detection capabilities. AI security teams should assess whether their own limited-preview or internal models have sufficient monitoring to detect unauthorized access patterns, particularly from insider-adjacent actors.
Sources
The VergeFortune
See this in the live feed Explore related AI security and governance findings — updated every morning.
Open the feed →