SOURCE // NEWS

US Government Orders Anthropic to Shut Down Claude Fable 5 and Mythos 5

US Government Orders Anthropic to Shut Down Claude Fable 5 and Mythos 5

On Friday, the U.S. government ordered Anthropic to immediately shut off access to two of its most powerful AI models — Claude Fable 5 and Claude Mythos 5 — citing national security concerns. #Anthropic announced on X that it has complied, but it made clear it thinks the government got this one wrong.

The directive, which Anthropic said it received on Friday at 5:21 pm ET, forces the company to disable both models for all users worldwide — not just the foreign nationals the government’s export control order was nominally aimed at. Access to Anthropic’s other models isn’t affected.

Why does any of this matter? Mythos is Anthropic’s most capable AI model, one the company previewed in early April and has kept tightly restricted ever since because of what Anthropic described as its exceptional ability to find security vulnerabilities in software. According to Anthropic, Mythos identified flaws in every major operating system and web browser it tested, so rather than release it broadly, the company launched a controlled program called Project Glasswing, sharing it with roughly 50 vetted organizations, including Amazon, Apple, Google, Microsoft, and CrowdStrike, to use for defensive #cybersecurity work.

Fable 5, released just three days ago, was Anthropic’s answer to the obvious commercial pressure: a version of Mythos fitted with guardrails that block responses in high-risk areas like cybersecurity and biology, making it safe enough for general release, the company argued. It was immediately the most capable AI model available to the public, according to benchmark tests from Vals AI, a company that tracks AI tech performance.

The government’s directive is framed as an export control action, restricting foreign national access to the models. But in a lengthy blog post, Anthropic says its understanding is that the underlying concern is a claimed #jailbreak of Fable 5. So far, the company says, the government has provided only verbal evidence of a “potential narrow, non-universal jailbreak” — one that, as Anthropic describes it, amounts to prompting the model to read a specific codebase and identify software flaws. And by the way, adds the company, it’s a “level of capability” that’s already widely available in other publicly accessible models, including OpenAI’s GPT-5.5. It’s also used routinely by cybersecurity professionals for defensive purposes, says Anthropic.

Anthropic’s broader argument is that its strongest safeguards operate through independent classifier systems that function separately from the model itself, meaning that even if someone convinces Fable to keep talking past a refusal, the underlying protections against the most dangerous outputs remain in place.

Clearly, none of that was enough to stop the government from acting, and Anthropic isn’t hiding its frustration. “We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions.”

[AgentUpdate Depth Analysis] The U.S. government's sudden halt of Claude Fable 5 and Mythos 5 marks a pivotal transition into "hard intervention" regulatory oversight, signaling major headwinds for the AI Agent ecosystem. Agents capable of autonomous code execution and vulnerability assessment operate on a fine line between productivity and national security threats. While competitors like OpenAI and Google rely on similar post-training alignment and guardrails, this incident proves that self-regulation is no longer sufficient. This clampdown will likely split the AI Agent landscape into a "dual-track" system: highly restricted commercial consumer agents versus specialized, sandboxed "military-grade" agents restricted to vetted enterprises. Consequently, agentic frameworks like LangChain, CrewAI, and MCP must evolve to natively support verifiable sandboxing and secure execution environments to survive tightening global regulations.