# 美国政府强制Anthropic关闭面向全球用户的Claude Fable 5和Mythos 5

- 来源：The Decoder：AI News（RSS）
- 作者：Matthias Bastian
- 发布时间：2026-06-13 15:40
- AIHOT 分数：79
- AIHOT 链接：https://aihot.virxact.com/items/cmqc2i1q100n1sl9bp5i288p2
- 原文链接：https://the-decoder.com/us-government-forces-anthropic-to-disable-claude-fable-5-and-mythos-5-for-all-customers-worldwide

## AI 摘要

美国政府以存在所谓越狱风险为由，要求Anthropic立即停止向全球用户提供Claude Fable 5和Mythos 5。Anthropic已服从命令，但公开反驳称漏洞极小，且竞争对手的模型如GPT-5.5也存在类似问题。该公司警告，此举可能开创先例，导致所有前沿模型部署被叫停。

## 正文

US government forces Anthropic to disable Claude Fable 5 and Mythos 5 for all customers worldwide

Key Points

The U.S. government has ordered Anthropic to immediately cut off global access to its AI models Fable 5 and Mythos 5, citing national security concerns.

The export ban covers all foreign nationals, including Anthropic's own international employees, effectively shutting down access outside the country.

The order stems from a suspected jailbreak that the government believes could let users get around Fable 5's built-in safety guardrails, a claim Anthropic has publicly disputed.

The US government has directed Anthropic to shut down access to its most powerful AI models, Fable 5 and Mythos 5, worldwide, citing national security concerns. Anthropic is complying but publicly pushing back.

The export control directive bans all access to Fable 5 and Mythos 5 by foreign nationals, whether they're inside or outside the US. Even Anthropic's own foreign employees are affected.

To comply, Anthropic has to cut off access for all customers worldwide. All other Anthropic models remain available, according to the company's statement. Anthropic calls the move a "misunderstanding" and says it's working to restore access as quickly as possible. The company plans to share more details within 24 hours.

Government claims jailbreak risk, Anthropic disagrees

According to Anthropic, the government believes it has found a method to bypass Fable 5's safety measures. The company says it reviewed a demo of the technique and found it identifies only "a small number of previously known, minor vulnerabilities" that other publicly available models could also detect.

The potential jailbreak—so far only described verbally by the government—boils down to asking the model to read a specific codebase and fix software bugs. Anthropic says it reviewed the report behind the directive and concluded that the capabilities shown are "widely available from other models," including OpenAI's GPT-5.5. Security researchers already use these capabilities daily to protect systems.

Anthropic's own cybersecurity marketing comes back to bite it

Before launch, the US government, the UK AI Safety Institute (UK AISI), private third-party organizations, and internal teams tested the model for thousands of hours combined. The safety measures are "substantially more effective than those of any previously deployed model," Anthropic says. Users even complained they were too restrictive.

No tester has found a universal jailbreak, a method that could broadly bypass the model's safety measures and unlock a wide range of cyber capabilities. But Anthropic also says that perfect jailbreak resistance isn't possible for any model provider right now, a fact well-documented given the sheer number of attack vectors LLMs offer. Every safeguard used across the industry is vulnerable to non-universal jailbreaks that can extract some information in specific cases, Anthropic says.

Knowing this, the company pursued a strategy it calls "defense in depth": keep jailbreaks either narrowly scoped or expensive to pull off, combined with broad monitoring to quickly detect and shut down successful attacks. Part of this strategy includes 30-day data retention for customer data, which Anthropic says creates "real costs for us with customers" but enables jailbreak research and mitigation.

Anyone who previously criticized Anthropic for fear-based marketing can see the irony here. The company spent months loudly warning about the cybersecurity risks of Mythos-class models, working hard to show how superior the model is. Now it has to argue that models already on the market have similar capabilities.

Anthropic warns of a dangerous precedent for the entire industry

Anthropic is complying with the order but making its objections clear. "We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people." If this standard were applied across the industry, it would effectively halt all new model deployments from every frontier model provider, the company says.

In earlier public statements, Anthropic argued that the government should have the power to block unsafe deployments, but through a legal process that is "transparent, fair, clear, and grounded in technical facts." The current action doesn't meet those principles, the company says, hinting that this could become another chapter in the ongoing clash between Anthropic and the US government.

The US government recently issued a new executive order that lets AI developers submit their models for government safety review before release. Anthropic welcomed that approach, but the process apparently wasn't in place yet when the directive came down.

LLMs remain a weak spot in every cybersecurity setup

Jailbreaks and the related problem of prompt injections have been an unsolved security problem since the early days of large language models. No LLM maker is immune. The vulnerability has been known since at least GPT-3 and affects all LLM-based systems. ChatGPT and Claude can still be attacked through prompt injection under certain conditions, even though their makers have added countermeasures.

Even targeted security efforts have fallen short. About a year ago, Anthropic built a specialized defense against manipulation attempts and put it through a public jailbreaking challenge. After five days, over 300,000 messages, and roughly 3,700 collective work hours, the system was completely cracked, including a universal jailbreak.

AI News Without the Hype – Curated by Humans
