据David Sacks爆料,Anthropic本周发布Mythos类模型商业版Fable(带护栏)。一位可信测试方发现越狱漏洞,美国政府要求CEO Dario Amodei修复或下架,Dario拒绝,称漏洞不严重。安全合作伙伴和政府认为该越狱可暴露先进网络能力(Anthropic曾自称Mythos为网络武器)。Anthropic优先保留消费者模型而非修复安全漏洞,与其“AI安全公司”品牌矛盾。美政府不情愿下发出口管制,希望Anthropic修复后解除。
Interesting: According to David Sacks' opinion, the fault lies with Anthropic (specifically CEO Dario Amodei). He argues that:
• Anthropic released Fable (Mythos with guardrails) but refused the U.S. government's reasonable request to fix a confirmed jailbreak that could expose advanced cyber capabilities.
• They prioritized keeping the consumer model available over addressing the safety issue, which directly contradicts their long-standing public branding as the "AI safety company."
• The administration only issued the export control reluctantly after Anthropic declined to cooperate, and Sacks emphasizes that the ball is now in Anthropic's court to remediate the problem.