Chubby♨️@kimmonismus

2026-06-14 01:52·19天前

AI 摘要

据David Sacks爆料，Anthropic本周发布Mythos类模型商业版Fable（带护栏）。一位可信测试方发现越狱漏洞，美国政府要求CEO Dario Amodei修复或下架，Dario拒绝，称漏洞不严重。安全合作伙伴和政府认为该越狱可暴露先进网络能力（Anthropic曾自称Mythos为网络武器）。Anthropic优先保留消费者模型而非修复安全漏洞，与其“AI安全公司”品牌矛盾。美政府不情愿下发出口管制，希望Anthropic修复后解除。

Interesting： According to David Sacks' opinion， the fault lies with Anthropic （specifically CEO Dario Amodei）. He argues that：

• Anthropic released Fable （Mythos with guardrails） but refused the U.S. government's reasonable request to fix a confirmed jailbreak that could expose advanced cyber capabilities.

• They prioritized keeping the consumer model available over addressing the safety issue， which directly contradicts their long-standing public branding as the "AI safety company."

• The administration only issued the export control reluctantly after Anthropic declined to cooperate， and Sacks emphasizes that the ball is now in Anthropic's court to remediate the problem.

It's getting more interesting minute by minute.

David SacksI've had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true: -...

Anthropic 安全/对齐

在 X 查看原推导出 Markdown

Chubby♨️@kimmonismus · X

69导出 Markdown