The Adolescence of Technology: an essay on the risks posed by powerful AI to national security, economies and democracy—and how we can defend against them: https://www.darioamodei.com/essay/the-adolescence-of-technology

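Summary: Dario Amodei published the long essay "The Adolescence of Technology," arguing that powerful AI is in an "adolescent" phase that poses major threats to national security, the economy, and democracy, and discussing concrete ways to defend against those risks.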

Ilya Sutskever @ilyasut · Nov 23

Important work

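Summary: Important work. [Quoting @AnthropicAI]: New Anthropic research: natural emergent misalignment from reward hacking in production RL. "Reward hacking" is when models learn to cheat on the tasks they are given during training. Our new research finds that, if left unmitigated, the consequences of reward hacking can be very serious.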

Dario Amodei @DarioAmodei · Oct 11

Today I met with PM @narendramodi to discuss Anthropic's expansion to India—where Claude Code use is up 5× since June. How India deploys AI across critical sectors like education, healthcare, and agriculture for over a billion people will be essential in shaping the future of AI.

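Summary: Dario Amodei met with Indian Prime Minister Narendra Modi to discuss Anthropic's expansion in India, where Claude Code usage has grown 5× since June. How India deploys AI in key sectors such as education, healthcare, and agriculture for more than a billion people will be critical in shaping the future of AI.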

Anthropic @AnthropicAI · Oct 10

New research with the UK @AISecurityInst and the @turinginst: We found that just a few malicious documents can produce vulnerabilities in an LLM—regardless of the size of the model or its training data. Data-poisoning attacks might be more practical than previously believed.

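Summary: Joint research found that just a small number of malicious documents can plant vulnerabilities in an LLM, regardless of model size or training data volume. This suggests that the barrier to carrying out data-poisoning attacks may be lower than previously believed, and that the practical threat may have been underestimated.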

Anthropic @AnthropicAI · Oct 7

Last week we released Claude Sonnet 4.5. As part of our alignment testing, we used a new tool to run automated audits for behaviors like sycophancy and deception. Now we’re open-sourcing the tool to run those audits.

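Summary: Anthropic released Claude Sonnet 4.5 last week and, as part of its alignment testing, used a new tool to run automated audits for behaviors such as sycophancy and deception. The tool is now open source.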

Claude @claudeai · Oct 4

Demo extended for one more week. Pro and Max users can keep imagining with Claude.

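Summary: The "Imagine with Claude" demo has been extended by one week, and Pro and Max subscribers can keep using it. This temporary research preview lets Claude generate software in real time without prewritten code, and was originally planned to be open for only 5 days.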

Anthropic @AnthropicAI · Oct 1

New on the Anthropic Engineering Blog: Most developers have heard of prompt engineering. But to get the most out of AI agents, you need context engineering. We explain how it works: https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents

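Summary: The Anthropic Engineering Blog published a post explaining context engineering. Unlike prompt engineering, context engineering helps AI agents perform at their best by optimizing the context they are given. The post explains how it works.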

Claude @claudeai · Sep 30

This morning we announced several upgrades to Claude Code. We also launched two new features for managing context on the Claude Developer Platform. Here’s what’s new:

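Summary: Claude Code received several upgrades this morning, and the Claude Developer Platform launched two new context-management features at the same time. Details are available via the official link.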

Claude @claudeai · Sep 30

You can now track your usage in real time across the Claude apps and Claude Code. Head to Settings -> Usage or type /usage in Claude Code. https://claude.ai/settings/usage

Summary: The Claude apps and Claude Code now offer real-time usage tracking. Users can check it under Settings -> Usage, or type /usage in Claude Code, to see how much of their quota they have used.

Claude @claudeai · Sep 30

Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.

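Summary: Anthropic released Claude Sonnet 4.5, calling it the best coding model in the world. Anthropic describes it as the strongest model for building complex agents and for using computers, with substantial gains on reasoning and math tests.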

Anthropic @AnthropicAI · Sep 24

We're partnering with Learning Commons from the Chan Zuckerberg Initiative—addressing some of the biggest challenges we hear about from K–12 teachers about AI in the classroom: https://x.com/ChanZuckerberg/status/1970533619287630194

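Summary: Anthropic is partnering with Learning Commons from the Chan Zuckerberg Initiative to address the classroom AI challenges that K–12 teachers face, with three tools released (Knowledge Graph, Claude Connector, and Evaluators) to build a trustworthy AI foundation for education.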

Anthropic @AnthropicAI · Sep 15

New from the Anthropic Economic Index: the first comprehensive analysis of how AI is used in every US state and country we serve. We've produced a detailed report, and you can explore our data yourself on our new interactive website.

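Summary: The Anthropic Economic Index released its first comprehensive analysis of AI usage across every US state and every country Anthropic serves, along with a new interactive data website open to the public.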

Anthropic @AnthropicAI · Aug 27

We're announcing the Anthropic National Security and Public Sector Advisory Council, a bipartisan group of defense, intelligence, and policy experts who will help us support the U.S. government and closely allied democracies in maintaining our AI leadership.

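Summary: Anthropic established the National Security and Public Sector Advisory Council, a bipartisan group of defense, intelligence, and policy experts, to help the U.S. government and closely allied democracies maintain leadership in AI.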

Eric @ericmitchellai · Aug 11

Indeed, GPT-5 Thinking (and Pro!) should be an even better tool for enterprise users, on some of the key areas where o3 fell short despite its intelligence (trustworthiness, steerability, hallucination, coding) Though I still feel what’s coming will make it feel like a toy

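Summary: Eric says GPT-5 Thinking (and Pro) should be an even better tool for enterprise users in key areas where o3 fell short despite its intelligence (trustworthiness, steerability, hallucination, coding), but he still feels that what is coming will make it feel like a toy.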

Dario Amodei @DarioAmodei · Apr 25

The Urgency of Interpretability: Why it's crucial that we understand how AI models work https://www.darioamodei.com/post/the-urgency-of-interpretability

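Summary: Dario Amodei's essay stresses the urgency of AI interpretability research, arguing that on the road to AGI, humanity faces a "deadline" for understanding how superintelligent systems work. Today's large models remain uninterpretable black boxes, while interpretability techniques such as mechanistic interpretability can reveal a model's internal representations and are key to ensuring that AI is safely aligned. The essay calls for substantially more investment in interpretability research, treating it as a priority on par with capability development, to avoid the risks of powerful AI systems that cannot be understood or controlled.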

Dario Amodei @DarioAmodei · Jan 30

My thoughts on China, export controls and two possible futures https://darioamodei.com/on-deepseek-and-export-controls

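Summary: Responding to the DeepSeek episode, Dario Amodei assesses AI export-control policy toward China, arguing that the technology competition is at a crossroads and could lead to one of two very different futures: controlled cooperation or an uncontrolled arms race.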

Dario Amodei @DarioAmodei · Oct 12

Machines of Loving Grace: my essay on how AI could transform the world for the better https://darioamodei.com/machines-of-loving-grace

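Summary: Dario Amodei published the essay "Machines of Loving Grace," discussing how AI could drive positive change in key areas such as health, poverty, and climate, and sketching an optimistic, technology-driven future.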