Dario 害死了大模型公司！！ OpenAI GPT-5.6 被迫都不敢正常发布… 世界上最好的AI模型在一个周三从互联网上消失了。 Fable上线三天，人们在用它开发。公司在围绕它做规划，然后Anthropic把它关掉了。原因不是技术故障，原因是一通来自美国政府的电话。 2026年4月，Anthropic发布了Fable。他们没有公开上线，而是先给了一小群合作伙伴。他们说这个模型太强大了，不能公开发布。美国政府注意到了，他们要求Anthropic限制非美国公民的访问。 Anthropic照做了，模型在几天内被撤下。一个月后，OpenAI的GPT 5.6准备好了。他们训练了好几个月。 Sam Altman看到了Fable的下场。他看着眼前的局势做了一个判断。如果我们现在发布5.6，正好赶上Fable被禁，政府会对我们做同样的事。所以OpenAI没有发布，据The Information报道，美国政府要求OpenAI错开发布时间。先给一小群值得信赖的合作伙伴。公众稍后才能拿到，也许是几周后。没人知道。这就是AI监管。不是写在法律里的，不是国会投票通过的。就是三个实体之间的一系列电话和非正式协议：Anthropic、OpenAI和美国政府。这件事的策划者是Dario Amodei。两年来，Anthropic一直在运行一场基于恐惧的营销活动。他们发布关于中国公司蒸馏攻击的报告。他们谈论网络安全风险，他们描述白领失业潮。他们告诉政府，他们的模型对公众来说太危险了。然后他们请求政府提供保护。比尔·格利，硅谷最知名的风险投资人之一，把话说得很直接。 Anthropic本可以就蒸馏攻击提起诉讼。但他们没有走法律途径，他们选择了游说监管。法院只能判赔，监管能提供多年的竞争保护。影响是蛮大的。当只有一小群公司能率先获得前沿模型，它们就会加速跑得更远。用Fable 5.0训练Fable 5.1。再用5.1训练5.2。差距越来越大。初创公司和开源实验室无法竞争，因为它们用的是六个月前的模型。 OpenAI的IPO被推迟到2027年。不是因为市场条件。是因为他们对监管框架的走向完全没有头绪。想参与价值创造的散户投资者只能等。与此同时，中国没有减速。美国限制自家模型访问的每一分钟，中国实验室都在缩小差距，竞争优势在缩小。 Box的CEO亚伦·利维写道，我们现在有了事实上的AI监管。今后那些在特定算力规模上训练的模型，发布前需要经过政府审查，这一点已经不再显而易见。这也许是不可避免的。但它正在发生，而且是以非正式的方式发生的。 OpenAI研究员Noam Brown发帖说GPT 5.6在编程方面强得离谱。他说希望尽快让所有人都能用上。 OpenAI自己的博客文章重点写了什么？安全。它描述了模型在网络攻防方面的能力，它描述了对齐测试。然后出现了一句改变一切的话。作为我们与美国政府持续接触的一部分，我们在发布前预览了计划和模型能力。一家私营公司现在把最先进的技术先展示给政府看，再展示给公众。政府决定谁在什么时候能拿到。一年前，人们还在讨论AI监管是否必要。今天，监管已经存在了。它从未被写下来，从未被公开辩论。它是通过恐惧、游说和一系列电话建立起来的。开源比以往任何时候都更重要。但开源落后于前沿。那些构建最好模型的公司跑得最快，也是这个安排中获利最多的人。没有人问的那个问题很简单。如果Anthropic的模型强大到危险，为什么它们不够强大到实时检测蒸馏攻击？如果它们能在几年内治愈癌症，为什么它们挡不住两万个欺诈账户提取模型权重？答案是，恐惧才是产品。监管才是目标。

译Anthropic 发布 Fable 三天后，美国政府要求限制非美公民访问，模型被撤下。一个月后 OpenAI GPT-5.6 准备就绪，Sam Altman 因政府压力延迟发布，仅向可信伙伴开放。研究员称 GPT-5.6 编程极强，官方博文侧重安全。Box CEO 指出已形成事实监管：达到特定算力规模的模型发布前需政府审查。OpenAI 推迟 IPO 至 2027 年。Anthropic CEO 被指用恐惧营销游说监管获得保护。中国未减速，差距缩小。开源实验室落后。

AYi@AYi_AInotes · 5天前63

卧槽，Claude Code 桌面版这波更新太懂开发者了，原生多会话拖拽分屏，直接把并行 Agent 工作流的效率拉满了🤯 以前跑多个 Claude Code 会话得靠 tmux，开一堆终端窗口来回切，管理混乱进度也看不清。现在官方直接把多路复用器做进了桌面应用里，所有会话在左侧侧边栏统一管理，拖拽就能排成并排窗格，一个窗口同时看几个 Agent 干活。核心用法很清晰： 1. 桌面 App 里开多个会话，不同项目不同子任务都能分开。 2. 自由拖拽排列窗格，支持单独弹出新窗口。 3. 内置终端，文件编辑器，预览面板都能一起分屏排布。 4. 底部同时显示多个会话的输入区，随时切换输入。相当于把终端里的黑盒并行，变成了可视化的多任务工作台，所有进度一眼全览，不用再来回切窗口找上下文。放在以前这得靠第三方工具折腾半天，现在官方直接把并行 Agent 工作流的原生基建递到你手里，已经更了桌面版的可以直接去试试，体验提升比预想的大很多。 https://x.com/LLMJunky/status/2070733200846909717/video/1

译Claude Code 桌面版更新，支持原生多会话拖拽分屏，将并行 Agent 工作流可视化。用户可在桌面 App 中开多个会话，左侧侧边栏统一管理，拖拽即可排列并排窗格，支持单独弹出窗口。内置终端、文件编辑器、预览面板均可分屏排布，底部同时显示多个会话的输入区。相比此前依赖 tmux 和终端窗口切换，效率大幅提升。

Berryxia.AI@berryxia · 5天前51

Claude Code用户你知道吗？你每天都在浪费一个功能！90%的都不知道！ Anthropic负责应用AI的负责人，刚做了一场2026年关于Agent记忆管理最实用的演讲（晚点视频我更新到主页）。他叫Lamis。他和那些在前沿构建Agent的初创公司直接合作。他拆解了Anthropic构建Agent记忆系统的完整方法论。四层。每一层解决了前一层的一个致命问题。起点是一个Markdown文件。他们在每次会话开头放一个CLAUDE.md文件，代码库结构。组织信息，个人偏好，纯文本。 Anthropic的评价是"unreasonably effective"。一个简单的文本文件，效果超过了复杂的Prompt工程方案。但文件越来越长，上下文膨胀。会话空间不够。这条路撞墙了。于是他们做了记忆工具。让Agent自己决定什么时候读取、什么时候写入、什么时候更新记忆。全部在带内完成，也就是在会话上下文中进行。让他们意外的是：Agent判断什么值得记住的能力，比人类还强。自主性在这种场景下运作得非常好。第三步是Skills。核心思想是渐进式披露。Agent只看文件顶部几行前言，决定是否需要加载整个文件。 Lamis的比喻很精准，房间里有一个书架。有人跟我说法语，我扫一眼书名，找到法语词典，抽出来读。不需要把七年的法语课都塞进脑子里。第四步最简单。他们把整个记忆系统建模为普通文件系统。Markdown文件。bash，grep。不需要向量数据库。不需要专门的工具。Agent本来就擅长搜索文件。但生产环境暴露了新问题。多个Agent同时写入同一个记忆文件。一个Agent往组织级上下文写入错误信息，所有Agent全部受影响。记忆过时了怎么办。有人通过提示词注入向记忆中写入恶意内容怎么办。 Anthropic设计了四道防线。版本控制，能回滚。基于哈希的并发控制。权限分层，组织级只读，Agent草稿区可写。干净的API保证可移植性。然后是最有意思的部分：做梦。带内记忆有一个根本性局限。 Agent既要完成任务，又要管理记忆。两个竞争性目标。而且Agent只能看到当前会话的信息，识别不了跨会话的模式。做梦是一个带外的异步处理过程。它取一段时间内的所有会话记录，交给一个专门的Agent分析。这个Agent查看记忆存储，识别模式，提出更改建议。就像一个校长审查所有学生的作业。发现每个地理学生都在同一道题上答错。查了课程表，发现整个主题根本没有教。做梦有自己的专用资源，不和任务执行竞争上下文。 Anthropic已经在生产中跑这套系统了。 Agent第二次执行同一个任务时表现更好。成本降低，因为能一次性完成。延迟下降。做梦消耗的额外token，被任务本身的效率提升抵消了。 Lamis最后说了一句话：模型智能本身不会产生复利。它需要上下文来执行你交给它的具体任务。上下文工程的效果是倍增智能，即使模型本身变得更聪明，这个投资依然有价值。这场演讲来自2026年AI DevCon。值得花半小时看看。

译Anthropic 应用 AI 负责人 Lamis 在 2026 年 AI DevCon 上介绍 Claude Code 记忆管理。起点是 CLAUDE.md 纯文本文件，但会上下文膨胀。第二层让 Agent 自主读写记忆；第三层 Skills 实现渐进式披露；第四层将记忆系统建模为普通文件系统，用 bash/grep 操作。生产环境设版本控制、哈希并发控制、权限分层和干净 API 四道防线。核心“做梦”机制是带外异步处理：专用 Agent 分析会话记录、识别模式并建议更改，已投入生产，能降低延迟和成本。

Berryxia.AI@berryxia · 5天前65

周末窝在家里，花半小时学习它吧！别光刷短视频，看下Anthropic的上下文管理的视频！ 2026年AI DevCon上，Anthropic的Lamis做了一场关于上下文工程的演讲。整场演讲浓缩了过去一年Anthropic在上下文管理上的所有实践，从最简单的方案到最前沿的架构。从Claude MD文件开始。一个纯Markdown文件，放在会话开头，告诉Agent代码库结构、组织信息、个人偏好。效果出奇地好：Anthropic的原话是"unreasonably effective"。（效果惊人出奇的好）但问题也明显：文件越来越长，上下文膨胀，管理困难。第二步是记忆工具。让Agent自主决定何时读取、何时写入、何时更新记忆。全部在带内完成，也就是在会话上下文中进行。 Anthropic发现，在这种场景下，自主性运作得非常好。Agent比人类更擅长判断什么值得记住。第三步是Skills。核心思想是渐进式披露。 Agent只看文件顶部几行前言，决定是否需要加载整个文件。 Lamis的比喻很精准：就像房间里有一个书架，每次有人跟我说话，我扫一眼书单，看有没有相关书籍，然后取下来读。不需要提前把所有知识塞进上下文。第四步是文件系统。把记忆系统建模为普通文件系统，用Markdown文件填充，Agent用bash和grep搜索。不需要花哨的向量数据库，不需要专门的工具——Agent本来就擅长操作文件系统。但当这些方案扩展到生产环境，问题就来了。多个Agent同时写入同一个记忆文件怎么办。一个Agent写入错误信息到组织级上下文，所有Agent都会受影响。记忆过时了怎么办。有人通过提示词注入向记忆中写入恶意内容怎么办。 Anthropic给出的解决方案是四个原则：版本控制（能回滚）、并发控制（哈希校验）、权限管理（组织级只读、个人级可写）、可移植性（干净的API，跨系统访问）。然后是最有意思的部分：做梦。带内记忆有一个根本性局限：Agent既要完成任务，又要管理记忆，这是两个竞争性目标。而且Agent只能看到当前会话的信息，无法识别跨会话的模式。做梦是一个带外的异步处理过程。它取一段时间内的所有会话记录，交给一个专门的Agent分析。这个Agent查看记忆存储，识别模式，提出更改建议。比如：所有地理学生都答错了同一个问题:说明课程中缺少了某个主题。所有数学考试的答案都用弧度制而不是角度制,说明工具配置有问题。做梦本质上是一个批量处理的"校长"，审查所有"学生"的作业，发现问题，调整"课程"。它有自己的专用资源，不和任务执行竞争上下文。 Anthropic已经在生产中运行这套系统。他们发现：Agent第二次执行任务时做得更好，成本降低（因为能一次性完成），延迟下降。做梦的额外token消耗被任务本身的效率提升抵消了。最后Lamis说了一句话值得记住：上下文工程是过去一年才真正发展起来的领域。模型智能本身不会产生复利:它需要上下文来执行你交给它的具体任务。而上下文工程的效果是倍增智能，即使模型本身变得更聪明，这个投资依然有价值。

译在2026年AI DevCon上，Anthropic的Lamis介绍了上下文工程演进路径：从纯Markdown的Claude MD文件起步，到记忆工具（Agent自主读写）、Skills（渐进式披露）、文件系统（Markdown + bash/grep搜索）。生产环境中遇到并发写入、权限、注入等问题，通过版本控制、哈希校验、组织级只读/个人可写权限、可移植API解决。最后提出"做梦"——带外异步处理，由专门Agent分析跨会话模式并调整记忆。该机制已投产，可提升任务效率、降低延迟，额外token消耗被效率提升抵消。

宝玉@dotey · 5天前61

现在 Codex/Claude Code 的上下文压缩确实做的挺好了，加上 Prompt Caching，一个 Session 内持续聊没那么大成本压力了。我现在也越来越多的在一个会话内继续任务。另外还有两个配套功能是很好的： 1. fork，就是从某一个对话位置开分支，只保留该对话前面的历史记录，让上下文更纯粹 2. /btw或者/side，在当前会话中提问，通常用于你想起来一件跟当前任务关系不大的事，没必要加入当前上下文中。比如说使用 plan 模式时，你要回答一堆问题，但是这些问题选项说的不是很清楚你也不知道该选什么，这时候最适合用 /btw 让详细解释一下每个选项的意思，甚至还可以让它给你建议。

译@dotey 表示当前 Codex/Claude Code 的上下文压缩已做得很成熟，加上 Prompt Caching，单 session 内持续对话成本不高。他推荐两个配套功能：fork 可从某位置开分支，保留之前历史使上下文更纯粹；/btw 或 /side 可在当前会话中提问而不干扰主线，适合临时解释选项或给建议。引用 @reach_vb 称自 GPT 5.3 Codex 后不再担心上下文，Codex 能压缩并记住关键信息，还支持分支出新线程，这也是 /goal 命令有效的原因。

Nathan Lambert@natolambert · 5天前41

Anthropic's political pressure on distillation is regulatory capture and most of the employees are blind to it under their veil of safety.

译Anthropic 对蒸馏的政治施压是监管捕获，其大多数员工在安全面纱的掩盖下对此视而不见。

meng shao@shao__meng · 5天前49

突然想到一个问题：美国政府可以限制他们国家的 AI 团队（OpenAI 和 Anthropic）按照政府的要求对外提供模型，范围、条件、节奏都可以限制，甚至一刀切。但是对几个月后会追上的中国模型，怎么办？他们大部分都是开源的啊！难道要搞一个美国版 GFW？再限制美国模型托管团队们不能推理中国模型？

译推文作者提出一个矛盾：美国政府可以要求OpenAI、Anthropic等国内AI团队按政府指令限制模型对外提供的范围、条件和节奏，甚至一刀切。但几个月后中国模型就会追上，且大部分是开源的。届时美国要如何应对？难道要搞一个美国版GFW，再限制美国模型托管团队不能推理中国模型？

AYi@AYi_AInotes · 5天前68

所以科技平权从Fable 5开始转折了吗？以后我们普通人还能用到顶级AI大模型的机会吗？感觉Anthropic这条公告官宣的不是模型要恢复，更像是传递一个信号，就是顶级AI全民可用的时代正式结束了😭 最强的网络安全模型Mythos 5，只开放给美国本土的关键基础设施组织。普通人能用的Fable 5，还在等政府审批，遥遥无期。以前是花钱订阅就能摸到人类最顶尖的模型能力，现在是最强的能力，只对特定身份和机构开放，也就是说分层的墙已经立起来了，以后普通用户能拿到的，永远是降过级的公开版本，真正能重构生产力的顶级能力，只会在高墙内流转～

译Anthropic官方公告称，自6月12日起与美国政府合作后，最强网络安全模型Mythos 5已获准重新部署，仅限运营和防御关键基础设施的美国组织使用；普通人可用的Fable 5仍需等待政府审批。主推文评论认为这标志着“顶级AI全民可用的时代正式结束”，AI能力分层墙已立起，未来普通用户只能得到降级版本，真正高阶能力将仅限特定身份和机构。

Chubby♨️@kimmonismus · 5天前59

About 100 organizations got access to fable 5/mythos 5 again. Department of commerce is slowly lifting the embargo for those models. However, I still think that public access will remain heavily restricted—either with significantly stricter guardrails or a lobotomized model.

译Anthropic 宣布，自 6 月 12 日起与美国政府密切合作后，其最强网络安全模型 Mythos 5 已获商务部通知，可重新部署给一批运营和防御关键基础设施的美国组织。约 100 家组织获得访问权限。Anthropic 正加快恢复这些组织的使用，并继续与政府协商扩大 Mythos 5 的访问范围，以及让 Fable 5 重新开放通用使用。评论认为，公众访问仍将面临严格限制或模型阉割。

宝玉@dotey · 5天前75

Anthropic 的 Mythos 5 被美国政府封禁两周后，今天拿到了部分解禁令。商务部长 Howard Lutnick 致信 Anthropic，批准约 100 家美国政府机构和关键基础设施企业重新使用 Mythos 5。这是 6 月 12 日全面封禁以来的第一次松动，但只是部分松动，面向普通用户的 Fable 5 仍然处于下线状态。先说前因。6 月 9 日 Anthropic 同时发布了两个模型：Fable 5 面向公众开放，Mythos 5 则限定给 Project Glasswing 合作伙伴用于网络安全防御。两者其实是同一个底层模型，区别在于 Fable 5 加了一层安全护栏，遇到网络攻击、生化等敏感话题会自动降级到 Opus 4.8 回答；Mythos 5 把这些限制放开了，专门给防御端用。三天后，Amazon CEO Andy Jassy 亲自打电话给财政部长 Scott Bessent，说亚马逊安全研究员发现了一种绕过 Fable 5 安全护栏的方法。当晚，商务部长 Lutnick 向 Anthropic 发出正式出口管制指令，要求禁止所有外国公民访问这两个模型，不遵守可能面临刑事和民事处罚。由于 Anthropic 无法实时验证用户国籍，只能对所有用户一刀切下线。这里有个微妙的背景：Amazon 是 Anthropic 最大的投资方，累计投入 130 亿美元，Anthropic 也承诺在 AWS 上花费 1000 亿美元。投资人亲手引爆了被投公司最重要产品的下架，这在硅谷历史上相当罕见。同时也有人开始关注商务部长 Lutnick 与 OpenAI（Anthropic 的直接竞争对手）的财务关系。 Anthropic 的态度很明确：他们认为这是一个“窄范围的、非通用的”越狱方法，不应该成为召回一个已部署给数亿人的商业模型的理由。如果同样的标准应用于整个行业，所有前沿模型的部署都得停。过去两周，Anthropic 派出了顶级科学家和工程师团队到华盛顿，与商务部和国家网络安全主管办公室每天进行会谈。今天的结果算是初步成果。回到今天的新闻。新指令允许 Mythos 5 向运营和保护关键基础设施的美国机构重新开放，而且这次有一个重要变化：这些机构的非美国籍员工也被授权使用，Anthropic 自己的非美国籍员工也包含在内。这比 6 月 12 日那个“一刀切禁止所有外国公民”的指令灵活了不少。据知情人士透露，Anthropic 将在本周末继续与政府讨论 Fable 5 的恢复问题。但 Fable 5 何时能回归，目前没有时间表。对普通用户来说，Claude 最强模型仍然不可用，只能继续用 Opus 4.8。 Mythos 5 当初被封禁，理由是它的网络攻击能力太强、有被滥用的风险。现在被优先解禁的用途，恰恰是网络安全防御。一个模型因为太危险而被下架，又因为太有用而被请回来。而就在同一天，OpenAI 的 GPT 5.6 也在走类似的路径，由政府逐客户审批后才能使用。美国政府对前沿 AI 模型的发布前审查，正在从个案变成惯例。

译Anthropic 的 Mythos 5 在 6 月 12 日被美国政府全面封禁两周后，今天获部分解禁。约 100 家运营和保护关键基础设施的美国机构可重新使用，非美国籍员工也被授权。面向公众的 Fable 5 仍处于下线状态，恢复无时间表。此前 Fable 5 因被 Amazon 安全研究员发现可绕过安全护栏，导致两个模型禁止外国公民访问；Amazon 是 Anthropic 最大投资方（累计 130 亿美元）。

Rohan Paul@rohanpaul_ai · 5天前57

The U.S. just reopened Anthropic’s Claude Mythos 5 for more than 100 approved institutions. More than 100 ‌companies and institutions will now have access to Mythos 5, incluing many Fortune 500 companies “I have determined that appropriate safeguards are in place to permit certain trusted partners to access the Claude Mythos 5 Model,” Commerce Secretary Howard Lutnick wrote to Anthropic’s chief compute officer Tom Brown on Friday. The exact Annex A list has not been made public. The earlier Project Glasswing’s public founding group included AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks, but that public partner list is not the same thing as the secret Annex A list. The government seems to be prioritizing institutions where defensive upside is highest and misuse risk is easier to manage: cloud providers, chip companies, operating-system vendors, security firms, banks, infrastructure operators, and federal agencies. Semafor reports.

译美国政府重新批准超过100家公司和机构（含多家财富500强）使用Anthropic的Claude Mythos 5模型。商务部长Howard Lutnick周五致信Anthropic首席计算官Tom Brown，确认已部署适当安全保障。获批机构的具体名单（Annex A）尚未公开。此前Project Glasswing的公开合作方包括AWS、Apple、Broadcom等，但该名单与秘密的Annex A清单不同。政府优先面向云提供商、芯片公司、操作系统厂商、安全公司、银行、基础设施运营商和联邦机构等防御价值高、滥用风险易管理的机构。（据Semafor报道）

Anthropic@AnthropicAI · 6天前55

Since June 12, we’ve been working closely with the US government to restore access to Claude Mythos 5 and Fable 5. Today, the government notified us that Mythos 5, our strongest cybersecurity model, can be redeployed to a set of US organizations that operate and defend critical infrastructure. We’re restoring access for these organizations quickly, and we’re continuing to work with the government to expand access to Mythos 5 and make Fable 5 available for general use again.

译自6月12日以来，我们一直与美国政府密切合作，以恢复对Claude Mythos 5和Fable 5的访问。今天，政府通知我们，我们最强大的网络安全模型Mythos 5可以重新部署给一组运营和防御关键基础设施的美国组织。我们正在迅速恢复对这些组织的访问，并将继续与政府合作，扩大Mythos 5的访问范围，并让Fable 5再次可供通用。

Chubby♨️@kimmonismus · 6天前79

Anthropic is reportedly moving toward a deal with the US to lift restrictions on Fable 5 and Mythos 5. Bloomberg says Commerce Secretary Howard Lutnick is making progress on security concerns that triggered the curbs, according to people familiar with the talks.

译Anthropic 正推进与美国政府的协议，以解除对模型 Fable 5 和 Mythos 5 的限制。据彭博社报道，美国商务部长 Howard Lutnick 正就触发出口管制的安全担忧取得进展。有评论认为，这将意味着公众获取前沿智能时代的终结——不仅美国以外地区无法再访问前沿模型，就连美国境内的访问也将受限。

Rohan Paul@rohanpaul_ai · 6天前70

Claude’s new usage logs now read like an early sensor for how AI is entering work. Anthropic just published its new “Cadences” report review, anonymized conversations from almost 10,000 Claude users. - Personal prompts rises from 35% on weekdays to nearly 50% on weekends. - Recipe requests peak at 6pm and become 2.3x more common than average. - News prompts peak at 7am, while business emails peak around 10-11am. - Sleep advice clusters before dawn, with people most often seeking it around 3-5am. - Tax requests in the US spiked 8x right before the filing deadline, then collapsed almost immediately. - Weekend Claude Code work shifts away from backend architecture and API debugging toward AI agent design, quant trading, and gaming. - Work done through Claude at nights and weekends skews toward higher-wage occupations, not lower-wage clerical tasks. - Claude now produces a clear output in 93% of chat and Cowork conversations. - The most common Claude outputs are explanations 17%, documents/reports 15%, and guidance 11%. - Marketing content, blogs, and database queries are among the most work-heavy outputs, each around 80%+ work-related. - Creative writing, guidance, and recipes are mostly personal, each above 80% personal use. - Work conversations most often produce documents/reports 20%, while personal conversations most often produce explanations 25% and recommendations 22%. - Higher-wage work burns more compute, with top-wage occupation conversations using about 2.07x as many tokens as bottom-wage ones. - App-building conversations use more than 3x the median tokens, while basic explanations use about 1/5 of the median.

译Anthropic发布“Cadences”报告，分析近1万名Claude用户的匿名对话。个人提示词周末从35%升至近50%；食谱请求下午6点达峰值（平均值2.3倍）；新闻早7点峰值；商务邮件集中在10-11点；睡眠建议凌晨3-5点；美国税务请求在申报截止日前飙升8倍后骤降。周末Claude Code工作从后端转向AI agent设计、量化交易和游戏。93%对话产生清晰输出，最常见为解释（17%）、文档/报告（15%）和指导（11%）。高薪职业对话所用token数是低薪职业的约2.07倍。

DogeDesigner@cb_doge · 6天前49

Chamath Palihapitiya was asked to choose between keeping free shares in OpenAI, Anthropic or SpaceX, on the Axios Show. He chose SpaceX. He said the world’s communications infrastructure is overdue for a major overhaul, Starlink is positioned to capture a huge share of that transition, and what may sound like sci-fi today, building the same kinds of businesses beyond Earth, gives SpaceX enormous long-term optionality.

译Chamath Palihapitiya 在 Axios Show 上被问到要在 OpenAI、Anthropic 或 SpaceX 的免费股份中保留哪一个。他选择了 SpaceX。他说世界通信基础设施早就需要一次重大升级，Starlink 定位于抓住这一转变的巨大份额，而今天听起来像科幻的事——在地球之外建立同样的业务——给了 SpaceX 巨大的长期选择空间。

Anthropic@AnthropicAI · 6天前60

To keep pace with AI progress, we're advancing how we study Claude's economic impact. Hourly sampling and survey data show us how the cadences of life shape usage, what people produce with Claude, and how perceptions of AI's impact may be changing. https://www.anthropic.com/research/economic-index-june-2026-report

译为跟上AI进步的步伐，我们正在推进研究Claude经济影响的方式。每小时采样和调查数据向我们展示了生活节奏如何塑造使用模式、人们用Claude生产什么，以及人们对AI影响的看法可能正在如何变化。https://www.anthropic.com/research/economic-index-june-2026-report

Rohan Paul@rohanpaul_ai · 6天前55

The Information reports that Anthropic’s Mythos preview spooked DeepSeek into fundraising. Because, CEO Liang Wenfeng realized it needed a much bigger cash pile to compete. And DeepSeek now plans to double its workforce too. AI competition is no longer just about model cleverness; it is about compute reserves, talent density, secure infrastructure, product surface area, and enough cash to survive several failed training runs. --- theinformation .com/articles/anthropics-mythos-spooked-deepseek-prompting-7-4-billion-fundraising

译The Information报道，Anthropic的Mythos预览版让DeepSeek感到震惊，CEO梁文峰意识到需要更大现金储备来竞争。DeepSeek随即启动74亿美元融资，并计划将所有部门员工数量翻倍，招聘覆盖AI核心研发、算法、深度学习、全栈开发和产品岗位，表明DeepSeek正从仅调模型转向构建完整系统。AI竞争已变为计算储备、人才密度、基础设施、产品表面积和现金储备的综合较量。

Chubby♨️@kimmonismus · 6天前72

If you look at the documents again closely, it was never the US government's goal to take down Fable 5. It was simply a consequence of the impossibility of doing so for Anthropic. (i am re-reading the documents atm to get a even better picture of where we are heading) The single most load-bearing document in the whole affair is the letter itself, obtained in full by Bloomberg four days after it landed. It is on Commerce Department letterhead, addressed to Anthropic CEO Dario Amodei, and its operative sentence is dry to the point of menace: a license is now required for the "export, reexport, or transfer (in-country)" of Fable 5 and Mythos 5 "to all destinations worldwide and to all 'foreign persons' ... wherever located." It threatens "prompt criminal and civil penalties" and stays in force "until superseded by a subsequent letter." Read that closely, because the whole kill-switch question lives inside it. The license attaches to foreign persons, not to Americans. The letter never orders Anthropic to take the models offline for US citizens. But American export law has a peculiar reach. Under a Cold War provision known as a deemed export, the moment you give a foreign national access to controlled technology, even on US soil, the law treats it as an export to that person's home country. A German engineer logging into Fable 5 from an office in California becomes, on paper, an export to Germany. And because Anthropic cannot sort its users by passport in real time across an API, the only way to comply with a rule aimed at every foreign person on the planet was to pull the plug for everyone. A rule that targeted only foreign users could be obeyed only by taking the model away from all of them. Therefore, the question arises whether the US government may have acted too shortsightedly and is now itself, also with Anthropic, looking for a solution that simultaneously secures its national supremacy.

译美国商务部致函Anthropic，要求对Fable 5与Mythos 5的“出口、再出口或国内转让”实施全球许可证管制，对象涵盖所有外国人士。基于冷战时期“视为出口”条款，外国人在美境内访问受控技术也被视为向国籍国出口。因Anthropic无法通过API实时区分用户国籍，唯一合规方案是向所有用户关闭模型。推主质疑美国政府短视，目前正与Anthropic寻求兼顾国家安全与可用的解决方案。引用认为，能力封锁将使开源模型更具吸引力，尤其利好中国开源，OpenAI与Anthropic将因此受损。

Chubby♨️@kimmonismus · 6天前61

I think many people are not yet aware of the tectonic shift taking place. By preventing state-of-the-art capabilities - at least insofar as we are able to use them - open source becomes not only more attractive for one’s own applications, but more attractive overall. This also applies, for example, to entire states such as the European Union, provided it genuinely sets itself the goal of implementing AI. This not only attracts new investments and fresh capital, but also creates an opportunity for a PR coup that draws many idealists toward open-source companies. In that sense, I can well imagine that companies like OpenAI and Anthropic will suffer the most from regulation by the U.S. government - on two fronts - and that this indirectly does open source a tremendous service, especially given that it now primarily comes from China.

译Kim 指出，美国政府限制前沿AI能力（阻止SOTA被使用），反而让开源模型更吸引自有应用和整体市场，欧盟等国家也可受益。这吸引新投资与理想主义人才，OpenAI 和 Anthropic 将最受监管反噬，间接助推开源（尤其来自中国）。引用称，Anthropic 4月预览 Mythos 后，DeepSeek 因无法竞争而融资74亿美元；此前该实验室靠 CEO 梁文锋个人财富，现有约300人，计划至少翻倍。

Chubby♨️@kimmonismus · 6天前68

Anthropic’s Mythos preview reportedly pushed DeepSeek into a $7.4B fundraising - because they could not compete with Mythos. Until now, the three-year-old Chinese AI lab had relied on CEO Liang Wenfeng’s personal wealth instead of outside capital. The Information reports the shift came after Anthropic’s April preview of Mythos. Liang reportedly concluded DeepSeek could not compete at that level without a much larger war chest. DeepSeek is now expanding aggressively: around 300 employees today, with plans to "at least double" headcount across AI systems, infrastructure, product and research.

译Anthropic 4 月预览的 Mythos 模型据报迫使 DeepSeek 转向外部融资，筹集 74 亿美元。此前这家成立三年的中国 AI 实验室一直靠 CEO 梁文锋个人财富运营。The Information 报道称，梁文锋认为 DeepSeek 若无更大资金储备将无法与 Mythos 竞争。DeepSeek 正激进扩张：当前约 300 名员工，计划在 AI 系统、基础设施、产品和研究部门至少翻倍人员规模。

数字生命卡兹克@Khazix0918 · 6天前57

http://x.com/i/article/2070397668338307072 # 想玩Loop Engineering，可以先从这6个Hook玩法开始。最近Loop Engineering越来越火了，也有好几个朋友问我，这个东西怎么入手，我们到底应该开始从哪设计一个loop。这其实是一个非常有意思的问题，如果让我真说一个东西的话，我觉得是我之前文章中反复提到的一个东西。 Hook。每一个Agent里，几乎都会有Hook这个东西，Claude Code和Codex的自动化里面，背后也都有。 loop的意思是循环，那我们任何循环，其实都有一个最基础最初始的东西，就是触发器，也就是如果你触发了某某动作，就会去执行某某命令。其实非常像现在我们家里的一些智能家居，比如到了10点，窗帘就拉开，比如识别到我出门了，就关闭家里的所有的灯，等等等等。这个触发的条件，就是一个Hook。生活中也到处都是Hook，比如到公司，手机自动切换工作模式，早上闹钟到点了自己响，这些全是Hook。 Agent里面也是如此，你可以通过给Hook设置特定的规则，自动化做很多事。比如让AI在编辑修改文件前，先检查命令有没有风险。代码修改完毕后，自动跑lint检查质量。以及跑长任务的时候，你切到别的页面干别的事，它干完了发推送告诉你。当然，Hook的用途远远不止这些。在Claude Code里也一样，把Hook挂在那些你关心的时刻上，提前设好什么时候做什么。事情一来，它自己跑。你不在屏幕前也没关系。打开Claude Code，在底部输入/hooks，会看到这个界面。按下回车后，他会列出所有可用的Hook事件。我记得年初看的时候只有13个，现在有将近30个了，翻了一倍多。不过别被30这个数字吓到，我们日常真正会用到的，可能也就常用的那几个。这里，我也整理了6个我觉得比较好玩的Hook玩法，希望抛砖引玉，能够给大家一些思路。一、权限弹窗提醒可能很多朋友都遇到过这种情况，不敢给Claude Code下了指令，切到别的页面干别的事。过了十分钟回来一看，还没开始执行，卡在了请求权限那一步。其实只需要跟Claude Code说一句：帮我配置一个通知的Hook，每当需要我授权的时候，调用系统自带的工具给我来一个弹窗提醒。发送给AI，它就会帮你配好了。配完之后可以让Claude Code测试一下。当需要授权时，右上角会弹出提醒。再也不怕切走窗口时，AI卡住不动，白白浪费时间了。这时候肯定会有人问了，那每次授予权限的时候都会弹窗提醒，那岂不是很浪费Token。绝大多数的Hook，其实本质上就是个代码，是个写死的规则脚本，所以在运行的时候，跟AI几乎没有啥关系，所以是也不会耗啥Token。当然，Hook能做的远不止弹个窗，还有其他我觉得更好玩、更有意思的。二、开机日程播报我们有时候打开Claude Code，看到的就是一个冰冷冷的界面，不好玩。那你就可以在对话框输入：帮我创建一个会话开始的Hook，每当我打开或恢复Claude Code的时候，输出一段元气满满的问候。告诉我北京朝阳区今天的天气，会不会下雨、要不要带伞，调用飞书CLI拉出当天的日程安排，内容要有趣一些。重新打开Claude Code后，就自动弹出这个提醒，让原本枯燥的终端，多了点温度。当然这只是一个前菜。接下来这个，是我觉得最实用的一个。三、摘要卡片有天晚上，我想找Claude Code上周帮我改的一个方案，翻了半小时聊天记录，没找到。。。我坐在那想了很久，我那天到底让它帮我干了什么，想了好久也没想起来。因为每天我的Agent用的太碎了，我手上起码现在有4～5个是我长期在迭代的项目，有的时候经常会并行跑，甚至AIHOT这样的大型一点的项目，有时候是开着分支就并行着三个。所以我经常就是确认完你的确认你的，来来回回，化身Agent鸡排哥，一天下来，你自己甚至都不一定记得今天到底发生了什么。。。而且很多真正有价值的结论，都藏在那些长对话里，一旦上下文被压缩了，或者我一个/claer命令，后面再想找，就非常痛苦。所以我做了个Hook，直接把这段话发给他：帮我编写一个Hook，当上下文处于预压缩时，生成一张摘要卡片，记录当前上下文的概要内容，方便我后续查看，将文件保存到一个跨项目也可以查看的地方，总结完毕后打印到Claude Code中，方便我查看。之后，在上下文快被压缩、还没丢掉的时候，他就会赶紧生成一张摘要卡片。这玩意的意义还是很大的，聊天记录太长，回看成本极高，你根本不想翻。但这不一样，它其实是一张AI替你写的工作日记。以后你想找某天做过什么，不用翻几万字对话，翻这些看就行，一两分钟就能看完一天到底干了些啥。能大大释放你的脑容量空间，非常好用，甚至还可以再加一个定时Hook，比如，每周五的时候，再把这些摘要日记，自动写成一个周报。这个价值，你懂的。四、文件自动整理还有一个Hook的玩法我自己特别喜欢。就是前段时间的时候，我整理电脑里面的下载文件夹，那玩意贼乱，截图、文档、PDF全混在一起，每次找东西都得翻半天。然后我突然想到，为啥不让Claude Code帮我干这事呢，我自己每次手动整理，也太蠢了。所以，我就做了一个Hook，逻辑特别简单，指定一个文件夹，每次有新文件丢进来，它自己看一下这是什么、内容是什么，然后自动重命名，再挪到该去的地方。不过文件整理这件事，光靠简单代码搞不定，所以这里用了一个组合技，Hook+Skill。 Claude自己有个比喻我觉得特别准，Hook是门铃，Skill是开门以后真正干活的人。门铃响了，说明有新东西来了。但来了以后怎么处理，还是需要模型的能力的，比如识别文件内容、判断归哪一类、按规则重命名、挪到对应文件夹等等等等，这些，靠的还是Skill最方便。 Skill也非常简单，你直接用嘴让Claude Code给你写就行了，因为每个人的需求不一样，所以还是写一个自己的是最好的。这个Hook设置好以后，你只需要不关Claude Code，然后呢，它就会在后台帮你悄悄盯着那个文件夹。但凡有一个新文件进来，等几秒确认传完了，它就开始干活，然后帮你自动处理完。不管是PDF还是图片，它都能自己识别内容，会议纪要归到会议那栏，发票归到报销那栏，截图还会按内容起个看得懂的名字，然后帮你挪到对应文件夹。整个过程你什么都不需要做，你只需要把文件丢进去了，然后它自己就整理好了。这种感觉太爽了。你想想看，这个模式不只我这种乱七八糟的下载文件夹整理能用。盯着比如工作项目文件夹也行，新文件按客户名和日期自动重命名等等，有很多种自动化的变体玩法，很有意思。五、久坐提醒 AI替你干活的感觉很爽，但他有一个副作用，就是太爽了，一坐就是十几个小时。上周有一天，我早上九点打开Claude Code，想修一个小功能。等我再抬头，下午四点了。我那一刻，真的感觉回到了我十年前在学校打《文明6》的感觉。然后我发现，这事不是我一个人，很多用AI写代码的人都这样。以前沉迷打游戏，现在沉迷Vibe Coding。所以，我当时就想，做一个久坐提醒的小东西，虽然Apple Watch也有久坐提醒，每隔一小时提醒一次，但在Vibe Coding上头的时候，有的时候不太感受的到。所以，既然长期坐在电脑前，直接在电脑上推送不就行了。于是简单描述了一下需求，只要我启动了Claude Code之后，只要过了一个小时，Claude Code就会给我发通知提醒我休息了。健康还是很重要的，身体才是革命的本钱，Vibe Coding上头的时候，你根本想不起来需要站起来活动，有这么一个小提醒，还是很管用的。这里也提醒大家，让AI帮你提高效率的同时，也要多多保重身体，坐久了就起来活动一下。后面我其实还想做一个硬件，就是更加强制性的那种。。。比如，直接给我把键盘关了之类的，强制站起来= = 六、长任务完成推送然后还有一个我自己的刚需。昨天去录了一趟严敏老师的综艺，在开始之前，我让Claude Code帮我做一个比较大的功能，场上要用的，而且还有点急的那种。我坐在电脑前，看着它一行一行地出结果，看起来一切正常，就忙别的事去了，十几分钟后突然想起来，不知道有没有开发完，然后就回到电脑前一看，还在跑。来来回回折腾好几轮。我就想，得让它干完活了直接叫我。于是让Claude Code帮我研究了一下，看看有没有什么办法能让它干完活了通知我一下，最好是可以和常用软件提示音区分开的。然后它就跟我说了Bark。这是AppStore直接下载就能用的推送工具。免费，也不需要注册，装完给你一个推送用的链接，让AI帮你配置进去就行了。于是我顺手让Claude Code帮我写了个调用Bark的Hook。这下就舒服了。手机和手表同时收到消息，还可以自定义推送声音，跟微信、飞书、短信这些区分开，一听声音就知道AI干完活了，可以切回去查看成果了，而且还是中文。这个体验真的很爽。有了这个，你就可以放心离开电脑去干别的事，根本不用惦记着切回来瞄一眼。这个玩法也特别容易扩展，比如任务成功了发个轻松的提示音，任务失败了发个明显的提示音，让你知道要回去看看。需要输入的时候，推送里直接写清楚它在等什么。写在最后未来越来越多的AI工作流，我觉得一定是事件驱动的。新的一天开始了，它帮你启动，文件出现了，它去处理，上下文快满了，它先归档，任务完成了，它来通知，一天结束了，它自动总结。包括现在Github上，很多项目是用Agent监控问题，别人提出了问题，它就调用Agent自动去修，修完了自动推送，推送完自动回复。。。这件事一点都不玄乎，就是让AI从一个被动聊天框，慢慢变成你工作生活的一部分。当然，我也不建议大家一上来就搞得太复杂。 Hook一旦开始接入真实工作流，就一定要注意稳定性和边界，尤其是涉及文件移动、删除、重命名、填表这种动作，别一上来就让它在你的重要文件夹里横冲直撞。但只要你把边界设计好，它真的会非常好用。 Prompt解决的，是一次对话。 Skill解决的，是一类能力。 Hook解决的，是一个时刻。从对话，到能力，到时刻，再到循环。 AI越来越成为一个替你运转的系统。让你有时间，去做更有趣的事情。

译Hook 是 Claude Code 的触发器，可设置规则让 AI 自动执行操作，不消耗 Token。当前支持近 30 个 Hook 事件。文章整理了 6 种玩法：权限弹窗提醒、开机日程播报、上下文预压缩时生成摘要卡片、文件自动整理、久坐提醒，以及通过 Bark 工具向手机发送完成推送。

Ethan Mollick@emollick · 6天前70

If you want to read an interesting AI thinking trace, try "I want you to suggest two poems that you think apply very well to the current state of GenAI models like you. Don’t just pick popular poems and back justify. Think hard about options first" in either GLM-5.2 or Opus 4.8

译如果你想看一个有趣的AI思考轨迹，可以试试在GLM-5.2或Opus 4.8中输入："我希望你推荐两首你认为非常适合描述像你这样的GenAI模型当前状态的诗歌。不要只是选流行的诗然后反向证明。先仔细考虑选项。"

Berryxia.AI@berryxia · 6天前63

Anthropic又扔了个新东西，叫Claude Tag。他们把它描述成Agent的下一个进化形态：一个带记忆、有身份、能主动行动的多玩家Agent，跑在Claude Code之上。意思是它不再是单人聊天工具，而是能和团队一起协作、记住上下文、主动推进任务的Agent。从描述来看，它的核心是把Agent从“工具”变成了“团队成员”，有持续记忆、能多人在同一个Agent上协作，还能主动发起行动。这和之前很多单轮或单用户Agent的玩法明显不一样。不过评论区现在基本没人关心这个，大家都在狂催Fable 5。可见大家对真正强力的长期agent还是更期待。

译Anthropic发布新产品Claude Tag，将其描述为Agent的下一个进化形态。这是一个运行在Claude Code之上的多玩家AI Agent，具备持续记忆和身份，能主动发起行动并与团队协作。核心转变是将Agent从单人聊天工具变为拥有长期上下文的“团队成员”，支持多人在同一Agent上协同工作。此前大多数Agent仅支持单轮或单用户交互。不过，评论区热议焦点并非该产品，而是用户更期待的Fable 5。

歸藏(guizang.ai)@op7418 · 7天前69

Anthropic 还是达成了他的目的。现在 OpenAI 的模型已经不能按照之前的模式正常推送上线了，需要先向一些符合要求的 B 端客户提供，并由美国政府对这些客户逐一审核。结合这几天 GLM 5.2 以及 Anthropic 指控中国公司“蒸馏”他们模型的事来看，我们已经无法将模型发布当作正常的互联网产品上线逻辑来看待了。它已经变成了类似军民两用的战略资产、网络武器这一等级的管控。谁能使用先进模型，已经变成了一个国家安全问题。这会带来以下几个变化：发布节奏放缓整个头部模型的发布节奏会严重放缓。虽然模型训练的节奏会变慢，但拿到许可的公司会先利用先进模型和头部模型建立竞争优势。准入门槛提高先进模型的使用准入会变得更加严格，需要经过更严苛的审查，尤其是美国的这些模型。美国模型公司的护城河正在从“模型是否最强”转变为“是否能被安全、合法且持续地交付”，这变成了一种准入机制，还要看与政府的关系。但这也会造成一个问题：如果我们中国的开源模型继续变强，可能会吃掉更多美国模型的订单。不过，不排除美国会对中国开源模型进行更严格的监管和控制（例如禁止美国公司使用中国开源模型），但这种做法是否能管得住，还需要再看。

译Anthropic 达成目的后，OpenAI 模型无法按原有模式上线，需先向特定 B 端客户提供并由美国政府逐客户审核。结合 GLM 5.2 发布及 Anthropic 指控中国公司“蒸馏”其模型，推文认为模型发布已类似军民两用战略资产，导致头部模型发布节奏放缓、使用准入更严。美国模型公司护城河从“模型最强”转向“能否安全合法持续交付”。中国开源若持续变强可能吃掉美国订单，但美国可能进一步限制中国开源模型使用。

meng shao@shao__meng · 7天前56

Snowflake CEO @RamaswmySridhar 做了一个深度实验，对比 GLM vs Opus 成本，发现 GLM token 消耗是 Opus 的 2 倍？先看看实验设计 · 任务集：103 个 dbt 任务，每模型跑 3 轮，同一 harness、同一任务集——变量控制扎实 · 原始 token：GLM 860M vs Opus 439M，约 2× 差距 Token 差距的三个原因 · 平均轮次/轮：99 vs. 80，多轮 = 每轮重发全量上下文，token 按轮次线性放大 · 工具调用粒度：一次一查 vs. SQL批量，原子化调用产生大量重复上下文回传 · 缓存命中率：53% vs. 96%，缓存未命中部分按全价计费，是成本杠杆最大的一环关键洞察：尾部效应而非整体劣化 · 两个模型都能解决的任务上，GLM 只多用约 17% 的调用，远不到 2× · 2× 的差距几乎全部来自尾部失败案例：GLM 在某些任务上陷入 400+ 次调用的"螺旋失败" · 这说明 token 消耗是重尾分布：少数失控任务主导了整体均值。这同时也意味着——GLM 的稳定性/收敛性是比"单价"更值得关注的实际问题成本重算的方法论作者把两者统一归一化到 90% 缓存命中率后比较： · GLM-5.2 (Fireworks)：$1.12/session · Opus-4.7 (Anthropic)：$2.14/session · → GLM 便宜约 48% 可以借鉴的三个点 · 指标要分层：token 量、调用次数、单价、缓存率、稳定性是五条独立的轴，混为一谈会得出错误结论 · 尾部决定均值：在 agentic 场景，少数失控会话主导成本与体验，优化应优先砍尾部而非压单价 · harness 即杠杆：缓存率、批量化、轮次控制都受调用框架影响——同一模型换个 harness，经济性可数量级变化。结尾的 coco harness 预告正是这个论点的延续。

译Snowflake CEO 用 103 个 dbt 任务×3 轮对比 GLM 与 Opus 成本。原始 token：GLM 860M、Opus 439M（约 2 倍）。原因包括平均轮次多（99 vs 80）、工具调用粒度细、缓存命中率低（53% vs 96%）。差异几乎全部来自尾部失败案例（少数任务 400+ 次调用）。归一化至 90% 缓存率后，GLM 每 session $1.12，Opus $2.14，GLM 便宜约 48%。建议：分层考量 token 量、调用次数、单价、缓存率、稳定性；优先削减尾部失控会话；同一模型换 harness 经济性可数量级变化。

elvis@omarsar0 · 7天前72

Highly-recommended reading. "Rent the intelligence, but own the context". It's roughly the same framework for how I combine frontier models (reasoning and intelligence) and open models (context/knowledge and verifiers/judges).

译Elvis Saravia（DAIR.AI）推荐ashwingop分析称，Claude Tag看似让Slack中的AI像队友般流畅协作，实则导致“上下文锁定”——公司运营记忆（异常处理、客户承诺、失败经验等）被锁定在Anthropic的agent层。模型可换，上下文几乎无法迁移；基于token的无限计价使劳动支出被单一供应商捕获。正确架构：从任何顶级模型供应商租用智能，但企业自己拥有可检查、有权限、可移植、模型中立的上下文层。

Thariq@trq212 · 7天前33

I'll be talking more about Claude Tag with @petergyang and at AIE with @_catwu. Let me know if there's anything you'd like us to dive into more!

译我将与 @petergyang 以及在 AIE 上与 @_catwu 进一步讨论 Claude Tag。如果你们有任何希望我们深入探讨的内容，请告诉我！

ClaudeDevs@ClaudeDevs · 7天前72

Claude Tag is the next evolution of agents. It's a proactive, multiplayer agent with memory and identity, built on top of Claude Code. Learn more about how Claude Tag works and best practices for using it in this deep dive.

译Claude Tag 是智能体的下一代进化。它是一个主动的、多玩家智能体，具有记忆和身份，构建在 Claude Code 之上。了解更多关于 Claude Tag 的工作原理以及使用它的最佳实践，请看这篇深度解析。

Rohan Paul@rohanpaul_ai · 7天前53

FT published a piece "Anthropic has hired an economist with . . . interesting views on human survival" Stanford economist Chad Jones joined Antropic this week. Jones’s NBER paper showed advanced AI as a bargain between faster invention and existential risk, where smarter systems may speed research but may also create losses no economy can repair. His most jarring result uses log utility, an economics assumption that each extra gain helps less than the last, and says a 1% annual extinction risk for 40 years leaves about a 67% survival chance.

译Anthropic 本周雇佣了斯坦福经济学家 Chad Jones。Jones 的 NBER 论文认为，先进 AI 是加速发明与存在风险之间的权衡——更智能的系统可能加速研究，但也可能造成经济无法修复的损失。其结论使用对数效用假设：若每年有 1% 的灭绝风险持续 40 年，生存概率约为 67%。

Anthropic@AnthropicAI · 7天前51

We're joining @raiseus_ai as a founding partner. RAISE US is a nonprofit coalition working to strengthen the American workforce through employer-led action, AI-enabled training, and policy innovation to support the transition to transformative AI.

译Anthropic宣布以创始合伙人身份加入RAISE US。RAISE US是一个非营利联盟，由前美国商务部长Gina Raimondo和Eric Holcomb共同主持，联合各州州长、雇主和教育工作者，旨在通过雇主主导行动、AI赋能培训和政策创新，为美国劳动力建立AI时代的 "人员战略"，帮助工人培训、转型和发展，弥补当前仅有AI技术战略、缺乏人员战略的空白。

Chubby♨️@kimmonismus · 7天前58

Google is reorganizing its AI coding strike team as it tries to close the gap with Anthropic in one of the most lucrative parts of the AI market. According to The Information, the months-old team is being expanded into a more formal "midtraining" group, sitting between pretraining and post-training. The goal is to improve Gemini’s coding abilities and extend those capabilities into broader business tasks like creating presentations. The move comes after two major departures: Noam Shazeer reportedly left for OpenAI after changes to his compute access, while Nobel Prize winner John Jumper is heading to Anthropic after recently being moved to the coding strike team. That is the uncomfortable part for Google. It has world-class researchers, TPUs, Gemini, Cloud, Search, YouTube, and enormous distribution. But coding has become the clearest monetization layer in AI, and Anthropic has turned that advantage into massive revenue momentum. Google now seems to be admitting that strong base models alone are not enough. Coding needs specialized training, dedicated compute, and a much sharper product loop.

译Google将数月前成立的AI编码突击队扩展为更正式的“midtraining”小组，位于预训练与后训练之间，旨在提升Gemini编码能力，并延伸至创建演示文稿等商业任务。此前，Noam Shazeer因计算资源变动转投OpenAI，Nobel奖得主John Jumper调至该团队后也加入Anthropic。Google承认，仅靠强大基座模型不够，编码已成为AI最明确的变现层，Anthropic借此获得收入增长。Google需专用训练、计算和更清晰的产品闭环来追赶。

Chubby♨️@kimmonismus · 7天前52

This is an interesting update that Anthropic published in their official letter. But in short: they apparently haven't managed to stop the distillation. It's continuing almost seamlessly, just like before.

译这是Anthropic在其官方信件中发布的一个有趣更新。但简而言之：他们显然未能阻止蒸馏。蒸馏几乎无缝地继续，和以前一样。

Nathan Lambert@natolambert · 7天前40

The AI companies should be presenting this much more as transparent data of what is happening over time and much less like smear campaigns with strong policy wishes. Just comes across as so self serving, hard to want to support them.

译AI 公司应该更多地以随时间推移的透明数据来呈现这些内容，而不是像带有强烈政策意图的抹黑运动。这让人感觉非常自私自利，很难让人愿意支持它们。

🚨 AI News | TestingCatalog@testingcatalog · 7天前36

Anthropic is serving “0 traffic” to Fable 5 at this moment despite the earlier reports, as confirmed by Sam. But we still all waiting patiently 👀

译Anthropic 目前向 Fable 5 提供的“流量为 0”，尽管此前已有相关报道，据 Sam 证实。但我们仍在耐心等待 👀

Chubby♨️@kimmonismus · 7天前41

Correction: Anthropic states that the apparent access to Fable 5 is likely attributable to a UI bug.

译更正：Anthropic 表示，Fable 5 的可见访问很可能是由 UI bug 导致的。

Chubby♨️@kimmonismus · 7天前49

It looks like Fable 5 might actually be available again. But there's a major caveat: It's only for US citizens and businesses. That would really change everything and leave Europe completely behind.

译看起来 Fable 5 可能确实再次可用了。但有一个重大限制：仅限美国公民和企业。这将彻底改变一切，把欧洲完全抛在后面。

Chubby♨️@kimmonismus · 7天前63

Fable 5 is back - and now there’s video proof. Not just showing up in the model selector. People are actually using the model again. We are so back.

译Fable 5 回来了——现在有视频证据。不只是出现在模型选择器中。人们真的又开始使用这个模型了。我们回来了。

Rohan Paul@rohanpaul_ai · 7天前49

Great Stanford + MIT + Harvard + Anthropic paper. Gives a clear training-based reason for why larger models learn abilities smaller models miss. Says bigger AI models learn rare skills because they forget them less during training, their extra space protects weak learning signals. The authors say the issue is not just whether a small model could represent the task, but whether training lets it keep that task while many common tasks keep pushing on the same limited parts. Their core idea is that common tasks take up the model’s neurons first, so rare tasks get overwritten before they appear often enough to build into stable knowledge. In a crowded data mixture, common patterns get first claim on the model’s internal machinery. Small models may briefly pick up a rare signal, but the next wave of common-task updates overwrites it before the signal appears again. They tested this first with controlled toy tasks where they could change how rare and complex each task was, then with OLMo language models from 4M to 4B parameters. The main result is that bigger models learned low-frequency tasks much better, kept more task features inside their representations, and showed less gradient interference, which means common-task updates disturbed rare-task learning less. Larger models can remember weak rare signals long enough to turn them into real learned skills. ---- Link – arxiv. org/abs/2605.29548 Title: "Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention"

译Stanford、MIT、Harvard与Anthropic联合论文从训练层面解释大模型能力更强的原因：大模型遗忘更少，额外容量保护了弱学习信号。常见任务优先占据神经元，罕见任务在出现足够次数前被覆盖。小模型可能短暂捕捉罕见信号，但随后被常见任务更新覆盖。实验使用OLMo模型（4M到4B参数），结果显示大模型更好掌握低频任务，保留更多任务特征，梯度干扰更小。

meng shao@shao__meng · 7天前59

A 社指控阿里千问 25000 个伪装账号来蒸馏 Claude 这个数量比之前指控 DeepSeek、MiniMax 和 Kimi 加起来好像都多懂了，都传出去：Qwen 3.8 值得期待 😂 A 社的蒸馏账号报告，也是一种 Benchmark。。。

译Anthropic 指控阿里千问用 25000 个伪装账号来蒸馏 Claude 这个数量比之前指控 DeepSeek、MiniMax 和 Kimi 加起来好像都多懂了，都传出去：Qwen 3.8 值得期待 😂 Anthropic 的蒸馏账号报告，也是一种 Benchmark。。。

X.PIN@thexpin · 7天前61

http://x.com/i/article/2069762663366975488 # Tokenmaxxing is dying, and Chinese open-source models fill the gap Amazon, Meta, and Uber are capping the token spend as GLM-5.2 and DeepSeek give their models away for free. Over the past week, a new Chinese model called GLM-5.2 has set off another round of alarm in Silicon Valley. Released by the company z.AI under a permissive open-source license, it takes direct aim at the coding and agentic-workflow business that Anthropic has built its reputation on — and running on a one-million-token context window, it lands surprisingly close to Claude Opus 4.8 and OpenAI’s GPT-5.5. The open-source community is ecstatic. At the same moment, America’s “unlimited AI credits” mania is draining away. Amazon, Meta and others are killing their no-limits AI plans. After Uber’s engineers burned through a full year’s AI budget in four months, the company capped each employee at $1,500. Even Microsoft CEO Satya Nadella has warned that the industry can’t let a few AI giants swallow the whole economy. The link between open-source models and what people now call “Tokenmaxxing” is simple enough: programmers burn too many tokens, the bills get too big, and faced with a mountain of invoices, people reach for the open-source option. This is not the Tokenmaxxing takedown you’ve read on Substack, though. Because a few questions kept nagging at me. If open-source models can do the job, why is anyone still topping up their Claude account? And if everyone runs to open-source, how does anyone building a model make money? It was only after GLM-5.2 shipped that I arrived at a first answer. Both of these waves — the rush to open-source and the rush to burn tokens — come down to the same thing: how we decide to think about a token. ## Born Out of Scarcity Start with the open-source side, and start with GLM-5.2. Z.ai has released the core weights of GLM-5.2 under an unrestricted MIT license. Any company can download it free from Hugging Face, customize or fine-tune it, and run it locally or on a virtual machine. Standing the thing up is still a slog, but next to the now-delisted Fable 5, it’s a genuinely good option. The model was built on Huawei’s Ascend chips — no Nvidia hardware involved. But GLM-5.2 is not another DeepSeek. DeepSeek’s Liang Wenfeng came out of a quant fund, is worth billions, and has chosen near-total seclusion. (He recently put about $2.8 billion of fresh money into DeepSeek) Z.ai, by contrast, is an open-source model maker that’s already publicly listed in Hong Kong. It has no billionaire patron, and its road has been every bit as winding as DeepSeek’s. In 2020, BAAI’s Tang Jie argued the language model still deserved the effort. Of BAAI’s 480 A100 cards, 400 went to Tang’s team. Tang also tried Huawei’s 910A and 920 chips. On large-model training, the 920’s operator efficiency was just 18% of an A100’s; after Tang’s team helped rewrite the operators, they pushed it to roughly 40%, and trained a 13B code model, CodeGeeX. But Tang’s real goal was 100B-parameter model, even 2,000 910A cards weren’t enough. In the end, Tang turned to z.AI, the company he’d founded back in 2018, rented 1,000 cards. In July 2022, they finally had their hundred-billion model: GLM-130B. I tell his story because he embodies the type. Most of China’s open-source AI companies grew out of academic projects; they incorporated mainly because they needed to buy compute, and they open-sourced their architecture to keep their academic visibility. Starved of chips, they learned to adapt to whatever domestic silicon they could get. Z.ai wasn’t placed on the U.S. entity list until 2025, but it was already optimizing for Huawei chips in 2020. Localized compute and open architecture became, almost by default, the signature of Chinese AI. The open-source bet has its skeptics inside China, too. In 2024, Baidu founder Robin Li argued that closed models were more powerful and cheaper to run than open ones. His point being that closed models came with more compute and bigger teams, and that ERNIE was nearly a match for ChatGPT. (A little ironic, isn’t it?) ERNIE was not, in fact, in ChatGPT’s league, and China never produced a closed model strong enough to make Li’s case. Turning open-source into profit is a hard problem. In a 2025 interview, a z.AI expert described the company’s three possible lanes — inference, agentic, and coding — and said z.AI chose coding. MiniMax, by contrast, chose multimodal AI and AI companionship. At the time it wasn’t an obvious call: z.AI’s business leaned on enterprise and government contracts, coding showed no clear path to profit, and multimodal could win consumers directly. Z.ai was not the favorite. Then the AI-coding boom arrived. Z.ai’s latest results show a net loss of about ￥3.18B ($444M) against R&D spending of roughly ￥3.2B ($444M). Still in the red — but strip out the open-ended spend on compute, and z.ai’s revenue can cover day-to-day operations. If it can get cheaper chips, or use its chips more efficiently, or land a wave of enterprise buyers, the losses could narrow. That would be good news. In a sense, z.AI may owe Anthropic a thank-you note: both for the AI-doom evangelism and for the AI-coding fervor. Anthropic’s strong models cultivated customers, and its incessant messaging then drove some of them away. One of the places those customers landed was z.AI. A first conclusion, then: going open-source is a passive choice: a Chinese model maker admitting, out loud, that it’s behind on both compute and model quality. But if closed-model progress stalls, users won’t keep paying premium prices for closed-model tokens; they’ll choose open-source on their own. The Chinese saying fits: just hold your plate steady, and the roast duck falls from the sky. Remember to Like & Subscribe! ## Water, Electricity, and a Bad Analogy Now the other wave : Tokenmaxxing. GLM-5.2, DeepSeek and Kimi are mostly catching customers who fled the bills. But if OpenAI and Anthropic were good enough, would open-source still persuade anyone? Then Alibaba gave me a frame. In a March internal memo, CEO Wu Yongming argued that in the AI era, the token would become a basic factor of production, the way traffic was in the internet era. Alibaba set up the Alibaba Token Hub (ATH) around that idea. Follow the logic. In the age of electrification, a country’s electricity output and its GDP growth tend to rise together — no nation ever went bankrupt building power plants. So I looked at U.S. electricity prices, consumption and GDP from the 1920s to the 1960s. As prices fell, total spending on electricity rose 6.2x, but nominal GDP rose 11.1x. Americans spent relatively less on power and got more output for it. The pattern doesn’t always hold cleanly, though. Through the fast-industrializing decades in Japan, China, and West Germany, electricity spending actually outran GDP. But in West Germany and Japan, even during those high-growth years, the share of GDP eaten by electricity fell sharply to almost 2.0%. That suggests is a kind of lag: a rising industrial economy takes roughly fifteen years to work through the adjustment and reach the point where cheap power finally translates into abundant output. If Wu is right and tokens really are AI’s water and electricity, they ought to deliver something similar. But run the numbers and the story breaks. Over the past four years, the cost of a given unit of AI dropped more than 90 percent, while total token spending rose 70x. My god. If this is water and electricity, the bill is climbing far too fast. A seventyfold jump in token spending over four years has not produced anything like a matching surge in what society actually makes. Yes, the data centers went up, and the chips are back-ordered for months. But none of it has meaningfully improved the quality or efficiency of production outside the AI industry itself. What breaks the “AI as utility” analogy is the reasoning model. Across coding and agentic tasks, a model now generates thousands of internal reasoning tokens before it answers, pushing single-task consumption 10 to 100 times higher than older models. So how much does all that buy you? In an NBER paper, DeMiller, Musolff and Yang measured the gains from AI coding tools across four stages of work: - Writing a single file: +290% - Bulk work: +150% - A specific deliverable: +50% - A shipped, delivered product: +30% In other words, even in coding — the thing AI does best — the gains shrink fast as you zoom out from a single file to a finished product. Optimizing the whole pipeline is far harder than optimizing one slice of it. ## Three Months of Unlimited Tokens As latecomers, Chinese firms tried to copy the Tokenmaxxing wave too. Per public reports in March, Tencent gave core R&D teams an annual token package worth about $31,700 each, plus $1,000 a month for outside tools; ByteDance opened its internal AI tools for unlimited use and reimbursed half of employees’ personal AI experiments, capping technical staff at $1,000 a year; Baidu handed engineers unlimited ERNIE access plus up to $800 a year for outside tokens; 360 simply loaded every employee with 100 million tokens. The recalibration came fast. Three months later, Tencent’s Hunyuan team was capped at roughly $970 worth of outside models, and everyone moved onto quotas — though using Tencent’s own Hunyuan model stayed unlimited. ByteDance staff likewise faced no limit on its in-house TRAE tool. Internally, Tencent came out against usage rankings, refusing to treat token consumption as a single yardstick for output. The reason was simple: Chinese companies wanted real output, and they weren’t seeing it. One employee, speaking anonymously, described a team that built workflows across several different models — only to find the AI-generated pieces wouldn’t fit together, and to scrap the whole thing and start over. Twenty-odd people spent about $6,900 in tokens in a month and had nothing to show for it. At some firms, the free tokens got quietly repurposed — for analyzing stocks, say — and the company had no idea where they’d gone. Meta is tightening what employees can spend on Anthropic and other providers — a sharp reversal from the scene a few months earlier, when staff competed to burn tokens. Bloomberg has reported that Uber and Walmart each capped AI coding-tool use; the Financial Times reported that Amazon scrapped the internal leaderboard that ranked employees by AI usage. A June report from the consultancy Bain, titled Your AI Budget Is Growing. Your Returns Aren’t. Here’s Why., found that among companies able to quantify AI’s cost savings, 40 percent saw actual savings of 10 percent or less. Of the 37 percent who’d targeted savings of 11 to 20 percent, only 31 percent actually got there. The grassroots buying isn’t over, though. One ByteDance engineer pays for Claude Max — $100 a month reimbursed — to write what he considers the cleanest code. Better than DeepSeek, by his lights, and GLM he can’t get. But one employee’s purchase doesn’t make the whole company better off. Tokenmaxxing shifts an individual’s cost onto the employer. The irony is that the last firm into the water was the first one out. Tencent, a relative laggard in China’s AI race, quit Tokenmaxxing earlier than anyone. ByteDance is still touting its numbers: as of June, it says, daily token calls to its Doubao model topped 180 trillion, up more than tenfold in a year. Continue Reading

译中国公司 z.AI 以 MIT 许可证开源 GLM-5.2 模型，拥有百万 token 上下文窗口，基于华为昇腾芯片训练，性能接近 Claude Opus 4.8 和 GPT-5.5。与此同时，Amazon、Meta、Uber 等美国公司因工程师过度消耗 token 而开始限制 AI 预算（Uber 每员工上限 1500 美元），推动开源模型需求。GLM 团队源自学术项目，长期适配国产芯片；DeepSeek 投入 28 亿美元，共同成为“Tokenmaxxing”趋势的替代方案。