Meta AI has shockingly grown 2.5x in the last 2mos and is poised to be the #3 AI consumer app in the world behind Gemini and ChatGPT. Sadly, this growth is very likely inorganic given it has by far the worst retention by a mile: only 4.5% users stay in 30 days.

译Meta AI 在过去两个月内惊人地增长了 2.5 倍，有望成为仅次于 Gemini 和 ChatGPT 的全球第三大 AI 消费级应用。遗憾的是，这种增长很可能是非有机的，因为它的留存率迄今最差：只有 4.5% 的用户会在 30 天后继续使用。

歸藏(guizang.ai)@op7418 · 6月8日5

最近可以在各种媒体和账号上，看到我的那个 Skills 推荐。感谢量子位。

译最近可以在各种媒体和账号上，看到我的那个 Skills 推荐。感谢量子位。 [引用 @op7418]：http://x.com/i/article/2053655813877870592

Rohan Paul@rohanpaul_ai · 6月8日63

Coinbase CEO Brian Armstrong thinks AI demand is almost limitless, but he expects 80% of workloads to shift to models that are 99% cheaper within 12-18 months.

译Coinbase CEO Brian Armstrong 预测，对智能的需求近乎无限，但 80% 的工作负载将在 12-18 个月内迁移到便宜 99% 的模型，仅 20% 继续运行在追求最高 IQ 的最新模型上（如科学突破、高级编排型 AI 智能体）。他类比高端 MacBook/游戏 PC 的配置占比，但指出模型价格下降远超摩尔定律。Armstrong 认为未来瓶颈是能源和算力，而非更好的模型。Coinbase 正将用户提示词路由到更便宜的模型，部分情况下 token 用量指数增长，成本基本持平。

Orange AI@oran_ge · 6月8日41

创作的核心往往根植于不可测度的人类情感、不可言说的共鸣，这都是理性分析难以捕捉的人类未来可以和AI竞争的领域，可能恰恰是我们作为人类有缺陷的心灵角落。 -李诞

elvis@omarsar0 · 6月8日53

The point is that you should start implementing ways to encode instructions/prompts with clear goals inside automations. Nothing new but newer LLMs are being trained to perform for longer duration uninterrupted. Loops are one way to take advantage of that.

译关键在于，你应该开始实施在自动化中编码带有明确目标的指令/提示词的方法。这并不是什么新鲜事，但较新的大语言模型正在被训练以更长时间不间断地执行。循环就是利用这一点的一种方式。

小互@xiaohu · 6月8日26

应该是暗示全新的Siri

Chubby♨️@kimmonismus · 6月8日56

A walking contradiction: On the one hand, Daniela from Anthropic says that AI has hardly replaced any jobs so far, on the other hand, co-founder Olah warns the Pope about the disruptive effect of AI on the labor market and society.

译一个行走的矛盾体：一方面，来自Anthropic的Daniela表示，AI迄今为止几乎没有取代任何工作岗位；另一方面，联合创始人Olah却向教皇警告AI对劳动力市场和社会的破坏性影响。

小互@xiaohu · 6月8日45

这个说的的挺对的我最近的一个和很强烈的感受也是人机协作才是未来之前我一直追求完全的自动化导致了不停的和AI扯皮或者总觉得对系统、技能不满意一直在修改它... 结果是你越想优化它它就崩溃，往相反的方向走了我现在删除了所有规则，尝试在关键节点进行人工介入来和AI一起完成，效果会更好而且人的精神也好了，不再经常骂AI是傻逼了...

译小互分享个人经验：此前追求完全自动化，导致不断与AI扯皮、对技能和系统频繁修改，效果反而更差。现在删除所有规则，只在关键节点人工介入，与AI共同完成任务，不仅效果更好，人也轻松许多，不再频繁抱怨AI。

郭明錤｜Ming-Chi Kuo@mingchikuo · 6月8日65

WWDC26 不影響 Apple 2H26 股價正向趨勢，但將揭露多頭敘事的續航力 ‒‒ 1. Apple 目前的多頭核心敘事，是一個近乎直覺、沒什麼人反駁的市場共識：「即使 Apple 在 AI 進度上暫時落後，最終仍能後來居上」。 2. 根據最新的供應鏈調查，我認為 Apple 的業績將會好到今年底，而這會進一步強化多頭核心敘事成為：「Apple 沒有 AI 都這麼好，有了 AI 還得了！」 3. 因此，無論 Apple 在 WWDC26 上講什麼，只要這個多頭核心敘事沒有被破壞，Apple 2H26 的股價正向趨勢就不易改變。 4. 上述多頭核心敘事並非沒有破綻，但我認為至少有機會維持到 2026 年底。至於能維持多久，就是這次 WWDC26 真正值得觀察的地方。 5. 這次 WWDC26 的重點，不在於發表會結束後的短線股價反應，而是：同樣使用 Gemini，Apple 能否做出比 Google 更好的 AI 應用、agentic workflow、裝置端與雲端混合體驗。 6. 如果答案是肯定的，將有利於延長 Apple 的多頭核心敘事；如果答案是否定的，意味著「Gemini 決定了 Apple AI 體驗的上限」，則股價雖未必會轉空，但「Apple 終究會後來居上」的多頭核心敘事，將開始被更多人重新檢視。

译郭明錤指出，Apple 當前多頭核心敘事是「AI 雖落後但最終後來居上」。供應鏈調查顯示其業績將好到今年底，進一步強化此敘事。WWDC26 上無論發布什麼，只要該敘事不被破壞，Apple 2H26 股價正向趨勢就不易改變。真正觀察點在於：Apple 能否用 Gemini 做出比 Google 更好的 AI 應用、AI 智能體工作流及設備端+雲端混合體驗。答案肯定則利好敘事延續；否則「Gemini 決定 Apple AI 上限」的質疑將削弱「後來居上」共識。

郭明錤｜Ming-Chi Kuo@mingchikuo · 6月8日60

WWDC26 won't change Apple's positive 2H26 share-price trend, but it will test the staying power of the bull narrative ‒‒ 1. Apple's core bull narrative right now is an almost intuitive market consensus that few people push back on: "Even if Apple is temporarily behind on AI, it will ultimately catch up and come out ahead." 2. Based on my latest supply-chain checks, I believe Apple's business momentum will remain strong through year-end, which should further reinforce the narrative into something like: "If Apple is doing this well without AI, just imagine once it has AI." 3. So regardless of what Apple says at WWDC26, as long as this core bull narrative stays intact, Apple's positive 2H26 share-price trend is unlikely to change. 4. That core bull narrative has its weak spots, but I think it has a good chance of holding at least through end-2026. How much longer it can last is what makes WWDC26 genuinely worth watching. 5. The key takeaway from WWDC26 will not be the short-term share-price reaction after the event. It will be whether Apple, using the same Gemini, can deliver better AI applications, agentic workflows, and on-device & cloud hybrid experiences than Google. 6. If the answer is yes, it would help extend Apple's core bull narrative. If the answer is no, it would suggest that Gemini sets the ceiling for Apple's AI experience. The stock may not necessarily turn bearish, but the "Apple will ultimately come out ahead" narrative would start to face growing scrutiny.

译郭明錤指出，苹果核心看涨叙事是“AI暂时落后但最终会迎头赶上”。供应链显示业务势头年底前强劲，强化“无AI已不错，有AI更想象”叙事。故无论WWDC26内容，只要叙事不变，苹果2026下半年股价趋势积极。WWDC26真正看点在于苹果能否用同款Gemini做出比谷歌更好的AI应用、智能体工作流及端云混合体验。若能，叙事延续；若不能，Gemini设定AI上限，“苹果最终领先”将受质疑。

Chubby♨️@kimmonismus · 6月8日55

Interestingly, banks are the sector where AI will first cause significant job losses. Banks are openly preparing for AI-driven job cuts, with executives at JPMorgan, Citigroup, Goldman Sachs, and Standard Chartered acknowledging that roles will be eliminated as the technology takes hold. Junior analyst classes are being cut by as much as two-thirds, leaving students struggling to break into finance, even as banks still source most of their AI talent from those same entry-level cohorts. Meanwhile, banks are rolling out targeted AI use cases like Citigroup's wealth-management avatar and Revolut's in-app assistant, though some doubt that all the announced cuts are truly AI-driven rather than cover for prior overhiring.

译银行业正公开为AI驱动的裁员做准备。摩根大通、花旗、高盛、渣打银行高管承认，随着AI技术普及，许多岗位将被淘汰。初级分析师班级规模削减幅度高达三分之二，导致学生难以进入金融行业，尽管银行仍主要从这些初级群体中招募AI人才。与此同时，银行也在推出针对性AI应用，如花旗的财富管理化身和Revolut的APP内助手。不过，有人质疑并非所有宣布的裁员都真正由AI驱动，可能只是掩盖之前的过度招聘。

Deedy@deedydas · 6月8日70

This is the best scene in Hell Grind, an entirely AI-made movie, the flashback. Watch it and read this analysis on where we are with AI movies today: time, cost, quality. Overall: Phenomenal technical demo by Higgsfield. Mediocre movie. Good graphics, hints of emotion, but superhero movie level quality in certain scenes at best. Too many cuts. That said, 660x fewer man hours, 50x faster and 36x cheaper than the median US film. Time: The 95 min film took 15 people 14 days. The median US theatrical production takes ~200 people ~2yrs. That’s a 660x improvement in man-hours and 50x in calendar time. Economics: It took $500k, 80% of which was compute. The final footage was cut from ~100hrs of footage generated from text to video / image to video models like Bytedance’s Seedance: a 64:1 “curation” ratio. The median US movie takes ~$18M, with even indie films costing $1-5M. Thats 36x cheaper than median. Quality: Average watch *at best*. Way too many cuts between shots, several characters change accents and have “AI” synthetic voices and characters feel like it’s AI too. Movement, editing and blocking feel artificial too. On the plus side, we’ve more or less solved character consistency, camera angles and realism. The reason the movie wasn’t amazing was more about poor directorial choices than innately unusable video models. Hard to put a number on it but maybe we’re at ~90% on quality that is technically achievable. If Scorsese made an AI movie, I reckon it would be quite good. I know the visceral reaction to anything AI is real and well-studied. But I think it’s folly to fight the inevitability of AI film. It’s too cheap and quick to ignore and almost there on quality. Creators with distribution *will* make AI films and shows and just put them on YouTube. This is the worst quality, slowest and most expensive it will ever be. In the end, good content beats “real” content.

译95分钟AI电影《Hell Grind》由15人14天完成，成本50万美元（80%为算力）。相比中位数美国电影（约200人2年、1800万美元），实现660倍人时、50倍时间、36倍成本改善。影片从约100小时AI生成素材（使用字节跳动Seedance等模型）中按64:1比率剪辑。质量方面，角色一致性、摄像机角度和写实感基本解决，但剪辑过多、角色口音多变、AI合成声音明显，动作和编排生硬——整体平庸，主因是导演选择而非模型限制。作者认为这是AI电影最差、最慢、最贵的时刻，未来创作者将直接投放YouTube。

Logan Kilpatrick@OfficialLoganK · 6月8日18

bullish on Gemini

译看好 Gemini

Ethan Mollick@emollick · 6月8日32

A year ago the closest thing we had to an AI agent was o3.

译一年前，我们最接近AI智能体的是o3。

François Chollet@fchollet · 6月8日21

I wrote my first neural networks in pure C, then in Matlab, then in NumPy, before eventually upgrading to Theano. Since then I have seen and tried pretty much every NN framework ever developed. Some are bad, some are good. The good ones understand API design principles.

译我最初用纯C写神经网络，然后在Matlab里写，接着在NumPy里写，最后升级到Theano。从那以后，我见过并尝试过几乎所有开发出来的NN框架。有些很烂，有些很好。好的框架都理解API设计原则。

宝玉@dotey · 6月8日44

长时间运行 Agent，Agent 能自行验证才是关键，否则可能只是浪费 Token

译宝玉指出，Agent 能否自我验证是长时间运行的关键，否则可能浪费 Token。@bcherny 的基准测试显示 Claude Opus 最适合长时间运行，并给出 5 条自主运行技巧：1. 使用自动权限模式；2. 部署动态工作流让 Claude 协调数百/数千个 Agent；3. 用 /goal 或 /loop 指令持续推进；4. 在云端运行 Claude Code 以便关闭笔记本；5. 确保端到端自我验证——通过 Chrome 浏览器扩展验证网页、iOS/Android 模拟器 MCP 验证移动端、启动完整 Web 服务验证后端。

jason@jxnlco · 6月8日47

i refer to these people 'the car guys of ai' they are just working on their car all the time

译我将这些人称为“AI的汽车爱好者”，他们只是一直在修理他们的车。

Greg Brockman@gdb · 6月8日20

interesting

译有趣

meng shao@shao__meng · 6月8日23

这篇公众号真的太典型的中国自媒体风格了，咋咋呼呼，各种定论，论据基本靠嘴。。。大厂抢青少年？这难道不是培训机构的生意而已？最小年龄低至 6 岁，学 Agent、Coding？连幼儿园的孩子也不放过了？抢人和裁员的这段，看到「虾」这个字，就完全没欲望读了，一定对 AI 一无所知的编辑写的。我可以拍脑袋说，靠 AI 裁员，只是企业经营无方，给自己找的一个体面的借口罢了。。林凡？脉脉 CEO？去年 AI 人才需求增长 10 倍？今年又增长了 8.7 倍？哪来的数据啊？是脉脉里的职位？那玩意真的有人在用？会不会是把原来的职位，改了个名字，加个 AI 的关键词？

译邵猛发文批评某公众号文章，称其是典型中国自媒体风格——咋咋呼呼、定论多、论据靠嘴。文章提到“大厂抢青少年”（最小6岁学Agent、Coding），邵猛认为这只是培训机构的生意。文章还声称“靠AI裁员”，邵猛斥之为企业借口。邵猛特别质疑脉脉CEO林凡的数据：去年AI人才需求增长10倍、今年再增长8.7倍，怀疑这些数字来自脉脉内部职位，且可能只是旧职位改名加“AI”关键词，并非真实需求。

meng shao@shao__meng · 6月8日59

不写 Prompt，写 Loops -- Boris Cherny (Claude Code) 道理大家都懂，伴随 Claude Opus 4.8 和 GPT-5.5 这种高智能、高推理和编程能力的模型出现，模型能自主决策和解决的问题复杂度越来越高，已经不需要我们一问一答的手把手写 Prompt 可是，不管是 Loops 还是 Codex Goals，都有一个不得不面对的问题，Token！Token！还是 Token！因为他们消耗 Token 真的太快了，可能一个 Goals 就干没了我 5 小时的用量，我们还远没有到 Token 自由的阶段，特别在企业里还要严格审查 Token 消耗的 ROI，所以在企业中大家还是会把前置的 Spec、AGENTS.md 等尽量写明约束，让 AI 尽量做确定性可控的部分。

译Anthropic Claude Code 负责人 Boris Cherny 表示，他不再手动写提示词，而是编写 Loops 让 AI 自主决策和解决问题。伴随 Claude Opus 4.8 和 GPT-5.5 等高智能模型出现，手把手写 Prompt 的模式将被取代。但 Loops 方案消耗 Token 过快，一个 Goals 可能耗尽 5 小时用量，企业仍需严格审查 Token 消耗的 ROI，因此实践中会通过前置的 Spec、AGENTS.md 等约束让 AI 做确定性可控的部分。Boris 预测这一转变将在今年剩余时间持续。

宝玉@dotey · 6月8日56

> 为什么选 HTML，而不是 React 选 HTML 而不是 react 这个观点我是不认同的。第一个原因是设计稿它是需要关联数据有交互的，能根据数据去动态地变更 UI，这点 HTML 是做不好的。像 Claude Design 它不仅仅是一个 UI 设计工具，更是一个 UX 交互设计工具第二个呢，当你的 UI 复杂到一定程度，你一定是要拆的。如果你是一个巨大的 HTML，其实你要重构或者是更改的话会非常的麻烦，因为它不可能把整个的 HTML 文件给加载进去。而使而使用 react 的话，它可以把页面拆成若干个很小的小组件，这样的话你一方面可以重用，另外的话如果你要修改，你只要加载其中的一个小组件就可以了，它对上下文要求是没有那么高的。还有一点是用 react 的话，它天然地就把设计划分成了一个结构化的树形的结构，当基于这个设计稿去开发的时候，对 coding agent 的话它是很好用的。而如果你是一个巨大的 HTML，它不是一个结构化的，它的实现跟设计会是比较割裂的，当你的设计稿去修改的时候，你很难再去跟你的代码有一个映射或者是同步至于文章中说到的理由，说因为模型要把注意力放在保证 react 的运行准确率上，这个在以前也许是成立的，但以现在的大模型能力，这一点来说已经不是什么问题了

译宝玉不认同“选HTML而非React”的观点。首先，设计稿需要动态数据交互，HTML难以胜任，而Claude Design是UX交互工具。其次，复杂UI必须拆分为小组件，React能实现重用与低上下文修改，大型HTML则维护困难。再者，React天然形成结构化树形设计，方便coding agent开发，HTML结构割裂、难以与设计映射。至于文章说模型注意力需放在React运行准确率上，宝玉认为当前大模型能力已使此问题不复存在。

Sam Altman@sama · 6月8日24

interesting recursive loop here maybe

译Codex 推出了一个新的“大按钮”。在接下来的 100 天里，每天会选出一位用 Codex 做出了令人印象深刻或极其有用工作的人，给他们一个月的 10 倍使用限额，看看他们能做出什么。明天开始第一个。Sam Altman 评论称“这里可能有个有趣的递归循环”。

Berryxia.AI@berryxia · 6月8日14

Google 3.5 Pro 看来真的是没有能打的了！只能拿这种测试来“蒙骗”大众了。 Google 大善人醒醒啊，你看看隔壁即将发布的是啥模型和自己家的这对比下啊！ 😑

宝玉@dotey · 6月8日36

Claude Code 虽然移动端可以 remote control，但是 plan 后不能 bypass permission，需要不停的确认，神烦

jason@jxnlco · 6月8日14

This but “codex is for almost everything”

译但这……“Codex 几乎适用于一切”。

elvis@omarsar0 · 6月8日55

Super-powerful AI models will launch in the coming weeks. We are looking at a potential step change in model capabilities. The biggest mistake right now is to lock into one vendor. I say this not only from a cost perspective, but also from an engineering perspective. Start figuring out how to leverage combinations of these models (including open models). What that means is that you can swap models anytime and best leverage their strengths. For coding agents, open models are already just as good as the frontier ones. So, how to better prepare? Consider how you will be routing tasks/work to these models. AI model routing is high reward, and it should be part of your AI engineering efforts going forward.

译未来几周将有超强AI模型发布，模型能力可能迎来阶段性提升。DAIR.AI创始人Elvis Saravia认为，当前最大的错误是从成本或工程角度锁定单一供应商，建议开始利用模型组合（包括开源模型），保持“模型无关”，以便随时切换模型以发挥其各自优势。对于编程智能体，开源模型表现已媲美前沿模型。他建议将“AI模型路由”作为核心策略，高效分配任务/工作给不同模型，这是一项高回报的AI工程投入。

Rohan Paul@rohanpaul_ai · 6月8日67

Demis Hassabis's new interview: "Society needs to hear that because we don't have long to prepare for what that means. We are standing in the foothills of the singularity now. ..which is AGI. I believe that we are only a few years away from that, maybe around 2030, plus or minus a year. " ~ Demis Hassabis, Co-Founder and CEO of Google DeepMind It is going to be enormously profound, I think. The future, in my view, is still to be written. But these next few years are going to be very critical as to which way that will go, and how we collectively want that to look.” --- IMO, The real disruption is not whether AGI arrives exactly in 2030, plus or minus a year, but whether institutions can adapt, as in post-AGI world, technology will change much faster than human systems can respond. Schools still train people for stable professions, companies still organize work around human bottlenecks, and governments still regulate after harm becomes visible. AGI, if it arrives anywhere near the frontier-lab timelines, compresses that lag into a dangerous gap. ---- From "Stanford Graduate School of Business" YouTube channel, (link in comment)

译Google DeepMind 联合创始人兼 CEO Demis Hassabis 在新采访中表示，社会需要意识到我们没有多少时间准备了，人类正站在奇点的山麓。他认为 AGI 可能只需几年，大约 2030 年（±1 年）就能实现。推文作者评论指出，真正的颠覆不在于 AGI 何时精准到达，而在于机构能否适应——后 AGI 世界技术变化远快于人类系统响应速度，学校、公司、政府均未做好准备。若 AGI 按前沿实验室时间线到来，这一滞后将压缩成危险鸿沟。

SenseTime@SenseTime_AI · 6月8日26

Key takeaways from Phil Wong, Head of Capital Markets at SenseTime, during @HSBC‘s Private Bank Roundtable: China's #AI advantage today is increasingly defined by 𝗰𝗼𝘀𝘁, but also 𝗾𝘂𝗮𝗹𝗶𝘁𝘆 𝗼𝗳 𝗽𝗿𝗼𝗱𝘂𝗰𝘁, and in turn the ability to 𝗯𝗼𝗼𝘀𝘁 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝘃𝗶𝘁𝘆 and 𝗲𝗻𝗵𝗮𝗻𝗰𝗲 𝗲𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆 for the end client, in order to maximise and optimise economic outcomes for end users. The real differentiator lies in 𝗰𝗿𝗲𝗮𝘁𝗶𝗻𝗴 𝗺𝗲𝗮𝘀𝘂𝗿𝗮𝗯𝗹𝗲 𝗯𝘂𝘀𝗶𝗻𝗲𝘀𝘀 𝗼𝘂𝘁𝗰𝗼𝗺𝗲𝘀 𝗮𝘁 𝘀𝗰𝗮𝗹𝗲, in addition to just a cost-benefit. How SenseTime is putting this into practice: • MultimodalＭodel #SenseNova U1 delivers strong performance with a smaller model footprint. • AI tools are streamlining daily workflows—such as data analysis and PPT generation with Office #Raccoon, and video production powered by #Seko. • AI infrastructure, #SenseCore, leverages compute-power co-optimization to reduce energy consumption and improve efficiency. Beyond these, keep an eye on spatial intelligence, world models, and other emerging AI frontiers.

译商汤资本市场主管 Phil Wong 在汇丰私人银行圆桌会议上指出，中国 AI 优势日益体现于成本、产品质量及提升终端客户生产力与效率。真正差异化在于大规模创造可衡量业务成果。商汤实践包括：多模态模型 SenseNova U1 以较小规模实现强性能；AI 工具 Office Raccoon（数据分析与 PPT 生成）和 Seko（视频制作）；AI 基础设施 SenseCore 通过算力协同优化降低能耗、提升效率。此外需关注空间智能、世界模型等前沿领域。

Yuchen Jin@Yuchenj_UW · 6月8日65

OpenAI’s Sora lead left. OpenAI’s chips lead left for Anthropic. The “take every big bet at once” strategy is starting to look fragile. When you’re in a knife fight with Anthropic over coding, every other bet starts bleeding. (On coding itself, OpenAI has clearly caught up.)

译OpenAI的Sora负责人离职了。 OpenAI的芯片负责人跳槽去了Anthropic。 “同时押注所有大赌注”的策略开始显得脆弱。当你在编程领域与Anthropic进行白刃战时，其他所有赌注都在流血。（在编程本身方面，OpenAI显然已经迎头赶上。）

Chubby♨️@kimmonismus · 6月8日65

Demis Hassabis is arguably the most serious scientist around. He's not someone who engages in hype to sell products. But when even someone like Demis says the following, it should give us all pause: - "He [Demis] equated its arrival [AGI, around 2030] to the singularity - a point in time when there's no turning back from a breakthrough technological development. - "Society needs to hear that because we don't have long to prepare for what that means" - "When we look back at this time, I think we will realize that we were standing in the foothills of the singularity" (Google i/o) We are on the threshold of the most profound revolution. Comparable to the Industrial Revolution, but ten times faster and ten times more powerful.

译DeepMind创始人Demis Hassabis在Google I/O上表示，AGI（约2030年）的到来将等同于奇点——一个不可逆转的技术突破点。他直言社会需要尽早准备，因为时间不多了；回顾当下，我们正站在奇点的山脚。推文作者将其视为比工业革命快10倍、强10倍的深刻革命，人类社会正面临前所未有的变革。

Rohan Paul@rohanpaul_ai · 6月8日46

"AI can do a lot for medical discovery, but there are also legitimate concerns that AI is outperforming PhD-level virologists, & that could abet a bioweapons risk that is not hypothetical." - Johns Hopkins professor & adviser to Anthropic, Ben Buchanan

译AI 可以在医学发现方面做很多事情，但也有一些合理的担忧，即 AI 正在超越博士水平的病毒学家，这可能会助长一种并非假设性的生物武器风险。 ——约翰霍普金斯大学教授兼 Anthropic 顾问 Ben Buchanan

Ethan Mollick@emollick · 6月8日63

It is a really good time to store up a few of your hardest, most valuable, and most unusual ideas - whether for work, hobbies, or a new venture. Thanks to AI, really good & unique ideas are getting extremely cheap to implement, but not necessarily easier to find. Big opportunity

译现在正是储存一些你最困难、最有价值、最不寻常的好主意的好时机——无论是为了工作、爱好还是新的事业。得益于 AI，真正好且独特的想法实现起来变得极为廉价，但找到它们却不一定更容易。巨大的机会就在眼前。

elvis@omarsar0 · 6月8日59

This was one of the standout AI papers of the week. (bookmark it) It tackles a question most self-improving AI agents ignore: is the agent actually discovering anything, or just remixing what it already knows? How can you tell whether the agent is doing real discovery or just confident retrieval? The authors give three clean buckets: - Retrieval is looking something up in a notebook you already have. - Search is combining tools you already own in new ways. - Discovery is inventing a new concept that wasn't in your toolkit before. The issue is that most agents stop at the first two. The math behind their definition (category theory plus a left Kan extension, if you care) is basically a bookkeeping trick to ask: could the old version of me have produced this result? If yes, it's not discovery. If no, something genuinely new showed up. They build a Builder/Breaker agent that studies protein mechanics. Over four rounds, the model's fit accuracy actually drops (R² goes from 0.48 to 0.68 to 0.54 to 0.41). At first glance, that looks like a failing agent. It isn't. The agent kept taking on harder proteins and rewriting its theory to cover them. Data grew almost 10x while the model code grew only 1.3x. A smaller theory covering a bigger world is exactly what good science looks like. Why does it matter? If you optimize for accuracy alone, your self-improving agent will just settle into easy benchmarks and stop. This paper offers a cleaner success signal and asks whether the agent is compressing more of the world into less code over time. Paper: https://arxiv.org/abs/2606.01444 Learn to build effective AI agents in our academy: https://academy.dair.ai/

译本周一篇AI论文探讨自我改进智能体是否真正发现新知识，还是仅重新组合已知信息。作者将行为分为三类：检索（查询已有笔记本）、搜索（组合现有工具）和发现（发明新概念），并用范畴论和左Kan扩展定义——若旧版本能产生相同结果则非发现。他们构建Builder/Breaker agent研究蛋白质力学，四轮中R²从0.48升至0.68再降至0.54和0.41，看似变差实则不断挑战更难蛋白质并重写理论：数据增长近10倍，模型代码仅增长1.3倍。论文提出用代码压缩率作为真实发现信号。链接：arxiv.org/abs/2606.01444。

Chubby♨️@kimmonismus · 6月7日64

Apple’s Touch Bar was ahead of its time. Just imagine the incredible use cases it could have today. - rate limits, context and much more

译WWDC 2026 预计将成为苹果 AI 关键节点。Siri 将围绕定制版 Google Gemini 模型（约 1.2 万亿参数）重建，自有设备端模型仅约 3B 参数。新 Siri 采用混合架构——本地小模型 + 云端大模型，由苹果控制 UI、应用权限和隐私层。预期功能包括更自然对话、跨应用个人上下文、屏幕感知、应用内操作、独立 Siri 应用（支持语音、文件上传和多模态交互）、Dynamic Island 深度集成，并可选择接入 ChatGPT、Claude 或 Gemini 等第三方服务。苹果意将 Siri 打造成操作系统的隐私 AI 层，成为跨 iPhone、Mac、iPad 的系统智能体。

meng shao@shao__meng · 6月7日16

你是从什么时候觉得自己和以前的工作能力和状态不一样了？我是在旷视的那段时间。去旷视前，我在一家法国企业做数据分析软件，公司的客户很稳定，节奏很慢很稳，员工稳定性很好，是真的 wlb 的状态，甚至一度觉得是在养老。做了半年多我实在受不了了，实在太慢了，差不多 2 天就能做完一周的事，剩下三天对着电脑点来点去发呆，和周围同事闲聊下来大家都是这样，所以兴趣爱好都非常多。。那时还年轻，觉得养老很可耻 😂，就决定去挑战一下，14-15年左右，国内 AI CV 四小龙刚成型，我就去面了商汤和旷视，对两位创始人印象特别深，就很坚决的选了旷视。进来旷视才知道，我周围基本一水的清华姚班，我也是无知者无畏了，面试时根本不知道。这些人，不管是全职还是实习生，是真的非常的聪明，智商太高了，不会的东西学起来超快，会的东西钻的特别深，感觉我自己和他们都不是一个物种 😂 当时合作最多的是研究员的算法工程师们，我做算法封装和落地商业化，那真是如履薄冰，就怕跟不上大家节奏，拖后腿，做好自己应用开发之外，经常要去学 CV 算法的知识，也真是硬着头皮上，不过有那么 2-3 个月之后，感觉就能跟上了，当时和一位算法伙伴一起做人脸关键点和美颜贴纸，我们俩大概搞了两周，古法编程就把这套算法和应用跑通了，当时公司觉得挺不错，可以拿去推了。从那时起，我突然觉得，周围同事的聪明程度，人才密度，以后大概率不会超过现在了，而现在我也能跟上节奏了，好像跟以前完全不一样了。到后来在旷视做双目，做炫彩，再独立做硬件的刷脸方案等等，也从容了很多。再到后来从旷视离开，到一个创业公司做 CTO，再到一个硬件公司，每次都是全新的领域，不过心里也没有再怕过。

译邵猛此前在法国企业做数据分析软件，节奏极慢，两天就能完成一周工作。14-15年左右，他加入AI CV四小龙旷视，周围多是清华姚班的全职或实习生，智商极高、学东西快且深。作为算法封装与商业化工程师，他硬学CV知识，两三个月后跟上节奏。曾与一位算法伙伴两周用古法编程跑通人脸关键点和美颜贴纸算法，获公司认可。此后心态彻底改变，不再畏惧跨领域挑战。后来离开旷视，在创业公司担任CTO等。

Rohan Paul@rohanpaul_ai · 6月7日63

💯💯 David Sacks rips Bernie Sanders' 50% AI ownership plan

译David Sacks 批评桑德斯提出的政府持 AI 公司 50% 股份的计划。他指出 AI 实验室 CEO 们反复声称 AI 会导致大规模失业，但数据不支持，且他们夸大风险而忽视利益与缓解措施。保守派还担心所谓慈善实际是让 NGO 致富并推动与传统价值观相悖的议程。Anthropic 和 OpenAI 是公益公司，用其一半财富偿还国债看似合理，但 AI 国有化将加速企业-政府融合，导致类似中国社会信用体系的中央集权 AI 控制，比央行数字货币更危险。他警告保守派当前支持的监管未来可能被民主党用来对付自己。

凡人小北@frxiaobei · 6月7日63

Notion 这种级别的产品第一反应是直接切流。某种程度上，这比榜单更能说明问题。当 AI 开始成为生产环境的一部分，能连续都维持同样水准才是真正的护城河。外加企业自己不要作妖。

译Notion因Anthropic的Opus 4.7和4.8模型出现性能下降（degraded performance），导致选择这些模型的用户故障率升高，已禁用所有Anthropic模型并将请求切流至替代供应商。大部分用户可继续使用Notion AI，但Anthropic专属功能不可用。评论认为这种应急切流比榜单更能说明问题：AI成为生产环境一部分后，长期维持一致性能才是真正护城河。

swyx@swyx · 6月7日49

idea - universal basic ai: 1 share of xai, oai, and ant to each US citizen. cost SpaceXai: $135.00 × 349 million = $47B OpenAI: $733.54 × 349 million = $256B Anthropic: $930.45 × 349 million = $325B total $628B cost is 8.5% of US Govt budget, 62% of defense budget, roughly same as ~$700B of TARP program during the Financial Crisis (mostly given to big banks, this would be a lot more decentralized)

译想法 - 全民基本AI：给每位美国公民一份xAI、OpenAI和Anthropic的股份。成本 SpaceXai：$135.00 × 3.49亿 = $470亿 OpenAI：$733.54 × 3.49亿 = $2560亿 Anthropic：$930.45 × 3.49亿 = $3250亿总计$6280亿，占美国政府预算的8.5%，国防预算的62%，大致与金融危机期间约$7000亿的TARP计划相当（该计划大部分给了大银行，而这次会去中心化得多）

meng shao@shao__meng · 6月7日19

没看懂啥意思，这里面除了 Sora 被 OpenAI 主动关停，其他都正常啊！？

Orange AI@oran_ge · 6月7日10

早晨问了几个灵魂提问 1. 为什么法国人假期那么多，但生活还那么好（不应该越努力越赚钱吗）？ 2. 中国为什么不能这样呢？ 3. 未来AGI之后，中国有希望成为法国吗？ Gemini 真难用啊… 谷歌真的是… 连基本的 chatbot 都做不好…

译博主清晨提出几个灵魂拷问：法国人假期多却生活优渥（难道不是越努力越赚钱？）；中国为何不能效仿？未来AGI后中国能否成为法国？随后直指Gemini难用，批评谷歌连基本的chatbot都做不好。