说实话，Garry Tan 这篇长帖，是我今年看到的最重要的 AI 文章，没有之一。大多数人看完估计只会惊叹：“哇，这个读书工具好厉害。” 但他们其实并没看懂，这不仅仅是一个工具，说是一份 AI 时代个人能力的指数级放大说明书更合适一些。先看那个最震撼的案例： Book Mirror。把一本 162 页的书扔进去，40 分钟后，产出 3 万字的深度脑页。注意，这可不是普通的读书笔记，而是要把作者的每一个观点，都精准映射到他自己的人生里—— 他的家庭历史、YC 工作、治疗笔记、和几百个创始人的对话。相当于这本书的作者专门花了两天时间，只和他一对一深聊，并且只聊和他最相关的那部分。比 $300/小时的治疗师高效 50 倍以上，而且这已经远远超越普通 RAG。普通 RAG 只能检索，

译Garry Tan的文章指出，以Book Mirror为例的先进AI工具，其意义远超普通读书工具或RAG系统。它能将一本162页的书籍在40分钟内转化为3万字的深度个人化分析，将书中观点精准映射到用户的个人历史、工作与经历中。这种交互相当于作者与用户进行两天的一对一深度对话，其效率远超传统方式，代表了AI对个人能力进行指数级放大的新范式。

Demis Hassabis@demishassabis · 5月10日58

Hard to believe it’s been 10 years since AlphaGo! It was wonderful to catch up with Lee Sae Dol last week in Korea and join Shin Jin-seo for a special Go match. Great to reminisce about AlphaGo & super interesting to hear how it changed the way players approach the game of Go!

译很难相信AlphaGo已经过去10年了！上周在韩国与Lee Sae Dol重逢，并与Shin Jin-seo进行了一场特别的围棋比赛，真是太棒了。重温AlphaGo很棒，并且听到它如何改变玩家下围棋的方式超级有趣！

Yuchen Jin@Yuchenj_UW · 5月10日62

Claude Opus 4.7 is over-trained on the Anthropic website. Every HTML page it designs has that unmistakable Anthropic flavor. GPT-5.5 is still weirdly weak at frontend. It designs frontend like it learned CSS from a backend engineer. OpenAI urgently needs an MTS with taste.

译Claude Opus 4.7 在 Anthropic 官网上训练过度。它设计的每个 HTML 页面都带有明显的 Anthropic 风格。 GPT-5.5 在前端方面仍然出奇地弱。它设计前端的样子就像是从后端工程师那里学的 CSS。 OpenAI 急需有品味的 MTS。

François Chollet@fchollet · 5月10日80

It was always the case that agency was self-compounding, but AI is magnifying the effect. Low-agency AI users further lose agency, high-agency AI users further gain agency.

译主观能动性向来具有自我增强的特性，而AI正在放大这种效应。低能动性的AI使用者进一步丧失能动性，高能动性的AI使用者则进一步增强能动性。

向阳乔木@vista8 · 5月10日27

从大模型迭代看，OpenAI Codex 口碑现在好过 Anthropic Claude。下一轮更新应该到谷歌的Gemini发力了吧好久都没啥声音了...

向阳乔木@vista8 · 5月10日33

啥都不能多了，物以稀为贵。多了就会觉得不好，从吉卜力图、蓝紫渐变网页、AI信息图、衬线字体杂志风PPT... 都说时尚审美有轮回，可能就是这原因？跟风难免，保持个性是最稀缺。

Sam Altman@sama · 5月10日56

what would you most like to see improve in our next model?

译你最希望在我们的下一代模型中看到哪些改进？

Elon Musk@elonmusk · 5月9日71

Tesla AI Vision

译Tesla通过分析真实车队碰撞数据，结合视觉系统与传感器，实现了安全系统的突破。传统碰撞传感器需要时间确认，降低阈值可能导致误触发。而视觉系统能提前“看到”即将发生的碰撞，与传感器协同，使约束控制器能更早、更准确地启动安全气囊和安全带预紧器。通过仿真重放碰撞并测量人体模型受力，团队发现提前部署能优化保护时机。这一改进使预测伤害严重程度整体显著下移，并通过OTA更新实现，是前所未有的安全提升。

向阳乔木@vista8 · 5月9日64

Claude Code负责人Boris Cherny的Youtube分享，让AI总结写了篇文章。 1. Claude Code 前六个月几乎无人使用，真正转折点是 Opus 4 上线。 2. Anthropic 内部 agent 通过 Slack 实时协作，所有人的 Claude agent 互相沟通协调，无人手写 SQL 和业务代码，AI-native 组织形态 3. TypeScript + React 因训练数据最丰富而率先被"解决"，冷门技术栈只需等待以后模型。 4. 识字率从 10% 到 70% 用了几百年，编程民主化会快得多，未来写会计软件最好的人是会计而非工程师(?) 5. AI 时代护城河重构，切换成本和流程壁垒变弱，但网络效应、规模经济、稀缺资源依然坚固，SaaS 不会消亡但会重新洗牌 6. 大公司受困于流程重塑和文化转型，而初创公司从第一天就能 AI-native 构建，起点即是大公司五年后的终点 https://blog.qiaomu.ai/programming-dead-ai-prediction

译Claude Code使用量在Opus 4模型上线后迎来转折点。Anthropic内部已实现AI原生协作，员工通过Slack中的Claude智能体沟通，无需手动编写代码。TypeScript和React因数据丰富被优先支持。编程民主化将快速推进，领域专家可能成为最佳软件构建者。AI将重构商业护城河，削弱流程壁垒，SaaS面临洗牌。初创公司能从零构建AI原生组织，获得领先优势。

Ethan Mollick@emollick · 5月9日46

As much as the state of benchmarks in AI is flawed, it is so much easier to track AI progress than robotics. Not sure what you can make of all the videos of robots running races or doing laundry - are there any equivalents to independent AI benchmarks for robots? ARC-AGI-BOT?

译AI基准测试虽有缺陷，但进展追踪相对容易；机器人学则缺乏明确的衡量标准，演示视频如赛跑或洗衣无法有效评估进展，需要建立类似AI的独立基准测试如ARC-AGI-BOT。引用推文指出，尽管对机器人技术充满期待，但使其在经济上大规模实用的关键飞跃时间表仍不确定，可能在1年、3年、5年或10年内实现。

Orange AI@oran_ge · 5月9日49

如果 AI 让每个人都提升了10倍为什么还要裁员呢？难道不是多多益善吗？

向阳乔木@vista8 · 5月9日35

不知道大家有没有类似感觉。跟去年底相比，Vibe Coding时骂模型的频率少了很多。说明模型显著进步了？

阿绎 AYi@AYi_AInotes · 5月9日60

卧槽，兄弟们，竟然被孙哥回关了❗ ❗ ❗ 激动的手都在抖，怎么有点热泪盈眶的感jio🥹 AI中转站太火爆了，孙哥和小特朗普都下场了，本来想趁着周末好好写一写孙哥为什么做AI中转站这件事，去他的主页找素材，发现不知道啥时候回关我了😳 平复下情绪说回正题，我搜了下最近的评论发现褒贬不一，大多数人都以为孙宇晨又做了一个卷价格的API中转站，殊不知其实他已经偷偷卡位了AI时代最赚钱的位置。 http://B.AI上线一个月，已经破了160万注册用户。 DeepSeek-V4大幅打折接近1折，接近白送，GPT、Claude、Gemini一个Key全搞定，支持加密钱包直连，无KYC，无审核。也有很多人嘲笑这又是一个卷低价的生意，赚点token差价的辛苦钱。但他们忘了，孙哥这辈子从来不会做低毛利的事，表面看是卷价格，实际上是在做战略卡位。他真正的目标根本就不是卖API，我理解孙哥核心目标是要成为AI Agent时代的中央银行。接下来我分三层讲透这个事，也请@justinsuntron 孙哥指正🫡

译作者因被孙宇晨回关而引发讨论，指出其AI中转站项目B.AI并非简单的低价API竞争。该项目上线一个月注册用户破160万，提供大幅折扣的DeepSeek-V4等模型，并支持加密钱包支付。核心观点认为，孙宇晨的战略目标是通过低价策略抢占入口，最终旨在成为未来AI Agent时代的“中央银行”，而非仅赚取API差价。

凡人小北@frxiaobei · 5月9日58

核心观点挺对我胃口： • Markdown 适合短文档，但超过 100 行就没人真看了。 • HTML 信息密度高：表格、SVG 图、交互、颜色、布局、代码注释、滑块、拖拽编辑器都能放进去。 • 对 agent 工作流尤其有用：spec、PR review、实现计划、研究报告、设计探索，都可以变成一个可读/可交互网页。 • 你不一定要做 /html skill，直接 prompt：“做一个 HTML 文件/HTML artifact” 就行。 • 最最最大的价值是让人重新进入 loop：你更愿意读、更容易判断、更能参与 agent 的方案。但人能在 loop 里还能待多久？

译推文主张在复杂文档和AI agent工作流中，HTML比Markdown更具优势。HTML信息密度高，能嵌入表格、SVG、交互组件和编辑器，适合用于编写技术规格、PR评审、研究报告等。用户无需精通HTML，可通过直接提示AI生成HTML文件。其核心价值在于促进人机协作循环：HTML文档更易阅读、评估和参与，从而让人更愿意介入并持续留在AI的工作流程中。关键挑战在于这种高效的人机协作模式能持续多久。

Chubby♨️@kimmonismus · 5月9日18

Holy sh*t! That jump! So the next model after Mythos will work a whole 8 hour work day at 80% success rate, I assume.

译我的天！这跨越！那么Mythos之后的下一代模型，我推测将能以80%的成功率完成整整8小时的工作日。

Chubby♨️@kimmonismus · 5月9日50

What is even more impressive is just how wide the gap between Claude Mythos and Gemini 3.1 Pro becomes when moving from a 50% success rate to an 80% success rate. Mythos doesn't just work "longer" - above all, it works significantly more accurately! That is the truly impressive part.

译更令人印象深刻的是，当成功率从50%提升到80%时，Claude Mythos与Gemini 3.1 Pro之间的差距会变得多么巨大。 Mythos不仅仅是"工作更持久"——最重要的是，它的工作准确率显著更高！这才是真正令人惊叹的部分。

Berryxia.AI@berryxia · 5月9日28

AI时代原来我的“嘴”才是核心竞争力！😂 靠“嘴口喷”代码，现在直接“靠嘴搞设计”！太特么好了，AI时代我也算是吃上细糠了。

Ethan Mollick@emollick · 5月9日23

I dunno I think at this point it must take some special effort to keep the names as baffling as they are.

译我觉得事到如今，要保持这些名称如此令人困惑，肯定需要一些特别的努力。

宝玉@dotey · 5月9日52

其实我不太赞同文章中的观点两者不是非此即彼的关系，Markdown 是用来保存信息，HTML 是用来展示和交互。对于 LLM 来说，Markdown 的信息密度是最高的，HTML 是臃肿的，并不适合 HTML 阅读和生成，即使现在 AI 生成 HTML，也需要借助 React 这样的前端技术拆分成模块才能生成的比较好，一个大的 HTML 是无法一次性生成好的。

译作者反驳了将Markdown与HTML对立看待的观点，指出两者功能不同：Markdown用于高效保存信息，HTML则负责展示与交互。对于LLM而言，Markdown信息密度高，而HTML结构臃肿，不适合直接用于阅读或生成。即便当前AI生成HTML，也需借助React等前端技术将其拆解为模块，难以一次性生成完整的大段HTML代码。引用推文提供了相关讨论的背景。

elvis@omarsar0 · 5月9日49

My favourite new stack: Agents + MCP + Markdown + HTML “Files over apps” is a vibe!

译作者提出以智能体、MCP、Markdown和HTML为核心的新工作流，秉持“文件优先于应用”理念。其核心是结合LLM维基与HTML制品：LLM维基用于捕获关键信息，赋能智能体完成有意义的工作；动态的HTML制品则以交互式、可扩展的方式呈现信息，并支持与智能体双向通信。这种组合能构建强大工具，实现收件箱清零、兴趣追踪、快速原型设计、深度研究及实验调度等多种功能，适用于设计师、工程师和研究人员等广泛群体。HTML并非取代Markdown，二者协同工作效果更佳。

Berryxia.AI@berryxia · 5月9日65

Geoffrey Hinton直接把AI思考的本质讲透了！ “语言只是思考的工具之一，不是全部。” 他说，人类和AI都能用文字建模几乎任何东西，但真正的思考远远超出文字，它还包括图像、空间感、物理运动这些更丰富的形式。最聪明的系统，就是那个能把所有这些模态全部用起来的系统。这才是下一个大突破的方向：不再是单纯把语言玩得更溜，而是让AI真正学会像人类一样，用多感官、多维度的方式去理解世界。当AI开始同时掌握文字、图像、空间和动作的时候，智能的边界会彻底被打破。所以多模态才是最终的出路？那么，文本的基础肯定也不能丢失，基石打好才有更好的发效果。

译Geoffrey Hinton指出语言只是思考的工具之一，AI推理可以是真正的思考，因为语言本身就是一种思考形式。人类和AI能用文字建模，但真正思考超越文字，涵盖图像、空间感和物理运动等多模态。最聪明的系统是能整合所有模态的AI，这代表下一个大突破方向：让AI像人类一样通过多感官、多维度理解世界，从而打破智能边界。多模态是最终出路，但文本基础仍需巩固以确保发展效果。

Sam Altman@sama · 5月9日51

call me maybe

译打电话给我，也许

elvis@omarsar0 · 5月9日64

LLM Wikis + HTML Artifacts are insanely powerful. You should seriously consider this in your workflows. LLM Wikis captures all the important information that lets you and your agents do meaningful work. HTML artifacts present that information in interesting ways that allow you to take important actions along with your agents. My HTML artifacts sit on top of my LLM wikis. They are dynamic and are easily extended as needs arise. I have hooked my Artifacts to talk to my agents, and similarly, the agents can talk to artifacts. This has allowed me to build powerful artifacts that reduce my inbox to zero, keep me updated on any topic of interest, fast prototyping, do deep research, design/trigger new experiments, generate figures to improve understanding, schedule research, search relevant information, discover topics, and so much more. What you see in the clip is not a website. It's a simple interactive HTML artifact. HTML artifacts are useful for designers, engineers, researchers, students, and anyone working with agents. Lastly, HTML doesn't replace Markdown. They are a much better combination working together.

译LLM Wikis用于捕获关键信息，赋能用户与智能体进行有意义的工作。HTML Artifacts则以动态、可交互的方式呈现这些信息，支持与智能体双向通信。两者结合可构建强大工具，实现收件箱清零、实时更新关注领域、快速原型设计、深度研究、实验设计与触发、生成解读图表、安排研究计划、搜索相关信息及发现新主题等功能。文中展示的交互界面并非传统网站，而是轻量级HTML Artifact。该组合方案适用于设计师、工程师、研究人员、学生及所有使用智能体的工作者，且HTML与Markdown可互补协同，形成更优工作流。

阿绎 AYi@AYi_AInotes · 5月9日58

说个暴论，在AI时代最值钱的技能已经不是写代码了，怎么把代码讲清楚将会变得越来越重要！怎么把代码讲清楚将会变得越来越重要！ Anthropic Claude Code团队的@trq212 大神用不到两年时间，把自己的技术文章做到了稳定的百万浏览量。他说，技术写作彻底改变了他的人生。兄弟们你们可能不信，老哥的方法论简单到离谱，就四个字：先种后收。先埋头做真实的工作，踩真实的坑，积累第一手的洞见，然后把你学到的东西，变成别人能直接抄的经验。就是说写文章不能为了写而写，得先有干货，再有输出。他写爆文就靠两条铁则：第一，能多简单就多简单，用复杂术语装逼，只能说明你自己都没搞懂。第二，分享真正的秘密，因为人们不爱看正确的废话，只爱听别人不说的内部干货。最狠的是，老哥在这场15分钟的线下workshop现场，直接写了一篇《HTML vs Markdown for agents》的长文。发出去几个小时，就拿了26万浏览，用行动证明了他教的东西真的管用。当然他也用Claude加速写作，但他反复强调：绝对不能让AI磨掉你的个人声音，因为这才是能持续输出爆款的核心。尽管很多人酸，说他能火只是因为他在Anthropic，不可否认内部视角是天然的流量buff，但我觉得更重要的是，他掌握了这个时代最稀缺的能力：把复杂的技术，讲得让同行立刻能用。 AI会写代码，会做产品，会调模型，但它不会讲你的故事，不会分享你踩过的坑，不会有你独有的思考方式。而写作，就是把你的个人经验，放大一万倍的杠杆。更妙的是，写作本身会反过来逼你把产品和系统想得更清楚。就像很多时候，你以为你懂了，但只有当你试着把它写下来的时候才发现自己其实并没有懂。所以各位程序员兄弟们，别再觉得技术写作是大佬的事。从今天开始，把你每天踩的坑、学到的新东西，写成100字的笔记。先播种，再收割，这个复利一旦滚起来，会比你想象的可怕得多。如果中推里推荐一位标杆和大神，那一定是宝玉老师@dotey，从宝玉老师身上我学到的很重要的一点是，践行开源精神，你分享的越多，跟着你学习的人越多，respect！

译AI时代，解释代码的能力比编写代码更具价值。Anthropic的Thariq (@trq212) 通过技术写作在两年内使文章稳定达到百万浏览量，称技术写作彻底改变了他的人生。他的方法论是“先种后收”：先积累实践经验，再输出可复用的洞见。写作原则强调简单易懂和分享内部干货。他在工作坊中现场演示写作并获得高浏览，验证了方法有效性。使用Claude加速写作时，他坚持保持个人声音，这是持续产出爆款的关键。技术写作能将个人经验放大为杠杆，并促进更清晰的思考。引用推文指出，技术写作是获得观点、建立权威的免费而有效方式。

Noam Brown@polynoamial · 5月9日28

I’m proud to be working at a company being sued by Elon rather than one partnering with him

译我自豪于在一家正被Elon起诉的公司工作，而非与他合作的公司。

Greg Brockman@gdb · 5月9日30

feeling the magic

译感受魔力 [引用 @BenBajarin]：描述Codex在Chrome中运行的唯一方式就是纯粹的魔法。 "任何足够先进的技术都与魔法无异。"——亚瑟·C·克拉克第三定律

elvis@omarsar0 · 5月9日48

I prefer to combine both HTML and MD. I have been sharing a few examples of LLM Artifacts like this one: https://x.com/omarsar0/status/2052062654810980618?s=46&t=lM541_vvE8ML-ozRBlFUDg

译我更喜欢结合HTML和MD。我一直在分享一些LLM Artifacts的例子，比如这个：https://x.com/omarsar0/status/2052062654810980618?s=46&t=lM541_vvE8ML-ozRBlFUDg [引用 @trq212]：HTML是新的markdown。我已经几乎停止为所有事情写markdown文件，转而使用Claude Code为我生成HTML。这就是原因。

Chubby♨️@kimmonismus · 5月9日26

I have a hunch about what this vague hint is meant to convey: OpenAI’s super app is coming. The reference calls to mind Sam Altman’s Death Star meme.

译我对这个模糊暗示的含义有种预感：OpenAI的超级应用即将到来。这个引用让人想起山姆·奥特曼的死星梗。

Thariq@trq212 · 5月9日65

http://x.com/i/article/2052796100608974848 # Using Claude Code: The Unreasonable Effectiveness of HTML Markdown has become the dominant file format used by agents to communicate with us. It’s simple, portable, has some rich text capability and is easy for you to edit. Claude has even gotten surprisingly good at using ASCII to make diagrams inside of markdown files. But as agents have become more and more powerful, I have felt that markdown has become a restricting format. I find it difficult to read a markdown file of more than a hundred lines. I want richer visualizations, color and diagrams and I want to be able to share them easily. I'm also increasingly not editing these files myself, but using them as specs, reference files, brainstorming outputs, etc. When I do make edits, I’m usually prompting Claude to edit them, which removes one of markdown’s largest benefits. I’ve started preferring HTML as an output format instead of Markdown and increasingly see this being used by others on the Claude Code team, this is why. (if you want to start with some examples, you can see a bunch here: https://thariqs.github.io/html-effectiveness, just be sure to come back and read more about why) # Why HTML? ## Information Density HTML can convey much richer information compared to markdown. It can of course do simple document structure like headers and formatting, but it can also represent all sorts of other information such as: - Tabular data using tables - Design data with CSS - Illustrations with SVG - Code snippets with script tags - Interactions using HTML elements with javascript + CSS - Workflows using SVG and HTML - Spatial data using absolute positions and canvases - Images using image tags I would go so far as to say that there is almost no set of information that Claude can read that you cannot fairly efficiently represent with HTML. This makes it a highly efficient way for the model to communicate in-depth information to you and for you to revie wit. I’ve found that in the absence of being able to do this, the model may do more inefficient things in markdown like ASCII diagrams or, my favorite, estimating colors with unicode characters like in this screenshot from Claude Code. ## Visual Clarity & Ease of Reading As Claude is able to do more complex work, it is also writing larger and larger specs and plans. In practice, I've found I tend to not actually read more than a 100-line markdown file, and I certainly am not able to get anyone else in my organization to read it. But HTML documents are much easier to read, Claude can organize the structure visually to be ideal to navigate with tabs, illustrations, links, etc. It can even be mobile responsive so you can read it differently based on your form factor. ## Ease of Sharing Markdown files are fairly hard to share since most browsers do not render them natively well. You often have to add them as attachments to emails or messages. With HTML, as long as you upload the file (for example to S3), you can share the link easily. Your colleagues can open it wherever they wish and easily reference it. The chance of someone actually reading your spec, report or PR writeup is much much higher if it’s in HTML. ## Two-way Interaction HTML can allow you to interact with the document, for example you might want to ask it to add sliders or knobs to adjust a design or allow you to tweak different options in the algorithm to see what happens. You can also ask it to let you copy these changes into a prompt to paste back into Claude Code. Read more about my playgrounds post to see examples of this two way interaction: https://x.com/trq212/status/2017024445244924382 Data Ingestion Why use Claude Code to make HTML files instead of ClaudeAI or Claude Design for example? One of the biggest reasons is all the context Claude Code can ingest. For example, when writing this article, I asked Claude Code to read through my code folder and find all the HTML files I’ve generated, group and categorize them and then make an HTML file with all diagrams representing each type. The diagrams you see in this article are a direct result of that. Besides the file system, Claude Code can find additional context using your MCPs (like Slack, Linear, etc.), your web browser (with Claude in Chrome), your git history, etc. ## It’s Joyful Making HTML documents with Claude is just more fun and makes me feel more involved and invested in the creation, and that by itself is enough. ## How to Get Started I’m a little bit afraid that people will read this article and turn it into a /html skill or something. While there might be some value in that, I want to emphasize that you don’t need to do much to get Claude to do this. You can just ask it to “make a HTML file” or “make a HTML artifact”. The trick is knowing what you want the artifact to do and how you might use it. You may over time make a skill, but for now I’d suggest just prompting from scratch to get a hang of how to use it in different cases. # Use Cases To make this more concrete, I’ve made many different HTML files for different use cases. You can view all of them here: https://thariqs.github.io/html-effectiveness/ but here’s an overview. ## Specs, Planning & Exploration HTML is a rich canvas for Claude to dive into a problem. When I start working on a problem instead of a simple markdown plan I expect to make a web of HTML files. For example, I might start with asking Claude Code to brainstorm and create some explorations of different options. I would then ask it to expand more into one, maybe make mockups or code snippets. Finally, when I feel good I’ll ask it to write an implementation plan. When I’m happy with the plan I’ll create a new session and pass in all of these files for it to implement. When verifying I’ll also ask the verification agent to read in the files and it will have much broader context on what is needed. Example Prompts: - I'm not sure what direction to take the onboarding screen. Generate 6 distinctly different approaches — vary layout, tone, and density — and lay them out as a single HTML file in a grid so I can compare them side by side. Label each with the tradeoff it's making. - Create a thorough implementation plan in a HTML file, be sure to make some mockups, show data flow and add important code snippets I might want to review. Make it easy to read and digest. Use Cases: - Exploring other ways to implement something in code - Exploring multiple visual designs ## Code Review & Understanding Code can be difficult to read in a Markdown file. But with HTML we can render diffs, annotations, flowcharts, modules, etc. Use this to understand code that the agent has written, to get code review or to explain a PR to someone reviewing your code. I find this often works better than the default Github diff view, and I attach a HTML code explainer to every PR I make now. Example prompt: Help me review this PR by creating an HTML artifact that describes it. I'm not very familiar with the streaming/backpressure logic so focus on that. Render the actual diff with inline margin annotations, color-code findings by severity and whatever else might be needed to convey the concept well. Use Cases: - Creating a PR - Reviewing a PR - Understanding a topic in Code ## Design & Prototypes Claude Design is based on HTML because HTML is incredibly expressive at design, even if your end surface is not HTML. Claude can sketch out a design in HTML and then write it in your language of choice, be it React, Swift, etc. You can also prototype interactions, such as animations, actions, etc. Consider asking Claude to make sliders, knobs, etc. to tune in exactly what you’re looking for. Example prompt: I want to prototype a new checkout button, when clicked it does a play animation and then turns purple quickly. Create a HTML file with several sliders and options for me to try different options on this animation, give me a copy button to copy the parameters that worked well. Use this for: - Creating design system artifacts - Adjusting components - Visualizing component libraries - Prototyping Joyful Animations ## Reports, Research & Learning Claude Code is incredibly good at synthesizing information across multiple data sources and converting it into a report for readability. You can prompt Claude to search your Slack, your codebase, git history, the internet, etc. and use it to generate extremely readable reports for yourself, for leadership, for your team, etc. You could assemble this in the form of a long HTML document, an interactive explainer or even a slideshow/deck. Ask Claude to use SVG for diagrams to help visualize it. For example, for my posts on prompt caching, I asked Claude to prepare an in-depth research file in HTML for me to read on all of our changes to prompt caching after reading the git history. Example prompt: I don't understand how our rate limiter actually works. Read the relevant code and produce a single HTML explainer page: a diagram of the token-bucket flow, the 3–4 key code snippets annotated, and a "gotchas" section at the bottom. Optimize it for someone reading it once. Use this for: - Summarize how a feature works - Explain a concept to me - Weekly status reports to your boss - Incident reports to your leadership - SVG illustrations, flowcharts, technical diagrams, etc # Custom Editing Interfaces Sometimes it’s hard to describe what you want purely in a text box. In this case, I'll ask Claude to build me a throwaway editor for the exact thing I'm working on. Not a product, or a reusable tool, but a single HTML file, purpose-built for this one piece of data. The trick is always to end with an export: a "copy as JSON" or "copy as prompt" button that turns whatever I did in the UI back into something I can paste into Claude Code. Example prompts: - I need to reprioritize these 30 Linear tickets. Make me an HTML file with each ticket as a draggable card across Now / Next / Later / Cut columns. Pre-sort them by your best guess. Add a "copy as markdown" button that exports the final ordering with a one-line rationale per bucket. - Here's our feature flag config. Build a form-based editor for it, group flags by area, show dependencies between them, warn me if I enable a flag whose prerequisite is off. Add a "copy diff" button that gives me just the changed keys. - I'm tuning this system prompt. Make a side-by-side editor: editable prompt on the left with the variable slots highlighted, three sample inputs on the right that re-render the filled template live. Add a character/token counter and a copy button. Use this for: - Reordering, triaging, or bucketing anything (tickets, test cases, feedback) - Editing structured config (feature flags, env vars, JSON/YAML with constraints) - Tuning prompts, templates, or copy with live preview - Curating datasets, approve/reject rows, tag examples, export the selection - Annotating a document, transcript, or diff and exporting the annotations - Picking values that are painful to express in text: colors, easing curves, crop regions, cron schedules, regexes. ## Frequently Asked Questions I’ve been telling many people about how I’ve switched to HTML and I’ve seen a few repeated questions. Isn’t it less token efficient? While markdown often uses fewer tokens, I’ve found that the added expressiveness of HTML and the much higher likelihood of me reading it means I get overall better output. With the 1MM context window in Opus 4.7, the increased token usage is not really noticeable in the context window. When do you use markdown for now? I have honestly stopped using markdown altogether for almost everything, but I’m probably far on the HTML maximalist side of things. How do I view the HTML file? I tend just open it in a browser locally (you can ask Claude to open it), or upload to S3 if you want a shareable link. Doesn't this take longer to generate than markdown? This does take longer! HTML can take 2-4x longer than Markdown, but I've found the results are worth it. What about version control? This is honestly one of the biggest downsides of HTML, HTML diffs are noisy and hard to review compared to Markdown. How do I get Claude to match my taste / not make it ugly? The frontend design plugin helps Claude make good HTML files. But to match your own companies style, you can create a single design system HTML file by pointing Claude at your codebase. You can then use that design system file as a reference for other html files. ## Stay in the Loop All of the above is to say that I think the real reason I use HTML is that I feel much more in the loop with Claude. I had begun to fear that because I had stopped reading plans in depth I would simply have to leave Claude to make its choices. But I am happy to say instead that I feel more in the loop than ever before when using HTML. I hope you do too.

译作者指出，随着AI智能体能力增强，Markdown在传达复杂信息时显得局限。HTML因其更高的信息密度、视觉清晰度、易于分享和双向交互能力，正成为更优的输出格式。HTML能承载表格、CSS样式、SVG插图、可交互元素等丰富内容，远超Markdown的ASCII图表等有限表达。此外，HTML文档更易于阅读和传播，并能通过链接直接分享，提高了团队协作中技术文档的查阅率。Claude Code因其强大的上下文摄取能力，特别适合用于生成此类HTML文件。

Thariq@trq212 · 5月9日51

HTML is the new markdown. I've stopped writing markdown files for almost everything and switched to using Claude Code to generate HTML for me. This is why.

译HTML 是新的 Markdown。我已几乎停止为所有内容编写 Markdown 文件，转而使用 Claude Code 为我生成 HTML。原因如下。 [引用 @trq212]：http://x.com/i/article/2052796100608974848

Chubby♨️@kimmonismus · 5月9日55

The surprising part is not just that Claude Mythos is powerful. It is that OpenAI seems to have closed much of the cyber-capability gap with GPT-5.5 Cyber in weeks, not years. On AISI’s expert cyber tasks, GPT-5.5 Cyber was roughly on par with Mythos and even slightly ahead on pass rate, while being materially cheaper per token. But Mythos still has the stronger public real-world proof point: Mozilla’s large-scale Firefox vulnerability work. Be that as it may, 2026 increasingly looks like OpenAI’s comeback year: stronger releases, more cost-effective models, and a series of decisions that seem to be landing at exactly the right moment.

译OpenAI的GPT-5.5 Cyber在网络安全能力上迅速缩小与Claude Mythos的差距，耗时仅数周而非数年。在AISI的专家网络任务中，两者表现接近，GPT-5.5 Cyber通过率甚至略高，且每token成本显著更低。但Mythos在公开实践案例上仍占优势，如协助Mozilla进行大规模Firefox漏洞排查。2026年正成为OpenAI的强势回归之年，其模型性能更强、成本效益更高，且一系列决策时机精准，展现出强劲复苏态势。

向阳乔木@vista8 · 5月9日22

终于理解产品为什么会变得臃肿复杂了。当你处于老板位置，控制大量研发资源时，觉得做什么都只需要张张嘴。就像我们现在用最顶级的模型一样的感觉，哈哈哈。克制是挺难的一件事儿，资源有时是一种诅咒。

译推文核心观点是产品变得臃肿复杂的原因在于资源丰富时，控制者如老板容易产生只需动嘴就能实现想法的错觉，导致过度开发而缺乏克制。类比使用最顶级模型时的轻松感，资源有时反而成为诅咒，突显了在管理大量研发资源时保持简约的挑战性。资源充裕可能引发盲目扩张，使产品设计失去焦点，强调克制在创新过程中的重要性。

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes · 5月9日50

No, blue collar jobs are not safe either

译不，蓝领工作也不安全 [引用 @Figure_robot]：We taught two F.03 robots to clean a room and make a bed in under 2 minutes - fully autonomous.

Ethan Mollick@emollick · 5月9日59

I realize that “Mythos as hype” means two different things to different groups. For insiders, it means “Mythos was not a magical step-change in AI ability.” For outsiders, it means “Mythos couldn’t really find zero day exploits” The latter was wrong, the former was likely right

译我意识到“神话即炒作”对不同群体意味着两件事：对业内人士而言，它意味着“Mythos并非AI能力的魔法式跨越进步”；对圈外人而言，则意味着“Mythos其实找不到零日漏洞”。后者是错的，前者可能是对的。

Jim Fan@DrJimFan · 5月8日79

I promise this will be the best 20 min you spend today! Robotics: Endgame, the sequel to my last year's Sequoia AI Ascent talk, "Physical Turing Test". I laid out the roadmap for solving Physical AGI as a simple parallel to the LLM success story. Be a good scientist, copy homework ;) And stay till the end, more easter eggs and predictions for your polymarket! 00:30 DGX-1 origin story at OpenAI, I was there in 2016 signing with Jensen and Elon. Heading to the Computer History Museum! 01:42 The Great Parallel 03:31 Robotics, the Endgame 03:39 Why VLAs fall short 04:32 Video world models as the 2nd pretraining paradigm 06:09 World Action Models (WAM) 07:46 Strategies for robot data collection and the FSD equivalent to physical data flywheel for robot manipulation 11:06 EgoScale and the Dexterity Scaling Law we discovered recently 14:00 Physical RL: bridging the last mile 15:39 DreamDojo: an end-to-end neural physics engine for scaling RL in silico 17:00 Civilizational Technology Tree and my predictions for the near future. Spoiler: it's closer than you think. Thanks to my friends at Sequoia for inviting me back to AI Ascent this year! I had a blast! Last year's talk is attached in the thread if you missed it.

译演讲者以“Robotics: Endgame”为题，提出解决物理AGI的路线图，直接类比LLM的成功路径。核心观点包括视频世界模型作为第二预训练范式、世界行动模型（WAM）、机器人数据收集策略（类似FSD的物理数据飞轮）、EgoScale和灵巧性缩放定律、物理强化学习 bridging the last mile，以及DreamDojo端到端神经物理引擎。预测物理AGI的实现比预期更近，并提及2016年参与OpenAI DGX-1签署与Jensen和Elon的个人经历。

Orange AI@oran_ge · 5月8日41

有点意外马斯克在 xAI 上的第一次尝试竟然失败了小扎也是失败了一次过去成功的方法论在大模型领域似乎无法复用

Rohan Paul@rohanpaul_ai · 5月8日31

AI is not going to replace human beings,” said @mcannonbrookes speaking to @l2k on @wandb’s podcast. And it definitely stayed with me. AI & agents are only going to enable us to do more, build more, and help humanity realize its full potential. #TLVCPartner

译“AI 不会取代人类，” @mcannonbrookes 在 @wandb 的播客中对 @l2k 说。这确实让我印象深刻。 AI 和代理只会使我们能做更多、建设更多，并帮助人类实现其全部潜力。 #TLVCPartner

Chubby♨️@kimmonismus · 5月8日54

Anthropics co-founder Jack Clark: "My prediction is that by the end of 2028, it is more likely than not that we will have an AI system where you could say to it: "Make a better version of yourself." And it would simply go off and do that completely autonomously." Its coming.

译Anthropics联合创始人Jack Clark： “我的预测是：到2028年底之前，我们很可能会拥有一个AI系统。你可以对它说： ‘创造一个更好的自己。’ 然后它就能完全自主地去执行这个任务。” 它要来了。

Berryxia.AI@berryxia · 5月8日63

社会发展过程包括AI时代都不可能逾越过某些特定阶段！ Demis 这次意外解释了它！ Demis Hassabis直接把AGI发展的优先级讲得清清楚楚！ “先把它做成工具，再去考虑意识和心智的问题。” 他说，先用AGI去读懂宇宙的语言，等真正理解之后，再决定要不要给它加上代理能力或者意识。这不是小事，而是把整个路线图彻底理顺了。很多人现在一上来就讨论“让AI有意识”“让AI有主观体验”，但Demis的观点完全反过来：先把工具做好，把宇宙的底层规律搞明白，再谈后面那些更危险、更哲学的问题。这才是真正务实、也最稳的路径。避免过早踩进代理和意识的雷区，先把生产力拉满。视频里他说得特别平静，但信息量极大。 AGI的下一步，谁先先行？

译Demis Hassabis明确AGI发展应分阶段进行，优先将其作为工具用于理解宇宙底层规律，而非过早赋予意识或代理能力。他强调这种务实路径能避免风险，先提升生产力，再处理更哲学和危险的问题。引用推文也指出AGI应先成为工具，再尝试赋予意识，先用于读懂宇宙语言。这一反向思维理顺了发展路线图，为AGI的下一步提供了稳健方向。

Rohan Paul@rohanpaul_ai · 5月8日58

AI is not going to replace human beings,” said @mcannonbrookes speaking to @l2k on @wandb’s podcast. And it definitely stayed with me. AI & agents are only going to enable us to do more, build more, and help humanity realize its full potential.

译AI不会取代人类，”@mcannonbrookes 在@wandb的播客中对@l2k说道。这句话确实让我印象深刻。 AI和智能体只会让我们能做得更多、创造更多，并帮助人类实现全部潜能。