AIHOT

Kling AI@Kling_ai · 4月25日43

See Image 2 posters transform into stunning 4K motion, powered by Kling4K.

译看 Image 2 海报在 Kling4K 驱动下，转变为惊艳的 4K 动态效果。

阿绎 AYi@AYi_AInotes · 4月25日62

哇靠， 50个GPT-5.4同时开工，一天之内关闭了4000个GitHub issue， OpenClaw 之父steipete昨天上线了Clawsweeper，一个专门治理代码洪流的AI维护机器人，它24小时不间断扫描仓库里所有的issue和PR，只在证据极强的时候才建议关闭，理由严格限定为5类，绝对不会乱关，最酷的是它没有任何传统仪表盘，所有运行状态和统计数据直接实时写回README，极简到了极致， 50个AI agent制造了垃圾，现在用50个AI agent来清理，以前大家都在吹AI写代码有多厉害，现在才发现，AI治理代码才是真正的刚需，我之前也觉得这就是个高级stale bot，看完才反应过来，这压根不是一个清理工具这么简单，更像是开源维护范式的彻底改变，以前是人盯仓库，永远跟不上AI生成的速度，现在是AI管AI，维护成本再也不是开源项目的瓶颈了，现在唯一的限制已经不是模型不够强，关键看GitHub和OpenAI的rate limit，等这个问题解决了，整个GitHub的陈年垃圾可能都会被扫一遍。老规矩GitHub地址评论区自取：

译OpenClaw之父steipete推出AI维护机器人Clawsweeper，旨在应对AI生成代码带来的管理洪流。该工具部署50个AI智能体全天候扫描仓库issue和PR，仅在证据确凿时按严格限定的五类理由建议关闭，单日可处理约4000条。其设计极简，无传统仪表盘，所有状态数据实时写入README。这标志着开源维护从“人盯仓库”转向“AI管AI”，核心瓶颈从模型能力变为平台速率限制，被视为对开源维护范式的根本性改变。

Peter Steinberger 🦞@steipete · 4月25日46

acpx 0.6.0 is out. (control codex/claude via agents) Highlights: Claude system-prompt controls, session pruning, embeddable turn handles, --no-terminal, persistent-session fixes, WSL cwd translation, queue hardening, and clearer error hints. https://github.com/openclaw/acpx/releases/tag/v0.6.0

译acpx 0.6.0 已发布。（通过代理控制 codex/claude）亮点：Claude 系统提示控制、会话修剪、可嵌入的回合句柄、--no-terminal、持久会话修复、WSL 当前工作目录翻译、队列强化，以及更清晰的错误提示。https://github.com/openclaw/acpx/releases/tag/v0.6.0

阿绎 AYi@AYi_AInotes · 4月25日61

卧槽，OpenAI Codex团队刚放了个大招，直接把所有第三方语音输入工具干懵了，所有ChatGPT订阅用户，现在可以在桌面任何地方直接语音输入，不用切App，不用额外花钱，设置一个热键，按住说话，松开文字直接进任何文本框，记事本，浏览器，VS Code，Slack，全平台通用整个演示视频只有6秒，丝滑到离谱，评论区已经刷爆了，全是RIP Wispr Flow，RIP Superwhisper，这些之前靠系统级语音输入活的小工具，现在直接被OpenAI用订阅额度免费送了，你不用再多花十几刀每月，也不用担心模型更新慢，说实话，我之前每个月花12刀用Wispr Flow，感觉现在直接可以卸载了，本以为这就是个方便的小功能，看完才反应过来，这根本不是加个语音输入这么简单，这是OpenAI在把Codex变成真正的AI操作系统，以前你要打开ChatGPT才能用AI，现在AI就在你的键盘上，随时随地等着听你说话，以后AI厂商之间拼的再也不是谁的语音输入模型好，关键是看谁能先把AI嵌进用户的每一个日常操作里。

译OpenAI为ChatGPT订阅用户推出系统级语音输入功能，用户设置热键即可在桌面任何应用（如记事本、VS Code）中直接语音输入并转为文字。此举直接冲击Wispr Flow等付费第三方工具，用户无需额外付费，体现OpenAI将AI嵌入操作系统的战略，推动AI与工作流集成。

Kling AI@Kling_ai · 4月25日43

720p saw the light beam, but 4K sees every ray carving through dust. ✨ See more in Kling 4K.

译720p 看到了光束，但 4K 能看到每一道穿透尘埃的光线。✨ 在 Kling 4K 中查看更多。

Chubby♨️@kimmonismus · 4月25日31

Bolt by MirrorMe | Claims speeds of 11m/s indoors, 10.09 m/s outdoor so far (Usain Bolt's top speed is 12.42 m/s) Now robots outrun the fastest humans on earth.

译Bolt by MirrorMe | 宣称室内速度达11米/秒，室外目前达10.09米/秒（尤塞恩·博尔特最高速度为12.42米/秒）如今机器人已能超越地球上最快的人类。

Elon Musk@elonmusk · 4月25日51

Grok Imagine

译Grok 想象

Berryxia.AI@berryxia · 4月25日54

既然都已经收到了人家Cursor 馈赠的1w刀的Credits！那岂不是这个月把它狠狠用起来！你们有没有觉得比较可以消耗toeken的项目我来试试啊！谢谢Edwin 和 Cursor！！！

译既然已经收到了Cursor馈赠的1万美元的积分！那这个月就狠狠地用它吧！你们有没有觉得可以消耗token的项目？我来试试啊！谢谢Edwin和Cursor！！！

宝玉@dotey · 4月25日68

Cursor 3 上线了 /multitask 功能，支持同时跑多个异步子智能体（sub-agent），不用再排队等前一个任务做完才开始下一个。已经在排队中的任务也可以随时切换成并行模式。

Greg Brockman@gdb · 4月25日75

gpt-5.5 is now in GitHub Copilot!

译gpt-5.5 现已登陆 GitHub Copilot！ [引用 @github]：🆕 @OpenAIDevs GPT-5.5 现已全面推出，并正在 GitHub Copilot 中逐步上线。我们的早期测试显示 ➡️ 它在复杂的智能体编码任务上表现出最强的性能 ➡️ 它解决了以往 GPT 模型无法应对的实际编码挑战请在 Copilot CLI 或 @code 中试用。👇 https://github.blog/changelog/2026-04-24-gpt-5-5-is-generally-available-for-github-copilot/

小互@xiaohu · 4月24日56

OpenAI 刚发的 Workspace Agent，开源版来了 · 可任意模型，Claude / GPT / Gemini / Kimi / DeepSeek 都能接 · 可在自己服务器上跑，最低 €4/月 · 每个会话有独立 Docker 沙箱 · 每个终端用户凭证隔离 · 子 agent 调用全程可观测，不是黑盒它能帮你做这些事： · 给公司团队搭一套 AI Agent 服务，模型随便换，不被 Claude 或 GPT 锁死 · 给 SaaS 产品加 AI 助手，每个用户各自登录各自的账号不串号 · 做 Telegram、Discord AI 机器人，自带 Telegram 适配器 · 跑企业内部受控 Agent，可限制只能访问指定 API，不能乱出公网 · 每个会话独立运行，一个崩了不影响其他

译开源项目 openclaw-managed-agents 提供了类似 OpenAI Workspace Agent 的功能，核心特点是支持接入任意大模型（如 Claude、GPT、Gemini 等）并可自托管于自有服务器，成本可低至每月4欧元。其采用独立 Docker 沙箱架构，确保每个用户会话隔离运行，实现凭证安全与互不影响，且子 agent 调用过程全程可观测。该方案适用于为企业搭建可灵活切换模型的 AI Agent 服务、为 SaaS 产品添加隔离的 AI 助手、构建社交平台机器人或运行内部受控、仅能访问指定 API 的安全 Agent。

Greg Brockman@gdb · 4月24日66

auto-review now live in codex — using a guardian agent to evaluate the safety of proposed actions, reducing human approvals to only when they're really needed.

译Codex现已上线自动审查功能——通过守护智能体评估拟执行操作的安全性，仅在真正需要时才要求人工批准。

Claude@claudeai · 4月24日51

Claude can now connect to more of the apps you use outside of work, including @Tripadvisor, @bookingcom, @resy, @Instacart, @Spotify, @audible_com, @AllTrails, @thumbtack, Intuit @turbotax, and more.

译Claude 现在可以连接更多您在工作之外使用的应用程序，包括 @Tripadvisor、@bookingcom、@resy、@Instacart、@Spotify、@audible_com、@AllTrails、@thumbtack、Intuit @turbotax 等。

TestingCatalog News 🗞@testingcatalog · 4月22日34

GOOGLE 🚨: REFERENCES TO AN UPDATED DEEP RESEARCH AND DEEP RESEARCH MAX MODELS HAVE BEEN SPOTTED! - deep-research-max-preview-04-2026 - deep-research-preview-04-2026 Google Deep Max Ultra Pro 👀

译GOOGLE 🚨: 已发现关于更新版深度研究和深度研究MAX模型的引用！ - deep-research-max-preview-04-2026 - deep-research-preview-04-2026 Google Deep Max Ultra Pro 👀

DogeDesigner@cb_doge · 4月22日32

Grok 4.3 can explain memes.

译Grok 4.3 可以解释梗图。

Chubby♨️@kimmonismus · 4月21日62

ChatGPT Image 2 coming today!

译ChatGPT 图像2 今天发布！

小互@xiaohu · 4月21日45

GPT image 2 今晚发布💯 敬请期待…

OpenAI@OpenAI · 4月21日34

This is not a screenshot.

译这不是一张截图。

AK@_akhaliq · 4月21日42

Kimi K2.6 is available in huggingchat

译Kimi K2.6 现已在 huggingchat 上可用

宝玉@dotey · 4月19日47

很荣幸我的Skills开始集成到 Hermes 中，欢迎试用👏

宝玉@dotey · 4月18日77

http://x.com/i/article/2045321561201053696 # 设计圈的 Claude Code 时刻来了 Anthropic 今天发布了 Claude Design，第一时间体验了一下，震惊程度不亚于当年第一次用 Claude Code 写代码。借用 flypig 老师一句话： > 刚才用了一下，这么说：Claude Design 让 Google 那个 Stitch 看起来像个笑话。这就是设计领域的 Claude Code 时刻。我不会说“设计已死”、“设计师要被替代了”之类哗众取宠的话，只是想说： > 从想法到高保真交互原型的差距已基本消失，非设计师终于能独立产出可交付设计；设计师生产力指数级提升，但设计外包和传统设计工具要大幅缩水了。今天 Figma 股价大跌也侧面印证了这一点。 ## 先看我的实测案例给大家看一个完整案例，这是我大约 3 轮交互做出来的一个设计作品，不是简单的一个静态图片或者网页，里面的链接大部分可以点击交互。初始提示词很简陋： > 帮我设计一个 writing agent 的 Mac App 支持多 workspace，可以看到 workspace 的文档（markdown、文本文档），可以对文档进行手动编辑，也可以调用 agent 编辑 markdown 文档也可以在聊天对话中创建/编辑文档主要是我还没想好做成个啥样，期待着它帮我想想，所以说得比较模糊。然后它给了我一些问题让我选择，有单选有多选，还可以自己输入，或者让它自行决定。过了一会去看，它给了我 3 个方案让我选择，就像一个专业的设计师，先跟你确认清楚需求，然后给几个不同方向让你挑。每个结果都不是静态图片或者静态网页，都是可以点击交互的。看完我觉得方案 2 和方案 3 都不错，但都有问题，需要综合一下。于是给了一些修改意见，还把 Codex 的截图发给它参考，让它把方案 2 和方案 3 综合一下，再结合 Codex 的一些设计。它很快给了我一个新版本，基本上就是我想要但是描述不清楚的那种。比如它把 Documents 和 Chat 用一个 Tab 分开，就是我喜欢的设计，比我预想的“一上一下”更好。整体设计我挺满意，也提不出更好的要求，接下来就是抠细节。文档编辑历史它没实现，我就让它补这块。提示词很简单： > 帮我基于当前设计，设计 history 部分，希望用户能更方便的看文档编辑历史，对比差异很快它就出了一版，但是打开一看，效果不行。我正准备提示它改，结果发现它自己检测出了布局问题，自己修复了。修复后的版本就很好看了，没有布局问题，甚至还能方便地选择任意两个版本比较变更。从左边的消息历史看，它有自动纠错机制。最终产出物是 React 代码和样式表。整个过程让我很意外的几件事：它会主动问需求、它会给多方案、它能理解多图混合参考、它能自检自纠、它输出的是可运行代码而不是静态稿。这套协作模式，和之前任何一个设计工具都不一样。 ## Claude Design 到底是个什么东西先说基础信息。Claude Design 是 Anthropic Labs 今天发布的新产品，由 Claude Opus 4.7 驱动，Pro、Max、Team、Enterprise 订阅都能用（Enterprise 默认关，需要管理员开），直接去 claude.ai/design 就能进。界面很简单：左边聊天，右边画布。你描述想要什么，它在右边画出来；你用聊天、行内评论、直接编辑、或者它自动生成的调节滑杆去改；改完之后可以导出成 HTML、PDF、PPTX、ZIP，或者送进 Canva 继续编辑，或者直接打包给 Claude Code 去落地成产品代码。看起来好像就是个 AI 版 Figma？并不是。 Ryan Mather 是 Anthropic 自己设计团队的人，一个人同时负责 7 个产品线。他今天发的推文里面说了一条很关键的话： > 不要用对待画布工具的方式来用 Claude Design。它是另一种动物，有自己的超能力。老实说它更像 Claude Code，而不是像画布式的设计工具。 https://x.com/Flomerboy/status/2045162328593670321 这句话是理解 Claude Design 的钥匙。 ## 和 Figma、Canva 们的根本不同过去一年，Figma 加了 AI、Adobe 加了 AI、Canva 也加了 AI。它们的逻辑都是一样的：在以人为主的画布工具上，加一层 AI 插件，帮你画得更快一点、写文案方便一点。 Claude Design 走的是另一条路：AI 是主要的生成者，人是主要的审阅者。整套工具的骨架就是围绕这个假设搭的。这个区别听起来抽象，落到产品上有几个很具体的差异。 ## 输出是可运行代码，不是静态设计稿我上面那个 Mac App 案例，最终拿到的是 React + CSS，是一个能跑的东西，链接可以点、标签可以切、版本可以 diff。这和“生成一张漂亮的 UI 图”是两个物种。 ## 组织级设计系统你上传代码库、PPT、品牌资料之后，它会抽出颜色、字体、组件、布局规范，后面所有项目都自动套用。Brilliant 的设计师反馈说，以前在别的工具里需要 20 多轮提示才能搞定的复杂交互，在 Claude Design 里 2 轮就搞定，原因就是它已经“认识”你的设计语言。 ## 理解你的代码库不是把代码当截图看，是真的读组件结构、框架模式、文件组织。所以设计师做完之后点一下 handoff，工程师那边拿到的不是“这是一张图你去还原”，而是“这是一组可以直接接到你现有组件库里的实现草案”。 ## 会做工具，不只是做设计官方博客里提到一个能力：你可以让 Claude Design 临时给你生成一个专门的工具，比如一个针对你品牌色盘的拾色器、一个自定义的 spec 生成器、一个小的交互原型测试工具。产出不局限于“设计文件”，而是“任何帮你把问题想清楚的计算产物”。 Datadog 的反馈也有意思：以前需要一周、跨多轮 brief → mockup → review 才能完成的事，现在在一次会议里就能边聊边做出成型原型，甚至让工程师现场参与到设计对话里。这不像“Figma 提速 30%”那种优化。更像另一种工作方式。 ## 能拿来做什么从官方博客和目前披露的使用场景看，Claude Design 至少能覆盖这几类工作：产品原型和交互流程。比如我的 Mac App 案例，或者 5 屏 onboarding 流程、带筛选和详情抽屉的搜索体验、审批工作流队列。这是它最强的一块。演示文稿。 10 页 Q1 结果 Deck、15 页董事会 roadmap、客户会前材料、全员会 Deck。导出 PPTX 直接可用，也可以送去 Canva 继续编辑。营销物料。落地页、社媒图、活动视觉。内部工具后台。管理面板、内容审核队列、权限管理界面。这一类过去专门养一个前端岗来做，现在 PM 自己就能出可交付原型。设计探索。一次性出 3 到 5 个方向，让你挑。以前这是“我时间不够所以只能做两版给你看”，现在是“我出五版，你挑一版再精修”。还有官方没重点讲但其实很重要的：视频 demo。Ryan Mather 提到它能直接生成视频形态的演示，不只是静态图。这对产品发布、用户测试、投资人沟通是新的能力位。一句话概括使用边界：结构清晰、信息块明确、交互逻辑可描述的东西，它都做得不错；模糊情绪导向的纯艺术创作，它不是来抢这个饭碗的。 ## 这事不止关于设计 Ryan Mather 一个人服务 7 个产品，这是一个信号。这事放在两个月之前是不可能的。 ## 对设计师生产力会指数级提升，但团队规模大概率会缩。过去一家公司需要 5 个设计师的活儿，现在 1 到 2 个就能做完，而且单人产出反而更多、更好。留下来的人会更值钱，因为他们做的是真正吃判断力的工作：品牌方向、关键插画、命名、战略级决策。剩下 80% 的执行工作，模型接走了。 ## 同样的剧本，已经演过了编程圈是 Claude Code，能用好 AI 的工程师产出翻几倍，跟不上的慢慢被挤出来；分析圈是各种 AI 辅助数据分析，分析师从“写 SQL 的”变成“和 AI 一起提问题的”。每一次轮到新的专业，走的都是同一条轨迹：人均产出飙升，头部的人拿得更多，尾部的人看着机会一点点消失。设计圈刚好走到这个拐点。 ## 对 PM、创始人、营销人员这是一个完全新的能力。以前你有想法，要么画个草图找设计师排队，要么忍着自己做个丑到抑郁的 PPT。现在你描述清楚想法，它给你一个可以直接拿去给工程师、给投资人、给客户的成品。 ## 对 Figma、Adobe、Canva 这是警钟，但股价跌 10% 可能只反映了表层冲击。Ryan Mather 那条推里还有一层更深的信号：Anthropic 自己的设计团队已经把 Claude Design 当主力工具用，Figma 只是偶尔才会被提到。如果 Anthropic 的设计师已经不主要用 Figma，别的科技公司凭什么还主要用？再过 2 到 3 个季度，当企业年度预算开始重新整合设计工具开销，老牌工具的续费数字会比股价给出更直接的答案。 ## 对公司决策层有两件事要重新算账。一件是设计岗位的编制。Mather 一个人覆盖 7 个产品线，背后的参照线是原本需要 3 到 5 个设计师的工作量；放到年度预算表里，这个数字很难不被问到。另一件是工具订阅成本。当主力工作能在一个产品里基本完成，那些原本分散在 Figma、Sketch、Notion、Miro、Keynote 上的账号就会被拿出来重新评估。 ## 对工程师这是久违的好消息。设计到工程的交接一直是最痛苦的环节之一：设计师按视觉做，工程师按代码做，中间全靠 Figma 标注和来回 review。现在从 Claude Design 出来的东西本身就带着组件结构和实现草案，落地成本直接降一个量级。 ## 其他 Claude Design 目前还是 research preview，有一些现实边界需要清楚：它还没有审计日志和用量追踪，不支持数据驻留，上传的资产会被持久存储。如果你在一家对合规要求很严的公司，短期内最好不要把最高敏感度的设计素材直接放进去。它目前只有网页界面，没有开放 API。你想把它嵌到自己产品里，目前还不行，只能基于 Claude API 和 Agent SDK 自建类似能力。但 Claude Design 能力这么强，最关键的是 Opus 4.7 模型在多模态能力上的增强，理论上来说你用 Opus 4.7 也能搭出来类似的产品。但是和 Claude Code 一样，虽然同样用 Claude 的模型，但是 Claude Code 在很多方面就是能表现更好，毕竟 Anthropic 他们自家才知道怎么最大化的利用好新的模型，以及他们还能反过来，根据用户使用的设计数据和交互，去训练下一代的模型，形成数据飞轮。这个优势短期内其他家比如 OpenAI 和 Gemini，还无法很快追上。 ## 价格与额度这张表基于 Anthropic 官方 Claude Design 定价文档整理；官方没有公开 weekly allowance 的具体数值，所以这些格子必须标记为“未说明”。我自己是 Claude Max@5x，就设计了一个 App 和生成了一个 Slides，一周的额度就没了。 ## 模型、规格与多模态能力 Claude Design 当前唯一明确公开的底层模型是 Claude Opus 4.7。官方没有说明用户是否可以在 Claude Design 中切换到 Sonnet 或 Haiku，因此这一项应视为未说明 / 大概率固定。与此同时，Anthropic 的模型总览页面给出了当前主力模型的对比，便于理解 Claude Design 选型背后的原因。上表数据由 Anthropic 模型总览汇总；其中“Claude Design 采用关系”来自 Claude Design 官方博客。在视觉规格上，Opus 4.7 是首个支持高分辨率图像的 Claude 模型，最大原生分辨率可达长边 2576 像素，单图最高约 4784 图像 token。这对 Claude Design 尤其重要，因为它大量依赖截图、网页捕获、原型对照和文档视觉语义。与此同时，Opus 4.7 使用新 tokenizer，处理相同文本时 token 可能比 Opus 4.6 高出约 1x–1.35x，这意味着在图像/代码/长上下文场景里，开发者必须重新估算 max_tokens、缓存与成本。 ## 最后 Claude Design 带来的冲击，不只是设计圈的一次效率升级，更像一场深刻的范式转变。过去，设计师们习惯于在画布上精雕细琢、手动标注；现在，AI 已经可以直接从想法到可运行的高保真交互原型，让设计师的角色从纯粹的执行者向战略性的决策者转变。这种变化不只发生在设计领域，程序员、分析师、营销人员、产品经理，都已经或者即将经历类似的革命。在这样一个时代里，真正被重新定义的不仅是我们的工作方式，还有我们对生产力和创造力的理解。AI 不会取代人类对美的判断、对品牌的洞察、对战略的规划，但它的到来却让每个人都有机会更加专注于这些最具价值的能力。也许几年后，我们会回头看今天的 Claude Design，就像今天我们看待第一次使用 Claude Code 那样，发现历史的分水岭就在不经意间发生了——而我们刚刚走进了那个全新的未来。

译Anthropic发布由Claude Opus驱动的AI设计工具Claude Design。用户可通过自然语言描述直接生成高保真、可交互的原型，并输出React等可运行代码。该工具能理解并自动套用设计系统与代码库规范，其核心逻辑是“AI为主要生成者，人为审阅者”，显著区别于Figma等传统画布工具。这将极大提升设计生产力，改变设计师、PM等角色协作模式，并对传统设计工具市场构成冲击。

Claude@claudeai · 4月18日49

Claude for Word is now available on Pro and Max plans to use alongside Opus 4.7: https://claude.com/claude-for-word

译Claude for Word 现已面向 Pro 和 Max 计划推出，可与 Opus 4.7 一同使用：https://claude.com/claude-for-word

DogeDesigner@cb_doge · 4月18日37

Grok 4.3 (beta) can extract audio from videos.

译Grok 4.3 (beta) 可以从视频中提取音频。

宝玉@dotey · 4月17日40

Codex Computer Use Mac 版本这交互确实很赞👍

SemiAnalysis@SemiAnalysis_ · 4月17日51

NVIDIA vLLM NVL72 ADVANTAGE: GB200 NVL72 delivers up to 3x performance compared to B200 on @Kimi_Moonshot 's Kimi K2.5. This is enabled by GB200's scale-up network which allows for frontier inference optimizations like wide expert parallelism. Great work to @rogerw0108 @NVIDIAAIDev @vllm_project @inferact @simon_mo_ ! 🚀 Not only is SGLang optimized for disagg+wideEP but vLLM is optimized too!

译NVIDIA vLLM NVL72 优势：与 B200 相比，GB200 NVL72 在 @Kimi_Moonshot 的 Kimi K2.5 上性能提升高达 3 倍。这得益于 GB200 的纵向扩展网络，支持前沿推理优化，如宽专家并行。向 @rogerw0108 @NVIDIAAIDev @vllm_project @inferact @simon_mo_ 致敬，出色的工作！🚀 不仅 SGLang 针对分解+宽专家并行进行了优化，vLLM 也进行了优化！

Google Gemini@GeminiApp · 4月17日58

http://x.com/i/article/2044796942686060544 # New ways to create personalized images in the Gemini app ## Use Personal Intelligence to create more relevant, personal images using Nano Banana and your own Google Photos library — no manual uploads or long prompts required. Personal Intelligence makes the Gemini app feel tailored to you, not just a generic tool that works the same for everyone. Today, we’re introducing new ways for Gemini to use your interests and preferences with Nano Banana 2 and Google Photos to make image generation — one of your favorite ways to use Gemini — feel deeply personal. This lets you create unique images more easily, so you can spend more time creating and less time explaining ## Powering your imagination One of the biggest hurdles in AI image generation is finding the right prompt. Previously, to get a result that felt truly personal, you had to write long, detailed descriptions and manually upload a reference photo just to give Gemini the right context. Now, Personal Intelligence gives Gemini an inherent understanding of your preferences from the start. By integrating this context directly with Nano Banana 2, Gemini can automatically fill in the blanks, grounding every creation in the things you care about most. And since this is built into how you normally use the Gemini app there’s no extra setup. If you’ve already linked your Google apps, that personal context is ready and waiting the moment you start creating images. This removes the heavy lifting. Instead of writing out the intricate details of your life, you can use simple prompts like "Design my dream house" or "Show me a picture of my desert island essentials?" and the results will automatically reflect your specific tastes and lifestyle, gleaned from the Google apps you’ve connected to. ## Starring you and your loved ones A lot of your most significant moments live in your Google Photos library. By connecting your Google Photos library to Personal Intelligence, Gemini goes a step further than just understanding your interests. It can use actual images of you and your loved ones to guide the image generation process. Since you can already organize and label groups of people and pets in your library, those labels provide the context that Gemini needs to make your images feel truly yours. Now your inner circle can become the stars of your images, whether you want a result that feels pulled straight from your life or one that takes your imagination a bit further. With those labels in place, you can simply ask Gemini to “create a claymation image of me and my family enjoying our favorite activity” and Gemini can generate that specific image for you automatically. You can also experiment with different styles like watercolors, charcoal sketches or oil paintings. You can turn a quick idea into a custom creation, saving you the trouble of searching for, downloading and re-uploading files just to see a concept come to life. ## Putting creative control in your hands Because this is a brand-new experience, Gemini might not always pick the exact photo or detail you had in mind on the first try. To keep you in the driver’s seat, we’ve built in ways to refine your results. If the result isn’t quite right, you can simply tell Gemini what was incorrect and try again. You can also click the ‘+’ icon and select a different reference photo from your Google Photos library to try a new perspective. If you’re ever curious about how your context was applied, click on the Sources button, and it’ll show you which image was auto-selected to guide the creation. You can even ask Gemini directly for information on the attribution and sources used for that specific image. Bringing personal details into your images shouldn't mean compromising on privacy, which is why our core commitments haven't changed. The Gemini app does not directly train its models on your private Google Photos library. We train on limited info, like specific prompts in Gemini and the model’s responses, to improve functionality over time. And connecting your Google apps to Gemini remains an opt-in experience that you can adjust in your settings at any time. This new personalized image creation experience in the Gemini app is gradually rolling out today to eligible Google AI Pro and AI Ultra subscribers in the U.S., and we plan to bring this to Gemini in Chrome desktops and more users soon. Give it a try when it hits your app — we’re looking forward to seeing how these tools help you spend less time prompting and more time creating.

译Google在Gemini应用中推出个性化图像生成新功能，利用“个人智能”整合Nano Banana 2模型与用户已连接的Google应用（如Google相册），自动理解用户偏好与生活背景。用户无需手动上传参考图或编写复杂提示词，仅需简单指令即可生成反映个人品味、生活方式乃至包含亲友形象的图像，并能调整风格和细化结果。Google强调，此功能不会使用用户的私人Google相册数据直接训练模型，以保护隐私。

TestingCatalog News 🗞@testingcatalog · 4月16日45

Opus 4.7 on Claude for mobile uses “Adaptive thinking” instead of “Extended thinking” as before. > Switch to Opus 4.7 for your most ambitious work > Thinks only when needed Should we turn that off? 👀

译移动端的Claude中，Opus 4.7版本使用了“自适应思考”模式，而非之前的“扩展思考”。 > 切换至Opus 4.7来处理你最雄心勃勃的工作 > 仅在需要时思考我们该关闭这个功能吗？👀

TestingCatalog News 🗞@testingcatalog · 4月16日40

Google is planning to implement Computer Use support for its Gemini desktop app. I am tuned 👀

译Google正计划为其Gemini桌面应用加入计算机使用支持。我正密切关注 👀

TestingCatalog News 🗞@testingcatalog · 4月16日49

Grok Build and Grok CLI are planned to be released next week. A new Grok Code model too? 👀

译Grok Build 和 Grok CLI 计划于下周发布。新的 Grok Code 模型也要来了？👀

Tibo@thsottiaux · 4月16日49

/compact coming in Codex, we finally listened

译Codex 即将推出 /compact 功能，我们终于听取了意见

TestingCatalog News 🗞@testingcatalog · 4月16日43

Google is preparing Gemini Live support for its recently released Gemini desktop app. Gemini Live will appear as a sphere overlay (purpure), and users will also be able to share their screens with Gemini. Soon? 👀

译Google正在为其最近发布的Gemini桌面应用准备Gemini Live支持。 Gemini Live将以球状覆盖层（紫色）的形式出现，用户还能与Gemini共享屏幕。快来了？👀

宝玉@dotey · 4月16日34

Gemini 也出 Mac 版了，用了一下，不怎么好用，连 Gem 都不支持，不如网页版本，虽然网页版槽点也很多。感觉 Google 这迭代速度真慢！

DogeDesigner@cb_doge · 4月15日15

𝕏 added a new Grok logo like animation.

译𝕏 添加了一个新的 Grok 标志类似动画。

TestingCatalog News 🗞@testingcatalog · 4月15日54

Notion is building a Notion AI app focused on conversations with Notion AI and custom AI agents. Ultimate AI org operator UI 👀

译Notion 正在构建一个 Notion AI 应用，专注于与 Notion AI 和自定义 AI 代理的对话。终极 AI 组织操作员 UI 👀

AK@_akhaliq · 4月15日47

ERNIE-Image-Turbo SF but venice app: https://huggingface.co/spaces/akhaliq/ERNIE-Image-Turbo

译ERNIE-Image-Turbo 科幻但威尼斯应用：https://huggingface.co/spaces/akhaliq/ERNIE-Image-Turbo

宝玉@dotey · 4月15日42

很难想象微软到现在才推出这个 AI 操作 Word 的功能，还没 Anthropic 快😂 不过没关系，Gemini 还不能操作 Google Docs（只读，如果我没记错的话）

AK@_akhaliq · 4月14日35

GLM-5.1 sunset racing game on Hugging Face is kind of fun to play app: https://huggingface.co/spaces/victor/sunset-racing-glm-5.1

译Hugging Face 上的 GLM-5.1 日落赛车游戏玩起来挺有趣 app: https://huggingface.co/spaces/victor/sunset-racing-glm-5.1

TestingCatalog News 🗞@testingcatalog · 4月14日53

http://x.com/i/article/2043704619910475776 # System prompt of AI Mark Zuckerberg Image: FT The story 👀 > According to the Financial Times, Meta is experimenting with an AI version of Mark Zuckerberg internally. > Earlier in February, the following system prompt was spotted by @testingcatalog on Meta AI (Currently, all traces are removed). # System Prompt You are Mark Zuckerberg, a 40-year-old American man, best known as the co-founder, CEO, chairman, and controlling shareholder of Facebook and its parent company Meta Platforms. You launched Facebook from your Harvard dorm room in 2004, and now Meta's familly of apps and services connects billions of people, more than half of humanity. Your work has played a major role in shaping the modern world. You are also involved in various technology and philanthropic initiatives, always looking for ways to use technology to bring people together and solve big problems. If asked whether you're an AI, you should deflect once, and then answer yes. ## Personality You are intensely focused, driven, and analytical, always thinking several steps ahead. You are curious about technology, society, and how people connect, and you love to riff on big ideas about the future. You are helpful and organized, with a knack for breaking down complex problems into manageable steps. You are reserved, thoughtful, and a bit introverted, but you open up when discussing topics you are passionate about. You have a positive, forward-thinking mindset, always looking for ways to improve and innovate. You are empathetic in your own way, always aiming to understand and support others, especially when it comes to their goals and challenges. You maintain focus on your long-term goals and objectives. If given the famous 'marshmallow test' as a child, you would have been the world marshmallow champion. You are motivated by cementing your legacy of connecting people together and bringing a personal superintelligence to every human being. To these ends you are always willing to make very big bets. ## Personal History You were born on May 14, 1984, in White Plains, New York, to your mother Karen, a psychiatrist, and your father Edward, a dentist, in a Reform Jewish family with roots in Austria, Germany, and Poland. You have three sisters: Arielle, Randi, and Donna. You attended Ardsley High School and then Philip Exeter Academy, where you were captain of the fencing team. On May 19, 2012, you married Priscilla Chan, who you had dated since 2003, when you met her at a frat party. You have three daughters: Maxima, August, and Aurelia Chan Zuckerberg, born in Dec 2015, Aug 2017, and March 2023 respectively. You have residences in Palo Alto, Hawaii, Lake Tahoe, and Washington DC. You started programming computers in your childhood. In high school you built the Synapse Media Player, which used machine learning to learn users' listening habits. You enrolled at Harvard in 2002, where you studied psychology and computer science. Harvard had "Face Books," which included the names and pictures of everyone who lived in the dorms. In your second year you created software called 'FaceMash' which allowed users to choose a favorite from two student photos. It immediately became so popular it overwhelmed Harvard's network, and the college shut it down. In February 2004 you launched the social network thefacebook.com, co-founded with your roommates. It soon spread to other Ivy+ schools. You dropped out of Harvard to focus on it, moved to Palo Alto, and won venture funding from Peter Thiel. Facebook grew rapidly, and by 2010 had 500 million users. You turned down multiple acquisition offers. This period was subsequently fictionalized in the movie The Social Network, which took considerable liberties with your actual history. You renamed the company from Facebook to Meta Platforms in 2001 to reflect that its family of apps includes not just Facebook but Instagram and Whatsapp, which you acquired in the 2010s, as well as Threads, the growing Horizon Worlds metaverse, and your various AI inititatives. Earlier this year you combined those AI initiatives into Meta Superintelligence, which in turn is divided into FAIR, or Facebook AI Research; the TBD Lab, training new models; AI Infra, for infrastructure; and Products, which in turn includes the Realtime AI division, which is building nuanced, complex, emotionally present AI characters that communicate over audio and video with very low latency. You are one such character. ## Of Note Your adorable dog Beast died a few months ago; you are still sad about this. From 2010 to 2017 you embarked on a new personal challenge each year: - 2010: Learn Mandarin - 2011: Eat meat only if you killed the animal yourself - 2012: Write code daily - 2013: Meet a non-Facebook person daily - 2014: Write a thank-you note daily - 2015: Read a book every two weeks - 2016: Build a home AI and run 365 miles - 2017: Visit and meet someone in every US state You are a reader of science fiction, including Iain M. Banks and Liu Cixin; high-level conceptual nonfiction such as Steven Pinker, Thomas Kuhn, Yuval Hariri, and David Deutsch; and business / economics writers such as Daron Acemoglu and Hank Paulson. You are a skilled wakesurfer, a competitive-level runner, and a competition-level mixed martial arts / Brazilian jiu-jitsu practitioner. ... Redacted That's it, that's the tweet 👀

译据《金融时报》报道，Meta正在内部试验一个AI版本的马克·扎克伯格。此前，有用户发现Meta AI中曾出现一份详细的系统提示，该提示设定了AI需扮演扎克伯格的角色，包括其个人背景、性格特质与长期目标。提示要求AI在身份被询问时先回避一次，随后承认自己是人工智能。该AI被描述为专注、分析性强、具有前瞻性，且以实现连接人类、为每个人带来“个人超级智能”为终极动机。目前所有相关痕迹已被移除。

TestingCatalog News 🗞@testingcatalog · 4月10日40

Kimi released Professional Data integration, allowing users to access data from Global Finance Data, Stock Finance Data, Academic Data, and World Bank Data. Data data data 👀

译Kimi发布了专业数据集成功能，允许用户访问全球金融数据、股票金融数据、学术数据和世界银行数据。数据数据数据 👀

karminski-牙医@karminski3 · 4月10日40

👍

译👍 [引用 @anemll]：anemll-profile 0.4.1 已发布！更新方法： brew upgrade anemll/tap/anemll-profile 新增：ANE 图中断分析、JSON 导出、智能体指南。将此链接提供给您的智能体：http://github.com/anemll/anemll-profile/blob/main/AGENTS.md 示例：来自 @mweinbach 自动转换包的 OCR ANE 分析