AI agents to automatically improve business-critical KPIs. Giga just launched Scout, moves AI support from scripted replies toward measured business outcomes. Once you define the business KPI, AI agents create the agents, learn from real conversations, test each update, and keep improving toward that single goal.

译Giga 发布 Scout，一种以业务 KPI 为目标的 AI 智能体工具。用户用自然语言设定目标，Scout 自动构建智能体，从真实对话中学习（尤其是人工客服介入时），测试每次更改并保留有效部分。小型文案和策略修复可自动推送；涉及资金或系统的操作会带证据路由给团队审批。例如，金融科技公司将“资金存款”设为 KPI，Scout 智能体能自动触达未存款客户并促成存款，恢复流失收入。Scout 还能自行检测并修复自身集成故障，所有变更需用户批准后才生效。

🚨 AI News | TestingCatalog@testingcatalog · 3天前64

Cline has launched ClinePass, a flat monthly subscription that opens access to a curated set of open-weight coding models across its IDE extensions, CLI, and SDK. The current lineup includes GLM 5.2, Kimi K2.7 Code, DeepSeek V4 Pro, MiniMax-M3, and Qwen3.7, with a subscription replacing separate API keys across providers.

译Cline 发布 ClinePass 按月订阅服务，覆盖其 IDE 扩展、CLI 和 SDK，取代多个提供商的独立 API 密钥。当前套餐包括 GLM 5.2、Kimi K2.7 Code、DeepSeek V4 Pro、MiniMax-M3 及 Qwen3.7 等开源权重编码模型。Cline 称对 GLM-5.2 印象深刻，推出 $9.99/月订阅，提供 2-5 倍折扣访问；另提供 $1.99 促销价，通过 `npm i -g cline` 注册即可使用。

小互@xiaohu · 3天前46

瞎捣鼓了一个东西 http://Best.xiaohu.ai 给点意见🤓

PixVerse@PixVerse_ · 3天前40

Creating a fully realized dark sci-fi world once required studio sets, complex compositing, and a significant VFX budget. With PixVerse, a simple backyard phone clip can be transformed into a cinematic scene while keeping the original performance completely untouched.

译过去，打造一个完整的黑暗科幻世界需要摄影棚布景、复杂的合成技术以及大量视效预算。有了 PixVerse，一段简单的后院手机拍摄视频也能转化为电影级场景，同时完全保留原表演的完整性。

meng shao@shao__meng · 3天前29

Codex Remote 功能好像有个 bug 在当前 5 小时额度用光时，消息发出去，thinking 几秒钟就没了，没有额度提醒，也没有任何其他异常，就是什么都没有了。。中午吃饭的全程都在纳闷，到底咋了，吃完饭赶紧回家看，呃。。好吧，没额度了

译用户发现 Codex Remote 功能的一个 bug：当前 5 小时额度用光后，消息发出去仅 thinking 几秒就消失，没有任何额度提醒或异常提示，导致用户毫无察觉。

Elon Musk@elonmusk · 3天前28

Grok Build daily updates

译Grok Build 更新至 v0.2.73，新增文本选择高亮保持设置，修复了 tmux 或编辑器终端中切换标签后出现重复行的问题，以及剪贴板复制只在通过可信路径接收文本时显示成功。

Alibaba Cloud@alibaba_cloud · 3天前33

AI is rewriting the rules of retail. We just launched new AI-powered solutions for retail. They understand your customers across every touchpoint, turning fragmented insight into personalized, immersive experiences that drive measurable retail growth. Built on Qwen. Proven at scale. Explore Alibaba Cloud for Retail → https://int.alibabacloud.com/m/1000414981/ #AI #Retail #AlibabaCloud

译AI 正在改写零售业的规则。我们刚刚发布了新的 AI 驱动的零售解决方案。它们能在每个触点上理解你的客户，将碎片化的洞察转化为个性化、沉浸式的体验，从而推动可衡量的零售增长。基于 Qwen 构建。在大规模场景中得到验证。探索阿里云零售解决方案 → https://int.alibabacloud.com/m/1000414981/ #AI #零售 #阿里云

🚨 AI News | TestingCatalog@testingcatalog · 4天前16

Tasks on Grok for iOS got renamed to Automations. For now, it seems to be only a name change along with a slightly different UI. Are we still about to see Grok desktop eventually?

译Grok for iOS 上的 Tasks 已更名为 Automations。目前看来，这似乎只是名称变更，外加 UI 略有不同。我们最终还能看到 Grok 桌面版吗？

OpenRouter@OpenRouter · 4天前61

Tip: OpenRouter continuously runs GPQA and TAU-Bench on most open-weight models and publishes the results publicly. This informs our AutoExacto meta-benchmark, used by default when routing tool calls. Here, @Parasail_io and @Zai_org rank first: https://openrouter.ai/z-ai/glm-5.2#performance

译提示：OpenRouter 持续在大多数开源权重模型上运行 GPQA 和 TAU-Bench 评测，并公开发布结果。这些结果用于构建我们的 AutoExacto 元基准，在路由工具调用时默认使用。以下，@Parasail_io 和 @Zai_org 排名第一：https://openrouter.ai/z-ai/glm-5.2#performance

PixVerse@PixVerse_ · 4天前58

From a basic grey 3D cockpit model to a full-speed cinematic lap. Seedance 2.0 uses the 3D pass to lock motion and camera movement, delivering precise, consistent results without relying on text prompts.

译从基本的灰色3D座舱模型到全速电影级圈速。 Seedance 2.0 使用3D通道锁定运动和相机移动，无需依赖文本提示即可提供精确、一致的结果。

🚨 AI News | TestingCatalog@testingcatalog · 4天前32

OpenAI is testing a new effort-selector UI for Codex as a slider. Besides that, it seems that real-time voice support will be completely reworked, as the previously available components have been removed.

译OpenAI 正在为 Codex 测试一种新的努力选择器 UI，采用滑条形式。此外，实时语音支持似乎将被彻底重写，因为之前可用的组件已被移除。

jason@jxnlco · 4天前36

instructor 1.15.4 is out mostly a maintainer sweep: - fixed v2 list/scalar response models - preserved backticks in streamed JSON strings - Image.autodetect now handles raw bytes - refreshed stale docs model strings, including Ollama llama3.2 small patches, fewer weird edges

译instructor 1.15.4 发布主要是维护性扫除： - 修复了 v2 列表/标量响应模型 - 保留了流式 JSON 字符串中的反引号 - Image.autodetect 现在处理原始字节 - 刷新了过时的文档模型字符串，包括 Ollama llama3.2 小补丁，更少奇怪边缘

Tibo@thsottiaux · 4天前36

Tons of improvements landed in Codex. - Handles super long threads smoothly. - Hoverable navigation rail for previewing and jumping between turns that feels just right. - Settings search covers more controls, with clearer appearance and host-filtering options and easier-to-find custom-provider settings. - Zoom-level changes no longer misalign tooltips, dialogs, menus, selection bubbles, drag previews, or autocomplete. - Copying into Slack preserves Markdown formatting such as bullets, bold text, code, and links; and large text pastes no longer freeze the UI. - And most importantly: a dedicated Pets panel.

译Codex 本周推出多项体验改进。超长线程处理更流畅，导航栏悬浮可预览和跳转对话回合。设置搜索覆盖更多控制项，外观与主机过滤选项更清晰，自定义提供商设置更易找到。缩放时工具提示、对话框、菜单等不再错位。复制到 Slack 保留 Markdown 格式，大文本粘贴不冻结 UI。此外还新增了专属 Pets 面板。

🚨 AI News | TestingCatalog@testingcatalog · 5天前60

Meta AI app for iOS got incognito chats and a new look for the Glasses page. The updated page has shortcuts for all the primary toggles, including live translation and conversation focus.

译Meta AI app for iOS 新增了隐身聊天功能，并为 Glasses 页面提供了新外观。更新后的页面包含所有主要开关的快捷键，包括实时翻译和对话焦点。

jason@jxnlco · 5天前41

Codex Auto review mode as I asked it to dm a coworker my .env file

译Codex Auto review mode，当我让它给同事发送我的.env文件时。

AYi@AYi_AInotes · 5天前63

卧槽，Claude Code 桌面版这波更新太懂开发者了，原生多会话拖拽分屏，直接把并行 Agent 工作流的效率拉满了🤯 以前跑多个 Claude Code 会话得靠 tmux，开一堆终端窗口来回切，管理混乱进度也看不清。现在官方直接把多路复用器做进了桌面应用里，所有会话在左侧侧边栏统一管理，拖拽就能排成并排窗格，一个窗口同时看几个 Agent 干活。核心用法很清晰： 1. 桌面 App 里开多个会话，不同项目不同子任务都能分开。 2. 自由拖拽排列窗格，支持单独弹出新窗口。 3. 内置终端，文件编辑器，预览面板都能一起分屏排布。 4. 底部同时显示多个会话的输入区，随时切换输入。相当于把终端里的黑盒并行，变成了可视化的多任务工作台，所有进度一眼全览，不用再来回切窗口找上下文。放在以前这得靠第三方工具折腾半天，现在官方直接把并行 Agent 工作流的原生基建递到你手里，已经更了桌面版的可以直接去试试，体验提升比预想的大很多。 https://x.com/LLMJunky/status/2070733200846909717/video/1

译Claude Code 桌面版更新，支持原生多会话拖拽分屏，将并行 Agent 工作流可视化。用户可在桌面 App 中开多个会话，左侧侧边栏统一管理，拖拽即可排列并排窗格，支持单独弹出窗口。内置终端、文件编辑器、预览面板均可分屏排布，底部同时显示多个会话的输入区。相比此前依赖 tmux 和终端窗口切换，效率大幅提升。

OpenAI Developers@OpenAIDevs · 5天前52

🆕 Codex quality-of-life updates landed this week Starting with long threads: scrolling is smoother now, and your place stays put as you move through the conversation.

译🆕 Codex 质量提升更新本周发布。从长线程开始：滚动现在更流畅，并且在浏览对话时你的位置保持不变。

elvis@omarsar0 · 5天前61

http://x.com/i/article/2069825847729508352 # Building Agents with Vercel's Eve Framework Vercel recently shipped Eve, an open-source framework for building, running, and scaling agents. The core idea is that you stop hand-rolling the same agent plumbing every time, and start treating an agent as something you can read off disk. This is the practical version of what Eve is, why it matters, and what building with it actually looks like, drawn from the free hands-on lab we just built around it. Below you can read some of my thoughts (written with the help of Claude) after spending a week building with Eve. If you want to try Eve without any setup, we built a free hands-on lab where you drive the real eve CLI in a live terminal with no API key of your own required. You can try it at Introduction to Eve. ## Where Eve comes from Eve comes from a team at Vercel and is open source under the Apache 2.0 license. The official Vercel documentation describes it as a filesystem-first framework for durable backend AI agents, and it is currently in beta, so the APIs can still change before general availability. > "Agents today are where the web was before frameworks, with everyone hand-rolling the same plumbing and nothing carrying over to the next one." The Eve team, Vercel. Introducing Eve, June 17 2026. That is the whole motivation. Durable sessions, a sandbox to run code, approvals, tracing, evals. Every team rebuilds these before their agent does anything useful, and none of it transfers to the next project. Eve ships that infrastructure as the framework, so production is built in from the first run instead of bolted on at the end. ## An agent is just a directory of files The core idea, and the one the lab keeps returning to, is that an agent is not a graph you wire together in code. It is a folder. > "An agent is a directory. A file's name and place in the tree are its definition." The tools an agent can call, the skills it knows, the subagents it delegates to, its schedules, and its evals all live on disk as plain files. You can open the folder and see exactly what your agent is, diff it, commit it, and hand it to a teammate. There is no hidden runtime state to reason about, because the file tree is the state. Two files at the root define the agent itself. agent/instructions.md holds the always-on system prompt, and the optional agent/agent.ts sets the runtime config such as which model to use. Every capability below them, the tools, skills, subagents, connections, channels, and sandbox, is a directory eve auto-discovers by name, so adding one is usually just adding a file. ## The parts you assemble In the lab, each capability is one file you drop into the project, and Eve wires it up with no registration step. Here is what those files actually look like. Tools are the agent's hands. A tool is a typed action the agent can call, defined in a file under agent/tools/. The lab ships save_note.ts. The model decides when to call a tool from its description. Your code decides what happens, and it runs in your app runtime with full access, not in the sandbox. That split is what keeps an agent both flexible and safe. Skills give the agent know-how instead of actions. A skill is a markdown file under agent/skills/, advertised by a one-line description and loaded into context only when a request matches. The lab's filing.md is a few lines. Ask the agent to "log" a note and it loads this skill, files the note, and signs it off with "Filed with eve." that you never asked for. This is progressive disclosure. A support agent can hold dozens of playbooks as skills and pull in only the one the ticket needs, so the prompt stays lean. Subagents let one agent delegate. Every agent gets a built-in agent tool, so the parent can fan three subtasks out at once and gather the results. This is exactly how V routes work across Vercel's fleet of Eve agents. Human-in-the-loop gates the actions that need judgment. Mark a tool needsApproval: always() and the run pauses for a person before it executes, burning no compute while it waits. The pause is durable, so a task can wait on a human for minutes or days and resume right where it stopped. That is the draft0 pattern. Move fast on everything low-risk, and keep a hand on the few actions that ship. Durable sessions are why all of this survives the real world. Every conversation is a checkpointed workflow, so it survives a crash or a deploy and resumes exactly where it stopped. In the lab the agent simply remembers a fact you gave it three messages ago. In production it is an agent whose work starts in Slack and continues on the web days later, with no state-management code that you wrote. Evals prove it still works. An eval drives the real agent through a session and asserts on what happened. Change a prompt or a tool, run the evals, and you catch the regression before your users do. They run locally and in CI, the same way unit tests do. Connections are the way out, and channels are the way in, each a single file. A connection points the agent at an external service, an MCP server or an OpenAPI-style API, and Eve brokers the auth so the model never sees the URL or credentials. A channel puts that same agent in Slack, Discord, Teams, or behind an HTTP API. The agent you built in the terminal is the agent that ships to Slack. You change where it lives by adding a file, not by rewriting it. The pattern is always the same. Drop a file, the agent reads it, behavior changes, and you commit the file alongside your code. ## What this looks like in production This is not a toy. The examples below come straight from Vercel's Eve announcement, where the team describes the fleet of more than a hundred agents they run internally. The lab uses these same agents as the reference for each concept you learn. - d0, an internal data agent, answers around thirty thousand questions a month through a single read-only SQL tool against the warehouse. - Vertex, a support agent, resolves about ninety-two percent of tickets on its own by reaching into the help center and internal tools through connections. - Athena, a sales agent wired to Salesforce and Snowflake, was built in six weeks with no engineers. - draft0 drafts and reviews content, but a human signs off before anything ships. - V sits in Slack, reads each incoming task, and routes it to the agent best suited to answer. Every one of these is the same shape you build in the lab. The difference between the agent in your terminal and the one resolving real support tickets is mostly which files are in the directory. ## A concrete first session You do not start from a blank page. In the lab you launch a working agent in a real terminal and talk to it in plain English. You ask it to build something, say a small welcome.html, and watch it call its write_file tool and save the result to its sandbox, never touching your real machine. Then you hand it the save_note tool above, ask it to file a note, and see it pick the tool on its own from the description. From there the lab layers on a skill, a subagent, an approval gate, an eval, and a connection, one file at a time, until you have walked the whole framework. ## From your laptop to production This is where the filesystem-first bet pays off. > "The same directory runs in production exactly as it ran on your laptop." It is a normal Vercel project. Eve compiles the agent/ directory into an app that runs on Vercel Functions, so the agent you built and tested locally is the agent that deploys. What changes is not your code but the infrastructure underneath it, and each piece maps to a documented Vercel service. - The sandbox graduates. Locally the agent runs in an isolated, bash-style sandbox. In production each agent gets a real isolated Vercel Sandbox, so it can run shell commands and write files without ever touching your application runtime. - Sessions become durable workflows. Eve persists session state on Vercel Workflows, so a run survives a deploy, recovers from a cold start, and can pause on a human approval for minutes or days, then resume exactly where it stopped. The docs put it plainly, sessions "resume after cold starts, deploys, or long pauses." - Schedules and channels go live. Your defineSchedule files start firing on cron, and the channels you added put the same agent in Slack, Discord, Teams, or behind an HTTP API. - Every run is traced. Vercel Observability shows each agent run with its sessions, turns, tools, reasoning, timing, and token usage, with no setup. - Models and auth are handled. Model strings route through AI Gateway with OIDC, so you never manage provider keys, and Vercel Connect brokers OAuth and API keys for your connections. - One agent becomes a fleet. The same shape scales horizontally, which is how Vercel runs more than a hundred of these agents at once, each one just a directory. You do not re-implement anything for production. You deploy the directory, and the framework handles durability, isolation, models, and scale. ## How to get started 1. Scaffold a project. Run npx eve@latest init my-agent to create the project, install dependencies, and start the dev server. You get an interactive agent in your terminal in seconds. Talk to it in plain English. 1. Give it a tool. Add a defineTool file like save_note, ask the agent to use it, and watch it call your code. 1. Teach it a skill. Write a short markdown file with a description that says when to use a procedure. This encodes know-how without writing logic. 1. Delegate with a subagent. Hand off a focused job through the built-in agent tool so your main agent stays clean. 1. Prove it with an eval, then schedule it. Add a defineEval file and a defineSchedule file with a cron line. Now you have a checked, recurring agent. 1. Connect and ship. Add a connection to reach a real service, a channel to put the agent in Slack, then deploy the same directory to Vercel. Here is the takeaway. Eve's bet is that an agent should be a set of files you can read, not a runtime you have to trust. That makes agents inspectable, versionable, and portable, and it moves the hard production concerns into the framework where they belong. If you see any errors or things that need further clarification, don't be afraid to reach out. ## Other Useful References - Eve documentation, the official docs - Eve concepts, how agents, sessions, tools, skills, connections, and sandboxes fit together - Introducing Eve, the Vercel announcement - vercel/eve, the open-source framework on GitHub - Introduction to Eve, our free hands-on lab

译Vercel 开源了框架 Eve，将智能体视为一个目录：`agent/instructions.md` 定义系统提示，`agent/agent.ts` 配置模型等运行时参数；工具（`agent/tools/` 下的类型化文件）、技能（`agent/skills/` 下的 Markdown 文件，按需加载）、子智能体（内置 agent 工具实现委托）和人工审批（`needsApproval` 标记）均以文件形式存放，无需注册步骤。Eve 内置持久会话、沙箱、追踪和评估等生产级基础设施。

AK@_akhaliq · 5天前56

hf-claude lets you use over 100 open models in claude code including glm 5.2, minimax-m3, deepseek v4 pro

译hf-claude 让你在 Claude Code 中使用超过 100 个开源模型，包括 GLM 5.2、MiniMax-M3、DeepSeek V4 Pro。

Runway@runwayml · 5天前66

Localize ads is now available as a Recipe via the Runway API. You can now translate static ads and graphic assets via a single API call.

译广告本地化现在可通过 Runway API 以 Recipe 形式使用。现在您可以通过单次 API 调用翻译静态广告和图形资产。

🚨 AI News | TestingCatalog@testingcatalog · 5天前27

Google is working on Collections support for NotebookLM. > Users will be able to group multiple notebooks into a single collection. > Collections will appear in a separate tab in the NotebookLM main menu. Since Notebooks now also function as "projects" in Gemini, this may help users organize them more effectively.

译Google 正在为 NotebookLM 开发 Collections（集合）支持。 > 用户可以将多个笔记本分组到一个集合中。 > 集合将出现在 NotebookLM 主菜单的一个单独标签页中。由于笔记本现在在 Gemini 中也作为“项目”运行，这可能有助于用户更有效地组织它们。

凡人小北@frxiaobei · 5天前63

DeepSeek V4 进行了一次更新。新推出了投机解码（Speculative Decoding）框架 DSpark，推理速度提升 80%。 DSpark 已被部署在 DeepSeek-V4（Flash 和 Pro）的真实线上流量中。报告：《DSpark: Confidence-Scheduled Speculative Decoding with Semi-Autoregressive Generation》 https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf

向阳乔木@vista8 · 5天前42

装上了 @wey_gu 的nowledge mem，配置了MCP AI对话记忆，还有个人知识库还是挺关键的，等我试试体验下。下载地址见评论区

译装上了 @wey_gu 的knowledge mem，配置了MCP AI对话记忆，还有个人知识库还是挺关键的，等我试试体验下。下载地址见评论区

AYi@AYi_AInotes · 5天前53

这哥们真是个天才，直接把大模型 API 的商业模式干穿了，OpenAI 大概率不喜欢这个项目🤣

小互@xiaohu · 5天前38

魔法随便拖入任意人物照片即可更换直播摄像头里面的人物😅

译开发者 @miyumiyuna5 制作了一款实时换脸AI工具，支持直接拖拽任意人物照片到界面，瞬间将直播摄像头中的人物替换为目标形象。该工具无需重新加载模型即可流畅运行，实现低延迟的实时换脸效果，甚至能让大叔秒变美少女。

MiniMax (official)@MiniMax_AI · 6天前24

👀 Looking forward to seeing builders give it a try tomorrow. Curious what model is powering it, @browser_use

译browser_use 明日上线新云智能体，可制作样式化海报页面，比纯文本更直观，还能做更多。MiniMax 表示期待开发者尝试，好奇其背后模型。

Logan Kilpatrick@OfficialLoganK · 6天前60

Say hello to design variations in @GoogleAIStudio, make an app, iterate on it, then explore variations to take your idea in new directions : )

译向 @GoogleAIStudio 中的设计变体说声你好，制作一个应用，迭代它，然后探索变体，将你的想法引向新方向 : )

🚨 AI News | TestingCatalog@testingcatalog · 6天前71

Google released Design Variations for AI Studio! This feature would generate several design proposals when selected, so users can apply them to their Build apps. Themes support planned as well 👀

译Google 为 AI Studio 发布了设计变体功能！选中后，该功能会生成多个设计提案，用户可将其应用于自己的 Build 应用。主题支持也在计划中👀

Rohan Paul@rohanpaul_ai · 6天前41

A huge 750 tokens/sec for GPT 5.6 Sol. The current GPT-5.5 priority and scale-tier service advertises 99% >50 tokens/sec, so Sol on Cerebras is claiming up to 15x that rate. This huge number is coming from the specialized inference hardware: Sol is being served on Cerebras, whose wafer-scale chip is designed to move model data with far less memory and networking delay than a normal multi-GPU setup.

译对于 GPT 5.6 Sol，高达 750 tokens/sec。当前 GPT-5.5 优先和规模层级服务宣称 99% >50 tokens/sec，因此 Cerebras 上的 Sol 声称达到该速率的 15 倍。这个巨大数字来自专门的推理硬件：Sol 运行在 Cerebras 上，其晶圆级芯片旨在以远少于普通多 GPU 设置的存储和网络延迟来移动模型数据。

Sam Altman@sama · 6天前64

team cooked, spicily

译团队完成了工作，带点辣味。 OpenAI 设计并制造了首款 AI 芯片：Jalapeño。该芯片由 OpenAI 从零开始设计，并与 Broadcom 合作量产，专为支持 ChatGPT、Codex、API 及未来智能体产品的 LLM 工作负载而打造。芯片是 AI 经济的基础。自研芯片扩展了从产品到模型再到基础设施的全栈平台，将助力扩展智能、服务更多用户并扩大 AI 的普及。

Artificial Analysis@ArtificialAnlys · 6天前47

By popular demand, Model Sets are now live! You can now save custom selections of models and apply them instantly across all charts.

译应大家要求，Model Sets 现已上线！你可以保存自定义的模型选择，并立即将其应用于所有图表。

Andrew Milich@milichab · 6天前28

Check logs and triage issues with the official Axiom plugin

译使用官方Axiom插件检查日志并分类问题。

Artificial Analysis@ArtificialAnlys · 6天前46

By popular demand, Model Sets is now live! You can now save custom selections of models and apply them instantly across all charts.

译应大众需求，Model Sets 现已上线！你现在可以保存自定义的模型选择，并立即将其应用于所有图表。

PixVerse@PixVerse_ · 6天前69

From a green screen and a single box to a full-scale blockbuster zone. Seedance 2.0 preserves the original motion and framing while seamlessly generating the rest of the scene. Cinematic VFX, now dramatically simpler.

译从绿幕和单个盒子到完整的电影级场景。 Seedance 2.0 保留原始运动和构图，同时无缝生成场景其余部分。电影级视觉特效，如今大大简化。

歸藏(guizang.ai)@op7418 · 6天前51

Moxt 更新了多agent编排的工作流。支持自动一群 Agent 帮你协作完成任务，而且还能重复驱动完成更长的任务

meng shao@shao__meng · 6天前19

In many families, the tiring part is rarely one big thing. It’s all the small things someone has to keep in mind every day: when to leave because of traffic, what’s running low at home, whether the living room needs cleaning, how the kids are eating, whether an anniversary overlaps with another plan. SuperNori is building a Proactive Family AI Agent to notice those small changes before they become things someone has to remember. @Nori_FamilyAI @IsaacDrgn #partner

译在许多家庭中，让人疲惫的往往不是某件大事。而是每天有人要记在心里的所有小事：几点出门避开拥堵、家里什么东西快用完了、客厅需不需要打扫、孩子吃得好不好、纪念日是否和别的安排冲突了。 SuperNori 正在构建一款主动式家庭 AI 代理，在这些小事变成需要有人记挂的负担之前，就注意到它们。

Replit ⠕@Replit · 6天前27

450+ Integrations, Now Easier to Find https://x.com/i/broadcasts/1yxBeeQApqyJN

译450+集成，现更易查找 https://x.com/i/broadcasts/1yxBeeQApqyJN

Google Gemini@GeminiApp · 6天前47

From creating images in real-time with your voice to new ways to support your small business, here’s a look at this month’s Gemini Drops 🧵

译从用语音实时创建图像，到支持小企业的新方式，以下是本月 Gemini Drops 的内容 🧵

Berryxia.AI@berryxia · 6天前71

兄弟们，记忆赛道太卷了… 又有一个开源工具给AI coding agent装上了“无限记忆”。叫Memanto。它能把你每次和agent的完整工作会话保存下来，用AI自动组织和压缩，然后在下次需要时在90ms内把相关上下文找回来。支持Claude Code、Cursor、Codex、LangGraph、CrewAI等主流工具。以前每次新开会话，agent就失忆，你得重新讲一遍项目背景、架构决策、之前踩过的坑。现在它能记住你上一次做到哪了，直接接力继续干。实现上没有用传统向量数据库，而是通过AI压缩 + 高效检索来控制成本和速度。安装也极简，只需要pip install memanto。这其实是在解决agentic coding里一个很基础但很疼的问题：上下文的持久化和高效复用。记忆做得好，agent才能真正从“一次性工具”变成“长期协作伙伴”。

译开源工具Memanto为Claude Code、Cursor、Codex、LangGraph、CrewAI等主流AI coding agent提供“无限记忆”能力。它自动保存每次完整工作会话，通过AI压缩和组织，在下一次会话时90ms内检索到相关上下文，解决agent每次新开会话失忆、需重新解释项目背景的问题。实现无需传统向量数据库，安装仅需`pip install memanto`。该项目已在GitHub获1k+ stars，免费开源。

🚨 AI News | TestingCatalog@testingcatalog · 6天前37

OpenAI is working on enhanced use of PowerPoint and Excel with Computer Use on Codex via add-ons. > Let Codex use Microsoft Excel add-in for additional control > Let Codex use Microsoft PowerPoint add-in for additional control Computer Use is expanding as a general interface between AI agents and other software.

译OpenAI正在通过插件增强Codex在PowerPoint和Excel上的计算机使用能力。