HeyGen just turned AI video generation into a CLI workflow, so an agent can go from script to avatar, then render the video and ship it without leaving the terminal. The key point: once video is exposed as a command, any research agent, reporting bot, or automation stack can treat video as just another output format. Video generation can now sit inside cron jobs, pipelines, internal tools, and autonomous agents, with no human stitching the steps together.

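Summary: HeyGen has officially launched a CLI tool that turns AI video generation into a command-line workflow. With a single command, users can automate the full flow from script writing and avatar generation to video rendering and delivery without leaving the terminal. Any research agent, reporting bot, or automation stack can call video generation as a standard output format and integrate it into cron jobs, pipelines, and internal tools. The CLI means AI video production can be embedded fully in autonomous agent workflows, with end-to-end automation from script to finished asset and no human intervention.

Chubby♨️@kimmonismus · Apr 14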

Microsoft is testing OpenClaw-style AI agents to evolve Microsoft 365 Copilot into an always-on assistant that can autonomously handle tasks like managing emails, calendars, and daily workflows. They are literally trying everything to make Copilot more appealing.

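Summary: Microsoft is testing OpenClaw-style AI agents to evolve Microsoft 365 Copilot into an always-on assistant that can autonomously handle email, calendars, and daily workflows. They are trying just about everything to make Copilot more popular.

宝玉@dotey · Apr 13

Chrome DevTools MCP has added several dedicated debugging skills: running performance audits with Lighthouse, detecting memory leaks, accessibility debugging, optimizing LCP (Largest Contentful Paint, which directly affects how fast users perceive a page to load), and an experimental command-line tool.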

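Summary: Chrome DevTools MCP has added several debugging skills aimed at AI agents, supporting Lighthouse performance audits, memory leak detection, accessibility debugging, and LCP optimization. These features are meant to give AI agents automated code-quality checks that help identify performance bottlenecks and accessibility problems. An experimental CLI tool was released alongside them, exposing the debugging capabilities from the command line.

Ethan Mollick@emollick · Apr 12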

It is notable that we are all debating exactly which markdown files are most important to feed AI (skills, memory, tool instructions) and in which order to feed them to get the best output. Feels that this is likely a temporary state of affairs in the development of agents

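Summary: Developers are agonizing over which markdown files (skills, memory, tool instructions) to feed AI, and in what order, to optimize output. This fine-grained polishing of prompt engineering suggests agent development is still in an early transitional stage that requires manual tuning.

Thariq@trq212 · Apr 12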

there's a turbotax connector in Claude Code now, so glad I procrastinated on taxes 😭

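Summary: Claude Code has launched TurboTax and Aiwyn Tax (formerly Column Tax) connectors that can estimate refunds, check taxes owed, and parse tax forms. The author jokes about being glad to have procrastinated on taxes, since AI can now help with them.

Boris Cherny@bcherny · Apr 12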

brb trying this now

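Summary: Claude has added tax data connectors for TurboTax and Aiwyn Tax (formerly Column Tax) that can estimate refund amounts, look up taxes owed, and parse tax forms, helping users sort out their tax situation before filing.

Peter Steinberger 🦞@steipete · Apr 9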

GUYS WE FOUND THE GUY WHO BUILT THE GITHUB MCP SERVER

Summary: Look, everyone, we found the developer who built the GitHub MCP Server! The tweet announces the discovery in an excited tone and includes a related link.

Artificial Analysis@ArtificialAnlys · Apr 8

Announcing APEX-Agents-AA, our latest leaderboard on Artificial Analysis, evaluating AI agents on long-horizon professional services tasks with realistic application dependencies.

This is our implementation of the APEX-Agents benchmark - an agentic work task evaluation open-sourced by @mercor_ai. It tests AI agent ability to execute realistic tasks created by investment banking analysts, management consultants, and corporate lawyers.

Mercor released extensive data to enable model evaluation and training across the community, comprising 480 tasks including tool implementations, rubrics, and grading workflows. We exclude tasks with external service dependencies and run the remaining 452 tasks for APEX-Agents-AA. Models complete tasks using Stirrup, our open-source agent harness as used in GDPval-AA, and a customized tool set based on the original benchmark implementation.

Results overview:
🏅 OpenAI, Anthropic and Google are in close competition at the top of the leaderboard, with 33.3% for GPT-5.4, 33.0% for Claude Opus 4.6, and 32% for Gemini 3.1 Pro Preview
📈 The overall scores on Artificial Analysis today are similar to Mercor’s testing, but some models such as GPT-5.4 nano show improvements in score using our Stirrup test harness
↻ We’ll be updating this leaderboard with key releases for agentic work use as a metric for agent capability on well-defined, long horizon work tasks

APEX-Agents overview:
➤ Tasks span 3 professional domains: investment banking, management consulting, and corporate law
➤ The tasks are designed to require long-horizon work with a large number of tools, which are provided through MCP servers as would be used in many real-world deployments (including calendar, chat, spreadsheet and presentation operations, etc.)
➤ Required outputs include direct message responses (87%) and creating or modifying spreadsheets (6.6%), documents (4.8%), and presentations (1.3%)
➤ Model outputs are parsed and graded against binary rubrics using an LLM judge. Each task is run 3 times and scored pass@1 - a pass requires every rubric test to pass
➤ In our APEX-Agents-AA implementation, 452 tasks run in our open-source Stirrup harness with tool management and usage from @mercor_ai's original MCP implementation. This provides a consistent, reproducible baseline for comparing raw model capability that aligns with realistic agent deployments

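Summary: Artificial Analysis has released the APEX-Agents-AA leaderboard, which uses Mercor's APEX-Agents benchmark to evaluate AI agents on long-horizon professional tasks (investment banking, management consulting, corporate law). The test runs 452 tasks through the Stirrup harness with MCP tools, covering message responses, document work, and more. GPT-5.4 leads at 33.3%, closely followed by Claude Opus 4.6 (33.0%) and Gemini 3.1 Pro Preview (32%), a tight three-way race. Scoring uses an LLM judge and pass@1.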
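
The scoring rule described in the post (binary rubrics, three runs per task, a run passes only if every rubric test passes) can be sketched roughly as follows. This is a minimal illustration, not Artificial Analysis's actual grading code: the function names and the input layout are hypothetical, and averaging the three runs per task to estimate pass@1 is an assumption, since the post does not spell out the aggregation.

```python
from statistics import mean


def run_passes(rubric_results):
    """A single run passes only if every binary rubric check passed."""
    return all(rubric_results)


def pass_at_1(task_runs):
    """Estimate pass@1 over a set of tasks.

    task_runs maps a task id to a list of runs; each run is a list of
    booleans, one per rubric check (hypothetical layout).
    Assumption: pass@1 per task is the fraction of its runs that pass,
    and the leaderboard score is the mean of that fraction across tasks.
    """
    per_task = []
    for runs in task_runs.values():
        per_task.append(mean([1.0 if run_passes(r) else 0.0 for r in runs]))
    return mean(per_task)


# Two made-up tasks, three runs each.
example = {
    "task_a": [[True, True], [True, False], [True, True]],  # 2 of 3 runs pass
    "task_b": [[False], [False], [False]],                  # 0 of 3 runs pass
}
score = pass_at_1(example)
print(f"pass@1 = {score:.3f}")
```

Because one failed rubric check fails the whole run, a model that gets most checks right on every task can still score low, which may help explain why the top scores sit near one third.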

AK@_akhaliq · Apr 7

gradio.Server: Any Custom Frontend with Gradio's Backend. Build with your own frontend framework, like React, Svelte, or even plain HTML/JS, while still benefiting from Gradio's queuing system, API infrastructure, MCP support, and ZeroGPU on Spaces. Blog: https://huggingface.co/blog/introducing-gradio-server

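Summary: gradio.Server lets developers build apps with any frontend framework, such as React, Svelte, or plain HTML/JS, while keeping Gradio's backend capabilities, including its queuing system, API infrastructure, MCP support, and ZeroGPU on Spaces.

OpenAI Developers@OpenAIDevs · Apr 2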

Keep the work and the ticket in sync. @linear plugin in the Codex app.

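Summary: The Codex app has added a Linear plugin that keeps code work and project management in sync. Developers can track and update Linear ticket status directly from their coding environment, keeping work progress and task records consistent without switching tools.

Thariq@trq212 · Mar 30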

The more I've been digging into the new Figma MCP, the more excited I am about it. Something new I'm trying is starting with a very ugly sketch in Figma, and then having Claude Code flesh it out in Figma so I can tweak and edit before sending the final back to Claude Code.

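Summary: Trying a new workflow: draw a rough sketch in Figma, have Claude Code flesh out the details, tweak by hand, then send it back for another iteration. Also announces a March 31 livestream with Figma on tips for using the MCP.

OpenAI Developers@OpenAIDevs · Mar 27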

We're rolling out plugins in Codex. Codex now works seamlessly out of the box with the most important tools builders already use, like @SlackHQ, @Figma, @NotionHQ, @gmail, and more. http://developers.openai.com/codex/plugins

Summary: Codex has officially launched plugins, with out-of-the-box support for tools developers commonly use, such as Slack, Figma, Notion, and Gmail.

Google DeepMind@GoogleDeepMind · Mar 26

Say hello to Gemini 3.1 Flash Live. 🗣️ Our latest audio model delivers more natural conversations with improved function calling – making it more useful and informed. Here’s what’s new 🧵

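Summary: The Gemini 3.1 Flash Live audio model has been released. It supports more natural real-time conversations and has improved function calling, making AI assistants more useful and better informed.

karminski-牙医@karminski3 · Mar 26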

awesome 👍

Summary: Awesome 👍

Claude@claudeai · Mar 26

Your work tools in Claude are now available on mobile. Explore Figma designs, create Canva slides, check Amplitude dashboards, all from your phone. Give it a try: http://claude.com/download

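Summary: Claude's mobile app now integrates work tools, letting users view Figma designs, create Canva slides, and browse Amplitude dashboards directly on their phones for cross-device collaboration.

Thariq@trq212 · Mar 24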

Today’s Figma MCP update makes it one of the strongest integrations with Claude Code I’ve seen. You can now use Claude Code to design in Figma with the full context of your design systems.

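Summary: Figma has released an MCP update, now in open beta. A new use_figma tool lets AI agents design directly on the canvas with full design-system context, and it integrates deeply with Claude Code so design work can be done in Figma from Claude Code.

Boris Cherny@bcherny · Mar 20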

Let us know what you think!

Summary: Claude Code has released a channels feature that supports remote control of sessions over MCP through Telegram and Discord, so users can message Claude Code directly from their phones.