AIHOT

Chubby♨️@kimmonismus · Apr 28 · 32

Looks like either today or Thursday is shipping day - again. Excited for the coming release

Alibaba Cloud@alibaba_cloud · Apr 28 · 33

Your media library should be a valuable asset, not a liability. Alibaba Cloud’s Media AI solution provides a unified AI platform that understands, organizes, and accelerates your entire media workflow by automatically tagging and summarizing video content, moderating content at the frame level, and enabling AI search across multimodal content. So your content finally starts working for you. 🔗 https://int.alibabacloud.com/m/1000412499/

歸藏(guizang.ai)@op7418 · Apr 28 · 65

Xiaomi is awesome! I applied in the morning and it arrived by noon: 329 in bonus credit, equivalent to one month of Codeplan Pro membership.

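Summary: Xiaomi announced that it is open-sourcing the entire MiMo-V2.5 model series under the permissive MIT license, allowing free commercial use, further training, and fine-tuning. The company also launched the Orbit 100T Token program to reward developers and builders. It has two parts: the "100 Trillion Token Creator Incentive Program" for AI builders, in which successful applicants can receive up to 1.6 billion Credits worth 659 yuan; and the "Agent Ecosystem Co-building Program" for Agent framework teams, which gives frameworks free MiMo token access for a limited time so that end users can try the models at no cost.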

PixVerse@PixVerse_ · Apr 28 · 40

Happy Horse has officially galloped onto PixVerse. Start from a prompt, or drop in a first frame and let it run wild. Limited Time — Extra 50% OFF Ends May 6 · 07:00 UTC / 00:00 PDT Saddle up. Try Happy Horse on PixVerse today. RT+ Follow + Reply = 300Creds(48H ONLY)

Kling AI@Kling_ai · Apr 28 · 54

720p saw the rainy city, but 4K sees every strand of light inside the rain. 🌧️ See more in Kling 4K.

Alibaba Cloud@alibaba_cloud · Apr 28 · 36

🚀 Alibaba Cloud Releases DDoS Security Operations Agent (Anti-DDoS SecOps Agent). Powered by LLMs, this cloud-native security agent supports natural language interaction and automates the generation of protection policies. Learn more: https://int.alibabacloud.com/m/1000412296/

歸藏(guizang.ai)@op7418 · Apr 28 · 50

Codex reset its rate limits again; it happens every weekend. OpenAI is on a roll.

Alibaba Cloud@alibaba_cloud · Apr 28 · 35

🚀 Claw Talks Ep2 | Bring Claw to Work with QoderWork & Quick BI ⏰ Apr 29, 2026 | 5:00 PM (UTC+8) 👉 Live: https://youtu.be/cK3qfRTjgWE See how QoderWork makes AI a true work partner—enabling secure desktop automation and seamless Quick BI integration for analytics, reporting, content creation, and workflows. 📌 Join live and see the future of enterprise productivity! #AlibabaCloud #ClawTalks #QoderWork #QuickBI #EnterpriseAI

Tibo@thsottiaux · Apr 28 · 57

Don't just reset Codex rate limits for fun, it costs money. Don't just reset Codex rate limits for fun, it costs money. ... but the vibes are good ... I have reset Codex rate limits for ALL paid plans to celebrate a good week and allow everyone to build more with GPT-5.5. Enjoy

Baidu Inc.@Baidu_Inc · Apr 28 · 49

GenFlow 4.0 is live, and it's already serving 100M+ monthly active users with 200M tasks completed each month! 🚀 Jointly released by Baidu Wenku and Baidu Drive, GenFlow 4.0 is a major upgrade to our general AI Agent, with a fully revamped Office Agent at its core. Users can now invoke PowerPoint, Excel, and Word Agents in parallel from a single prompt. GenFlow 4.0 is also deeply integrated with OpenClaw, deployable in one click from the Baidu Drive PC or mobile app, turning Baidu Drive into a personal AI workspace. More to come at Baidu Create 2026 in Beijing May 13-14, where we'll explore this year's theme: "Agents at Scale."

OpenCode@opencode · Apr 28 · 35

Okay it's official, Kimi K2.6 3x usage on Go for another week

PixVerse@PixVerse_ · Apr 28 · 27

Nice PSA! So inspired to watch our creative partners live out AI for Good. Join the AI for Good Film Festival with PixVerse — and let’s head to Geneva together! Details >> https://app.pixverse.ai/challenge/brand/398802048463808

SemiAnalysis@SemiAnalysis_ · Apr 28 · 57

8x VLLM CUDA MOAT ALERT: InferenceX has added @deepseek_ai V4 Pro for @vllm_project for day 3 performance across B200, B300, H200, GB200 disagg. We are seeing that B300 is up to 8x faster than H200. The team is working on benchmarking vLLM 0.20 which has the new DeepGEMM MegaMoE which fuses EP dispatch/EP combine/GEMMs & SwiGLU activations into a single mega-kernel, we believe that the perf will be even better. Thank you to vLLM maintainers from @NVIDIAAI & @rogerw0108 & team from @interact for their passion for open source & burning the midnight oil over the weekend!
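
The post says MegaMoE fuses EP dispatch, EP combine, the GEMMs, and the SwiGLU activation into one kernel. To make those stages concrete, here is an unfused, single-process reference in pure Python. It is a toy sketch, not DeepGEMM or vLLM code: top-1 routing, the tiny weight matrices, and every function name are assumptions for illustration, and nothing here models expert parallelism across GPUs.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]
Matrix = List[List[float]]
Expert = Tuple[Matrix, Matrix, Matrix]  # (w_gate, w_up, w_down)


def matvec(w: Matrix, x: Vector) -> Vector:
    """Dense matrix-vector product (stands in for a GEMM)."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def silu(v: float) -> float:
    return v / (1.0 + math.exp(-v))


def swiglu(gate: Vector, up: Vector) -> Vector:
    """SwiGLU activation: SiLU(gate) multiplied elementwise by up."""
    return [silu(g) * u for g, u in zip(gate, up)]


def moe_forward(tokens: List[Vector], expert_ids: List[int],
                experts: Dict[int, Expert]) -> List[Vector]:
    """Unfused MoE layer with top-1 routing, one stage at a time."""
    # 1. Dispatch: group token indices by the expert they were routed to.
    buckets: Dict[int, List[int]] = {}
    for idx, eid in enumerate(expert_ids):
        buckets.setdefault(eid, []).append(idx)

    outputs: List[Vector] = [[] for _ in tokens]
    for eid, indices in buckets.items():
        w_gate, w_up, w_down = experts[eid]
        for idx in indices:
            x = tokens[idx]
            # 2. First GEMMs: gate and up projections.
            gate, up = matvec(w_gate, x), matvec(w_up, x)
            # 3. SwiGLU activation.
            hidden = swiglu(gate, up)
            # 4. Second GEMM: down projection.
            y = matvec(w_down, hidden)
            # 5. Combine: write the result back to the token's original slot.
            outputs[idx] = y
    return outputs


# Hypothetical 2x2 weights: expert 0 passes values through, expert 1 gates to zero.
identity = [[1.0, 0.0], [0.0, 1.0]]
zeros = [[0.0, 0.0], [0.0, 0.0]]
experts = {0: (identity, identity, identity), 1: (zeros, identity, identity)}

out = moe_forward([[1.0, 2.0], [3.0, 4.0]], expert_ids=[1, 0], experts=experts)
```

Each numbered stage is a separate pass over the data here. A fused kernel aims to perform the same stages without writing the intermediate `gate`, `up`, and `hidden` values back to memory in between.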

Peter Steinberger 🦞@steipete · Apr 28 · 35

Finally have great solutions for PR/Issue management, remote test execution, massive CI infra for testing. Streamlines a lot of the work.

阿绎 AYi@AYi_AInotes · Apr 28 · 69

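Damn, the open-source repo OpenAI just dropped throws the future of voice interaction right in everyone's face 🤯🤯🤯 They released the official voice control component for gpt-realtime-1.5, so you can now use natural speech to control an app's UI state directly, instead of converting speech to text and then issuing commands. The demos in the video are pretty striking: say "switch to dark mode" and the whole interface instantly goes dark. Read your name and birthday aloud to a form and the fields fill in automatically while the progress bar updates in real time. Best of all is chess: say "knight to F3" and the piece moves; say "reset the board" and it clears in a second. It is as if the model always knows the current on-screen state, and voice becomes fully equivalent to mouse and keyboard. Honestly, this is not a minor speech-to-text upgrade; as I see it, this is a real turning point in the interaction paradigm. Voice used to be an input layer; now it is the app's top-level control layer. It feels like a sci-fi movie, where you say something to the screen and things just change 🤩 Even better, they open-sourced the whole implementation 🤯🤯🤯 realtime-voice-component is not a half-finished demo but a complete React reference implementation. One line of code adds a floating button, you define a few tools with Zod, and in ten minutes your existing web app has voice control. The smartest design choice is that tools are entirely owned by the app: the model can only call the narrow actions you predefine and cannot touch the browser at will, which keeps things safe and controllable. This is ten thousand times more dependable than the earlier Computer Use. Computer Use has the AI click around the screen blindly, whereas this has the AI call interfaces you wrote. One is a black box, the other a fully controllable white box, and that is what can actually ship to production. People have already hooked it up to a protein structure visualization tool, design software, and internal enterprise dashboards. Every hands-busy scenario you can think of (driving, cooking, design work, surgery) could eventually be voice-controlled. That means voice is becoming an operating-system-level interface, and OpenAI has already built all the wheels for you. If you want to try it, fork the repo, set an API key, and run the demo to feel that say-a-word-and-the-world-changes magic. As usual, the GitHub link is in the comments 👇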

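Summary: OpenAI open-sourced the official voice control component for gpt-realtime-1.5, which lets users control an app's UI state directly with natural speech rather than only transcribing speech to text. The component is a complete React reference implementation that developers can integrate quickly. Its core idea is that tools are predefined by the app and the model can only invoke these restricted actions, which keeps it safe and controllable. This marks voice moving from an input layer to a top-level control layer, opens new interaction possibilities for hands-busy scenarios such as design and driving, and is an important turning point in the interaction paradigm.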
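
The "app-owned tools" idea described above can be sketched generically: the application registers a small set of narrow actions, and anything the model asks for outside that set is rejected before it touches app state. This is a minimal illustration in Python, not the realtime-voice-component API (which the post describes as a React library using Zod); `ToolRegistry`, `set_theme`, and the `state` dictionary are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict


@dataclass
class Tool:
    """A narrow action the application exposes to the model."""
    handler: Callable[..., Any]
    params: Dict[str, type]  # argument name -> required Python type


@dataclass
class ToolRegistry:
    tools: Dict[str, Tool] = field(default_factory=dict)

    def register(self, name: str, params: Dict[str, type]):
        def decorator(fn: Callable[..., Any]) -> Callable[..., Any]:
            self.tools[name] = Tool(handler=fn, params=params)
            return fn
        return decorator

    def dispatch(self, name: str, args: Dict[str, Any]) -> Any:
        # Reject anything the app did not explicitly register.
        if name not in self.tools:
            raise PermissionError(f"unknown tool: {name}")
        tool = self.tools[name]
        # Validate argument names and types before touching app state.
        if set(args) != set(tool.params):
            raise ValueError(f"bad arguments for {name}: {sorted(args)}")
        for key, expected in tool.params.items():
            if not isinstance(args[key], expected):
                raise TypeError(f"{key} must be {expected.__name__}")
        return tool.handler(**args)


# Hypothetical app state and tool, for illustration only.
state = {"theme": "light"}
registry = ToolRegistry()


@registry.register("set_theme", {"mode": str})
def set_theme(mode: str) -> str:
    if mode not in ("light", "dark"):
        raise ValueError("mode must be 'light' or 'dark'")
    state["theme"] = mode
    return mode


# A tool call as a model might emit it after hearing "switch to dark mode".
registry.dispatch("set_theme", {"mode": "dark"})
```

Because the registry is the only path from model output to app state, a request for an unregistered action such as `open_browser` fails with `PermissionError` instead of doing something unexpected.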

OpenClaw🦞@openclaw · Apr 28 · 50

OpenClaw 2026.4.26 🦞 🎙️ Google Live Talk 🦙 Better Ollama/local models 🧳 Bring over Claude + Hermes setups 🔐 One-command Matrix E2EE Big release. Local models eat well. https://github.com/openclaw/openclaw/releases/tag/v2026.4.26

meng shao@shao__meng · Apr 28 · 76

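Devin for Terminal: from cloud collaboration back to the local terminal. @cognition has repackaged the capabilities accumulated over two years of building Devin into a command-line Agent that runs in your local shell. Installation is a single command: curl -fsSL https://cli.devin.ai/install.sh | bash It belongs to the same category as Claude Code, Codex CLI, Cursor CLI, and Aider: local CLI coding agents. But Cognition's differentiator is not "how powerful it is locally" but "local work can be handed back to the cloud at any time." The key design is Local-to-Cloud Handoff. The pain point of traditional CLI agents is that once a task exceeds what your machine can handle (long test runs, large refactors, hours of continuous execution), you have to keep your laptop open. Devin for Terminal's approach: · The same session can be handed off seamlessly from local to cloud Devin · Cloud Devin has its own virtual machine, browser, test environment, video screen recording, and auto-fix · You close your computer and come back to a finished PR. This enables several interesting workflows: · Multi-agent parallelism: multiple Devins work on the same codebase at once, with no manual git worktree management · Sandbox safety: dangerous operations such as rm -rf happen in the cloud VM and never touch your machine · Background wrap-up: finish a feature locally, then hand it to the cloud to run tests, open a PR, and reply to review comments. This is Cognition's real asset: its existing cloud infrastructure (Devin 2.x's VM, Wiki, Search, Review, and so on) is now reused as the CLI's "remote backend." The local CLI is just a new entry point to cloud Devin. Multi-model routing: it is not tied to a single model. It supports the full range of frontier models from Anthropic, OpenAI, and Google, plus Cognition's own SWE-1.6 and open-source models (Kimi, GLM). The officially recommended three-tier split: · Opus 4.7 / GPT-5.5: multi-file refactors, architecture changes, complex reasoning · SWE-1.6: everyday edits, bug fixes, Q&A. Faster and cheaper, it is Cognition's in-house model, about 11% better than SWE-1.5 on SWE-Bench Pro, with throughput of 950 tok/s · Short names opus / sonnet / codex / gemini automatically point to the latest versions, and new models are supported "within minutes" of launch. Alt+T switches the thinking level in real time, and /model switches models mid-session. This "model as configuration" approach matches Cursor and Aider, but Devin makes it one of its headline selling points. Custom Rust terminal rendering library: the team wrote a custom terminal rendering library in Rust with a single goal, to be fast and responsive. They also pulled off a bit of engineering showmanship by running it on a physical VT-100 terminal from 1978. The VT-100 is the de facto origin of modern terminal protocols (ANSI escape sequences), and nearly every modern terminal emulator imitates it. Lighting up a 2026 AI Agent on real VT-100 hardware is a very precise cultural signal: "The terminal has barely changed since the 1970s; what has changed is what you do in it."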

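Summary: Cognition launched Devin for Terminal, packaging the capabilities of its cloud AI coding assistant Devin into a local command-line Agent. Its core differentiator is the "local-to-cloud seamless handoff" design: when a task exceeds the local machine's capabilities, the same session can be handed off to cloud Devin's virtual machine environment, and the user can go offline and wait for the result. The tool reuses existing cloud infrastructure as its backend and supports multi-model routing, with a choice of models from Anthropic, OpenAI, and Google as well as the in-house SWE-1.6, switchable in real time during a session. The team also built a fast terminal rendering library in Rust, stressing that the terminal's form is unchanged while the work done inside it has been transformed.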

Berryxia.AI@berryxia · Apr 28 · 45

Good news: Outlook finally freaking supports Agents. Bad news: will it be supported in China too?

Berryxia.AI@berryxia · Apr 28 · 54

MiniMax's Music-2.6 is free to use on Cloudflare this week! Generate full-length songs or instrumentals from a text prompt, with optional auto-generated lyrics. Go try it!!!

TestingCatalog News 🗞@testingcatalog · Apr 28 · 56

Microsoft rolled out Agent Mode for Outlook to Frontier early access users. > Copilot in Outlook is now agentic, taking on the ongoing work of running your inbox and calendar. It triages emails, reschedules conflicts, and surfaces what matters most before you even ask.

TestingCatalog News 🗞@testingcatalog · Apr 28 · 49

ICYMI: Gemini can now generate Docs and Sheets on web and mobile. Not sure when it was added though. Slides are not working for now but looking at Gemini for Business, we will likely get them too, as well as an inline editor potentially.

Berryxia.AI@berryxia · Apr 28 · 64

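A fully local Agent that lives right in your browser. Powered by Gemma 4 E2B and WebGPU, it uses native tool calling to: 🔍 Search browsing history 📄 Read and summarize page content 🔗 Manage tabs. Runs 100% locally! No server needed!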

TestingCatalog News 🗞@testingcatalog · Apr 28 · 37

Anthropic is working on sidebar customisation for Claude on mobile and a common Tasks list for Claude Dispatch and Claude Code. Conway WIP as well 🚧

Luma@LumaLabsAI · Apr 28 · 57

Not sure which direction to take it? Explore them all. Set your reference and let Luma Agents explore every visual style you have in mind. From dark and cinematic to bright and editorial, every aesthetic direction rendered and ready to compare. Build it now → https://app.lumalabs.ai/?seed=922de654-a944-4679-adbf-d23cbfb48307

Satya Nadella@satyanadella · Apr 28 · 58

Agent Mode is here in Outlook! Copilot can now help run your inbox and calendar, triaging emails, rescheduling meetings, and helping you stay on top of what matters most.

Google Gemini@GeminiApp · Apr 28 · 31

Ready to unlock your creativity with Gemini Canvas? 🪄 Don’t miss our next Discord event to see Gemini Creative Technologist @DavidMaliglowka live demo his latest Canvas and Nano Banana workflows to help you advance your own creative prompting techniques. 🗓️ Wednesday, April 29th ⏰ 11:30 AM PT 📍 http://discord.gg/gemini

Suno@suno · Apr 28 · 49

Screenshot it. Song it. #SunoTextSong

OpenAI Developers@OpenAIDevs · Apr 28 · 55

You can build interactive applications with gpt-realtime-1.5, so users can control app state more naturally with voice. Hi Chappy 👋

François Chollet@fchollet · Apr 28 · 60

Keras Kinetic has a new alpha release: v0.0.2! Including a new docs website: http://kinetic.readthedocs.io Kinetic is my favorite new release from the Keras team: a super simple Modal-like API to run training jobs on TPU.

OpenAI Developers@OpenAIDevs · Apr 28 · 66

📣 What if every open issue had a Codex agent? That’s the idea behind Symphony, an open-source agent orchestrator for Codex that turns task trackers into always-on systems for agentic work, letting humans focus on review and direction.

Google AI Developers@googleaidevs · Apr 28 · 52

Zoom in on how @GoogleGemma 4 is optimized to handle high-concurrency serving for complex tasks (such as generating SVGs) — on a single GPU. ✓ 10+ sessions are sent to the 26B A4B model ✓ The system routes, accelerates, and processes those workloads — without bottlenecking ✓ A live dashboard visually tracks the load balancing in real time, displaying active slots, context sizes, and token generation speeds Watch the demo to see it in action ⬇️
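
The "active slots" idea in the post can be illustrated with a small simulation: many sessions arrive at once, but only a fixed number are processed concurrently. This is a hedged sketch using Python's `asyncio`, not Google's serving stack. The slot count of 4, the `SlotPool` class, and the sleep that stands in for inference are all assumptions.

```python
import asyncio
from typing import List, Tuple


class SlotPool:
    """Caps how many sessions are processed at once and records the peak."""

    def __init__(self, num_slots: int) -> None:
        self._sem = asyncio.Semaphore(num_slots)
        self.active = 0
        self.peak = 0

    async def run(self, session_id: int) -> str:
        async with self._sem:
            self.active += 1
            self.peak = max(self.peak, self.active)
            try:
                # Stand-in for model inference; a real server would decode tokens here.
                await asyncio.sleep(0.01)
                return f"result-{session_id}"
            finally:
                self.active -= 1


async def serve(num_sessions: int, num_slots: int) -> Tuple[List[str], int]:
    pool = SlotPool(num_slots)
    # gather() returns results in submission order, regardless of finish order.
    results = await asyncio.gather(*(pool.run(i) for i in range(num_sessions)))
    return list(results), pool.peak


results, peak = asyncio.run(serve(num_sessions=10, num_slots=4))
```

The `active` and `peak` counters are the kind of values a live dashboard could display; the remaining sessions simply wait until a slot frees up.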

MiniMax (official)@MiniMax_AI · Apr 28 · 57

Really excited about this one. Music 2.6 is now available on @Cloudflare AI — full songs with vocals, instrumentals, covers, all from text. We want honest feedback from real users. Give it a spin and let us know what hits (and what doesn't).

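Summary: The Music 2.6 model is now available on the Cloudflare AI platform, where users can generate full songs with vocals, instrumentals, or covers from text prompts. Provided by MiniMax AI, the model is free to use this week and supports generating full songs or instrumentals from text, with optional automatic lyrics. Running on Cloudflare's global network, it enables fast inference and suits developers building music applications on Cloudflare Workers. The author invites users to try it and give honest feedback.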

Xiaomi MiMo@XiaomiMiMo · Apr 28 · 53

We believe open source is more than releasing weights — it’s about building ecosystems. Today, we’re introducing MiMo Orbit. A program to support builders, frameworks, and the next wave of AI applications. MiMo Orbit includes: 1️⃣ 100T Token Grant for Builders We’re making 100 trillion (100T) tokens available to AI builders worldwide. • Free access, while supplies last • Application-based • Up to a 1-month Max Plan (1.6B credits) 📅 Apr 27, 2026, 9:00 AM – May 27, 2026, 9:00 AM (PDT) Apply here: http://100t.xiaomimimo.com 2️⃣ Agent Ecosystem Program We support Agent frameworks globally with free integration access and zero-friction onboarding for your users. If you’re building an Agent framework, let’s collaborate: 📩 business-mimo@xiaomi.com

Luma@LumaLabsAI · Apr 28 · 56

Hot sauce doesn't brand like kombucha. Kombucha doesn't brand like coffee. Luma Agents know the difference, and build the whole system in minutes. Logos, atmosphere, editorial product shots, color tokens, type specimens. One brief. One brand. Ready for shelf, feed, and menu. Try it now → https://app.lumalabs.ai/?seed=20c28b58-6310-4f7e-8b78-c334121d3f8c

TestingCatalog News 🗞@testingcatalog · Apr 28 · 57

Sonar 2 is now available on Perplexity web 👀 > Sonar models are Perplexity’s in‑house LLMs, optimized specifically for fast, web‑grounded search and answering Which base do you think was used for Sonar 2? DeepSeek V4, Kimi K2.6, or Qwen?

[Quoting @sethsaler]: Sonar 2 from Perplexity. 👀 @testingcatalog @btibor91

Z.ai@Zai_org · Apr 27 · 41

The "triple usage" period for GLM-5.1 and GLM-5-Turbo is now extended to June 30. Availability: Anytime except 2-6 AM ET.

SemiAnalysis@SemiAnalysis_ · Apr 27 · 50

PALISADES TAHOE, APRIL 26, 2026 — InferenceX has added DeepSeekv4 MTP support with chat template for @sgl_project's B300! Great Work to @radixark @liin1211 for the engineering! Massive interactivity gains, and 7x throughput at iso-interactivity!

elvis@omarsar0 · Apr 27 · 59

Don't try to build a self-improving AI agent without evals. You are just wasting time and compute. An agent can't improve from traces it can't evaluate. This is why it's exciting to see @FutureAGI_ going fully open source with their platform. It combines the best of all the eval tools and methods in one stack. They've shipped a set of tools to make it easier for AI devs to reliably ship self-improving agents. There is a lot to like here: - Evals for hallucination, groundedness, PII, toxicity, tool-use correctness, bias, and any custom metric. Every evaluator is readable and modifiable, not a black-box score. No vendor lock-in to worry about. - Six prompt optimization algorithms (GEPA, PromptWizard, ProTeGi, and others) that take production traces and feed them back as training signals. - Multi-turn simulation before launch, including voice agents through LiveKit, VAPI, Retell, and Pipecat. You stress test edge cases before users ever hit them. - Real-time guardrails for jailbreaks, prompt injection, and PII leaks. - OpenTelemetry-native tracing with 4+ languages (Python, TypeScript, Java, and C#), 50+ framework instrumentors (LangChain, LlamaIndex, CrewAI, AutoGen, DSPy, Haystack). - An OpenAI-compatible gateway with 100+ providers, routing strategies, and caching. If self-improving agents are the direction the field is moving, we need eval infrastructures we can actually trust and build on top of. This is that infrastructure, and now it's open. Check it out here: http://github.com/future-agi/future-agi Generous free tier cloud-based offer here: https://shorturl.at/cxYOd
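
To illustrate what "readable and modifiable" evaluators over traces might look like, here is a minimal sketch. It is not the FutureAGI API: the `Trace` type, the email-only PII check, and the word-overlap groundedness score are crude stand-ins chosen for brevity, and the two example traces are invented.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Trace:
    prompt: str
    context: str  # retrieved material the answer should be grounded in
    output: str


EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
WORD = re.compile(r"[a-z0-9]+")


def pii_free(trace: Trace) -> float:
    """1.0 if the output contains no email address, else 0.0."""
    return 0.0 if EMAIL.search(trace.output) else 1.0


def groundedness(trace: Trace) -> float:
    """Fraction of output words that also appear in the context."""
    words = WORD.findall(trace.output.lower())
    if not words:
        return 1.0
    context_words = set(WORD.findall(trace.context.lower()))
    return sum(w in context_words for w in words) / len(words)


EVALUATORS: Dict[str, Callable[[Trace], float]] = {
    "pii_free": pii_free,
    "groundedness": groundedness,
}


def evaluate(traces: List[Trace]) -> Dict[str, float]:
    """Average each metric over all traces."""
    return {
        name: sum(fn(t) for t in traces) / len(traces)
        for name, fn in EVALUATORS.items()
    }


# Invented example traces.
traces = [
    Trace("Who wrote it?", "The report was written by Ada.",
          "Ada wrote the report. Contact ada@example.com"),
    Trace("Capital?", "Paris is the capital of France.",
          "Paris is the capital of France."),
]
scores = evaluate(traces)
```

Scores like these are the signal an improvement loop would need: a change to the agent can only be judged better or worse if its traces can be scored the same way before and after.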

小互@xiaohu · Apr 27 · 55

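I ran a quick test of PixVerse's AI video generation CLI tool. One command generates a video with no browser needed, and an Agent can generate videos directly inside Claude Code and 小龙虾 ("Little Crayfish"). It is not limited to PixVerse's own models: Sora 2, Veo 3.1, and Grok Imagine are all supported and callable through the same CLI... Installation takes two seconds: npm install -g pixverse. Then you can enter a request or a prompt to have it generate a video, and if you link Telegram or Feishu you can control it remotely from your phone... It also supports image-to-video, AI voiceover, lip sync, sound effects, and super-resolution. Basically, whatever the web version can do, the CLI can do too.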

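Summary: PixVerse released an AI video generation CLI tool that lets users generate videos quickly with a single command, without opening a browser. The tool integrates with platforms such as Claude Code, supports calling multiple models including Sora 2, Veo 3.1, and Grok Imagine, and offers all the web version's features, such as image-to-video, AI voiceover, and lip sync. It is easy to install and supports remote control from a phone via Telegram or Feishu.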

Chubby♨️@kimmonismus · Apr 27 · 68

Google just broke a decade-long tradition. At Cloud Next 2026, the company unveiled not one, but two new AI chips, the TPU 8t for training and TPU 8i for inference. For the first time ever, Google is splitting its custom silicon into specialized architectures instead of relying on a one-size-fits-all design. The TPU 8t superpod packs 9,600 liquid-cooled chips delivering 121 FP4 ExaFlops of peak compute, roughly a 3x leap over the previous generation. The TPU 8i delivers 80% better performance-per-dollar than its predecessor, with triple the on-chip memory and a new Boardfly topology that cuts network latency in half. The important aspect: Anthropic, Meta, and now OpenAI are buying multi-gigawatt allocations of TPU capacity. OpenAI booking Google silicon is a first visible crack in NVIDIA's grip on frontier AI training. Broadcom co-designed the TPU 8t, while MediaTek handles the TPU 8i, both fabbed by TSMC. NVIDIA still holds 81% of the AI chip market, but the era of serious competition has officially begun.
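
As a quick sanity check on the superpod figures quoted above, the per-chip peak can be worked out directly. This is back-of-the-envelope arithmetic that assumes the 121 FP4 ExaFlops is spread evenly across all 9,600 chips; the per-chip number is derived here, not stated in the post.

```python
# Figures quoted in the post above.
CHIPS_PER_SUPERPOD = 9_600
SUPERPOD_PEAK_FLOPS = 121e18  # 121 FP4 ExaFlops

# Assumption: peak compute is spread evenly across all chips.
per_chip_flops = SUPERPOD_PEAK_FLOPS / CHIPS_PER_SUPERPOD
per_chip_pflops = per_chip_flops / 1e15

print(f"{per_chip_pflops:.1f} PFLOPS of peak FP4 compute per chip")
# prints "12.6 PFLOPS of peak FP4 compute per chip"
```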
