July 1

00:50

fofr@fofrAI

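Google has released the gemini-skills skill pack, which works through the Gemini Omni API. It supports video editing, text-to-video, image-referenced video generation, and first-frame-to-video generation, and it ships helper tools for preprocessing input video to 10 seconds at 720p, stripping audio, and inspecting video. The same author demonstrates the Omni Flash model's editing ability: given the prompt "turn the table into a shallow pool", the model renders the wet hand, water ripples, refraction, shadows, and sound effects. The API is now available and can be used to build video editing pipelines.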
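
The preprocessing step mentioned above (trim to 10 seconds, scale to 720p, strip audio) can be approximated with a plain ffmpeg call. The sketch below is not the gemini-skills tooling, whose interface is not shown here; it assumes ffmpeg is installed, and the function name `build_preprocess_cmd` and the file names are hypothetical.

```python
from typing import List


def build_preprocess_cmd(
    src: str,
    dst: str,
    max_seconds: int = 10,
    height: int = 720,
    strip_audio: bool = True,
) -> List[str]:
    """Build an ffmpeg argument list (hypothetical helper, not part of gemini-skills)."""
    cmd = [
        "ffmpeg",
        "-y",                           # overwrite the output file if it exists
        "-i", src,                      # input file
        "-t", str(max_seconds),         # keep at most this many seconds
        "-vf", f"scale=-2:{height}",    # target height, width auto-scaled to an even value
    ]
    if strip_audio:
        cmd.append("-an")               # drop all audio streams
    cmd.append(dst)
    return cmd


# Example file names are placeholders.
cmd = build_preprocess_cmd("input.mp4", "clip_720p.mp4")
print(" ".join(cmd))
```

The list can be passed to `subprocess.run(cmd, check=True)` to run the conversion; building it as a list avoids shell-quoting problems with unusual file names.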

fofr: Omni Flash is a smart model. The way the hand is wet, the water ripples, the refraction, the shadows, the sound effects ...

Agents, Google, Tutorial/Practice, Video

00:50

fofr@fofrAI

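The Omni Flash model has strong editing capabilities: it can turn a table into a shallow pool and realistically render the wet hand, water ripples, refraction, shadows, and sound effects. The model is now available via API, and its editing ability is well suited to building impressive pipelines.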

fofr: Omni Flash is a smart model. The way the hand is wet, the water ripples, the refraction, the shadows, the sound effects ...

Google, Image generation, Video, Evaluation/Benchmark

00:35

elvis@omarsar0

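Elvis Saravia praises Google for continuing to lower the cost of using its models. Google has launched two new models in the Gemini API and AI Studio: Nano Banana 2 Lite generates an image in under 4 seconds and costs just $0.034 per 1,000 images; Gemini Omni Flash reaches SOTA on video editing at $0.10 per second, the same price as Veo 3.1 Fast. Saravia says DAIR.AI is using Nano Banana and Gemini to build education research projects and has begun testing Nano Banana 2 Lite.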

Logan Kilpatrick: Introducing Nano Banana 2 Lite 🍌 and Gemini Omni Flash 🔮, our new generative media models in the Gemini API and AI Stu...

Google, Image generation, Model release, Video

00:30

Logan Kilpatrick@OfficialLoganK

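Introducing Nano Banana 2 Lite 🍌 and Gemini Omni Flash 🔮, our new generative media models in the Gemini API and AI Studio! Nano Banana 2 Lite is extremely fast (<4 seconds per image) and cheap ($0.034 / 1K images). Omni Flash is SOTA on video editing at $0.10 / second, the same as Veo 3.1 Fast!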
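
For budgeting, the two quoted prices reduce to simple arithmetic. This is a rough sketch using only the figures in the announcement; it assumes the $0.10 rate applies per second of video and ignores any other charges, which the post does not describe. The function names are illustrative.

```python
# Prices as quoted in the announcement (USD).
IMAGE_PRICE_PER_1K = 0.034      # Nano Banana 2 Lite, per 1,000 images
VIDEO_PRICE_PER_SECOND = 0.10   # Gemini Omni Flash, per second (assumed: per second of video)


def image_cost(n_images: int) -> float:
    """Estimated cost of generating n_images at the quoted per-1K rate."""
    return n_images * IMAGE_PRICE_PER_1K / 1000


def video_cost(seconds: float) -> float:
    """Estimated cost of `seconds` of video at the quoted per-second rate."""
    return seconds * VIDEO_PRICE_PER_SECOND


print(f"5,000 images: ${image_cost(5000):.2f}")
print(f"10 s of video: ${video_cost(10):.2f}")
```

At these rates a single 10-second clip costs about as much as tens of thousands of images, so video seconds are likely to dominate the bill of a mixed pipeline.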

Google, Multimodal, Model release, Video

1 related discussion

00:27

🚨 AI News | TestingCatalog@testingcatalog

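Google has launched two new generative media models in the Gemini API and AI Studio: Nano Banana 2 Lite generates images very quickly (<4 seconds per image) and costs just $0.034 per 1,000 images; Gemini Omni Flash Preview reaches SOTA on video editing and is priced at $0.10 per second, the same as Veo 3.1 Fast. Omni Flash is now available as an API preview.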

Logan Kilpatrick: Introducing Nano Banana 2 Lite 🍌 and Gemini Omni Flash 🔮, our new generative media models in the Gemini API and AI Stu...

Google, Image generation, Model release, Video

00:26

Google DeepMind@GoogleDeepMind

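We are rolling out two major releases: 🔘 Nano Banana 2 Lite: our fastest, cheapest Gemini image model 🔘 Gemini Omni Flash: now available via the Gemini API and @GoogleAIStudio, helping developers generate and edit high-quality video.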

Google, Image generation, Model release, Video

1 related discussion

00:25

Google AI@GoogleAI

Google AI releases two major model updates: Nano Banana 2 Lite and Gemini Omni Flash

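Google AI has rolled out two major model updates: 1) Nano Banana 2 Lite, the fastest and most economical Gemini image model, generates an image from text in under 4 seconds; it is live in the Gemini API and AI Studio and is coming soon to NotebookLM, Google Search, Google Photos, and more. 2) Gemini Omni Flash enters public preview: a natively multimodal model that supports low-cost video generation and conversational editing, with integration available through the Gemini API, AI Studio, and the Gemini Enterprise Agent Platform. Combined, the two models enable quick redesigns of a space: upload a photo, swipe to choose a design, and Omni renders the details as a cinematic animation. A demo app is available in AI Studio.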

Google, Product update, Image generation, Video

1 related discussion