Google 发布 Nano Banana 2 Lite 图像模型与 Gemini Omni Flash 视频模型
阅读原文· the-decoder.comGoogle 推出两款新生成式 AI 模型。Nano Banana 2 Lite 可在 4 秒内生成图像,每张成本 0.034 美元(1K 分辨率),API 名称为 gemini-3.1-flash-lite-image。Gemini Omni Flash 允许开发者通过文本提示在 API 中生成和编辑最长 10 秒的视频,每秒输出价格 0.10 美元。Google 推荐将两个模型链式使用:先用 Nano Banana 2 Lite 生成图像,再传递给 Gemini Omni Flash 转化为视频。两者均使用 SynthID 水印,已通过 Google AI Studio、Gemini API 和 Gemini Enterprise Agent Platform 提供。
Google launches Nano Banana 2 Lite for fast AI images and Gemini Omni Flash for video via API
Key Points
- Google released two new generative AI models. Nano Banana 2 Lite generates images in four seconds and costs $0.034 per image at 1K resolution.
- Gemini Omni Flash lets developers generate and edit videos up to ten seconds long through text prompts via the API for $0.10 per second of output.
- Google recommends chaining both models together so developers can generate images with Nano Banana 2 Lite and then animate them into videos with Gemini Omni Flash.
Google releases two new generative AI models. Nano Banana 2 Lite generates images in four seconds at a fraction of the cost. Gemini Omni Flash opens up video generation and editing via text prompts through the API for the first time.
Nano Banana 2 Lite generates images in four seconds
Google says Nano Banana 2 Lite is built for fast ideation and high-throughput developer pipelines. Text-to-image generation takes four seconds and costs just $0.034 per image at 1K resolution. The new image model goes by gemini-3.1-flash-lite-image in the API.
| Model | Price per image | Resolution |
|---|---|---|
| Nano Banana 2 Lite | $0.034 | 1K |
| Nano Banana 2 | $0.067 | 1K |
| Nano Banana Pro | $0.134 | 1K or 2K |
Despite the speed focus, Google says Nano Banana 2 Lite still delivers reliable prompt following, consistent character rendering, and readable text in generated images. Beyond developer platforms, the model is rolling out across Google's consumer products too, including AI Mode in Google Search, the Gemini app, NotebookLM, Google Photos, Stitch, Google Flow, and Google Ads.

Nano Banana 2 Lite brings the Nano Banana family to three production models. Google positions Nano Banana 2 (Gemini 3.1 Flash Image) as the all-rounder with the best balance of quality and cost. Nano Banana Pro (Gemini 3(.1) Pro Image) targets complex, professional use cases and offers what Google calls the strongest control and most advanced reasoning. 
Comparison table of the Nano Banana model family. | Image: GoogleDevelopers can pick the right model based on whether they need speed, quality, or low cost. Google considers the original Nano Banana (Gemini 2.5 Flash Image) outdated. We still mostly use Nano Banana Pro ourselves, since its image quality and prompt reliability tend to beat both Nano Banana 2 and OpenAI's GPT-Image-2.
Gemini Omni Flash brings video generation to the API
Gemini Omni Flash was first shown at Google I/O and is now available to developers through the Gemini API and Google AI Studio. The model combines Gemini's multimodal reasoning with video generation and editing. Pricing is $0.10 per second of video output, matching Veo 3.1 Fast.
Google says the model's strengths are conversational video editing through natural language, the ability to mix input formats like text, images, and video, and tapping into Gemini's world knowledge for generation. Text and graphics can sync directly with video actions.
Gemini Omni Flash currently only generates ten-second clips. Audio references and scene extensions aren't supported in the API yet. The API schema accepts video references up to three seconds long, but Google says the model doesn't process them correctly yet. Character consistency across scene changes or camera movements is still limited too.
Google recommends chaining both models together
Google sees the biggest payoff in combining the two models. Developers can quickly generate images with Nano Banana 2 Lite and pass them as references to Gemini Omni Flash, which then animates them into video. The Interactions API, which is now Google's default AI API, preserves session history and context, allowing up to three consecutive edits.
Google provides three demo apps to show how the models work together. "Anywhere" places users at famous landmarks via selfie and animates the result. "Space Lift" generates interior design concepts from room photos and turns them into video. "Omni Product Studio" converts static product images into e-commerce videos.
Both models use SynthID watermarks to tag AI-generated content, according to Google. Verification is available through the Gemini app, Gemini in Chrome, or Google Search. Nano Banana 2 Lite and Gemini Omni Flash are available now in Google AI Studio, the Gemini API, and the Gemini Enterprise Agent Platform.