# 以全新生成式媒体模型与工具激发创意

- 来源：Google DeepMind：Blog（RSS）
- 发布时间：2025-05-20 17:45
- AIHOT 标记：精选
- AIHOT 链接：https://aihot.virxact.com/items/cmnwsdqam005aslage7a4gx8t
- 原文链接：https://deepmind.google/blog/fuel-your-creativity-with-new-generative-media-models-and-tools

## 精选理由

Google发布Veo 3与Imagen 4生成模型及电影制作工具Flow

## AI 摘要

发布新一代生成式媒体模型 Veo 3 与 Imagen 4，以及专为电影制作打造的工具 Flow，支持更高质量的视频与图像生成及专业影视创作流程。

## 正文

Fuel your creativity with new generative media models and tools

Fuel your creativity with new generative media models and tools

May 20, 2025

· 14 min read

x.comFacebookLinkedInMail

Introducing Veo 3 and Imagen 4, and a new tool for filmmaking called Flow.

Eli Collins

VP, Google DeepMind

x.comFacebookLinkedInMail

Today, we’re announcing our newest generative media models, which mark significant breakthroughs. These models create breathtaking images, videos and music, empowering artists to bring their creative vision to life. They also power amazing tools for everyone to express themselves.

Veo 3 and Imagen 4, our newest video and image generation models, push the frontier of media generation, with their groundbreaking new capabilities. We're also expanding access to Lyria 2, giving musicians more tools to create music. Finally, we’re inviting visual storytellers to try Flow, our new AI filmmaking tool. Using Google DeepMind’s most advanced models, Flow lets you weave cinematic films with more sophisticated control of characters, scenes and styles, to bring your story to life.

We’ve partnered closely with the creative industries — filmmakers, musicians, artists, YouTube creators — to help shape these models and products responsibly and to give creators new tools to realize the possibilities of AI in their art.

Veo 3: Video, meet audio

Veo 3, our new state-of-the-art video generation model, not only improves on the quality of Veo 2, but for the first time, can also generate videos with audio — traffic noises in the background of a city street scene, birds singing in a park, even dialogue between characters.

Across the board, Veo 3 excels from text and image prompting to real-world physics and accurate lip syncing. It’s great at understanding; you can tell a short story in your prompt, and the model gives you back a clip that brings it to life. Veo 3 is available today for Ultra subscribers in the United States in the Gemini app and in Flow. It’s also available for enterprise users on Vertex AI.

Veo 2 updates: New capabilities built with and for filmmakers

As we advance Veo 3, we’ve also added new capabilities to our popular Veo 2 model informed by our work with creators and filmmakers. Today, we’re launching several of these new capabilities, including:

Our state-of-the-art reference powered video capability allows you to give Veo images of characters, scenes, objects, and even styles for better creative control and consistency.

Camera controls help you define precise camera movements, including rotations, dollies and zooms, to achieve the perfect shot.

Outpainting allows you to broaden your frame, turning your video from portrait to landscape, and making it easier to fit any screen size, intelligently adding to the scene.

Object add and remove lets you add or erase objects from your videos. Veo understands scale, interactions, and shadows, and uses this understanding to create a natural, realistic-looking scene.

Reference powered video and camera controls are available now in Flow. We're excited to bring all these new capabilities to the Vertex AI API in the coming weeks, and to more products over the next few months.