# Google推出Gemma 4 12B无编码器多模态模型

- 来源：Google AI Developers (@googleaidevs)
- 发布时间：2026-06-04 00:07
- AIHOT 分数：77
- AIHOT 链接：https://aihot.virxact.com/items/cmpy9v46t0346slax5p31qbmq
- 原文链接：https://x.com/googleaidevs/status/2062204432658386950

## AI 摘要

Google发布Gemma 4 12B，一款无编码器的统一多模态模型，可直接将视觉和音频输入送入LLM主干，无需传统多模态编码器。该模型填补了移动端E4B模型与26B MoE模型之间的空白，封装前沿推理与原生音频能力，采用Apache 2.0许可。在16GB VRAM下即可本地运行复杂多步骤智能体工作流，性能接近26B模型。

## 正文

We're launching Gemma 4 12B： Our unified， encoder-free model that brings powerful multimodal intelligence straight to your laptop 🚀

The model bridges the gap between our mobile E4B model and larger 26B MoE models， packaging frontier-class reasoning and native audio into a highly optimized footprint， all under a permissive Apache 2.0 license.

Here's what makes it unique：

+ Encoder-Less Architecture： We removed the multimodal encoders. The vision and audio inputs flow directly into the LLM backbone.
+ Agentic Performance （16GB VRAM）： Run complex， multi-step workflows locally， with performance nearing our 26B model.