在笔记本电脑上运行 Gemma 4 12B：借助 Google AI Edge 解锁本地智能体工作流

2026-06-03 00:00·30天前

AI 摘要

Google DeepMind 的 Gemma 4 12B 模型可在 16GB RAM 的普通笔记本上运行，支持本地数据处理与视觉洞察生成。macOS 用户可通过 Google AI Edge Gallery 执行动态 Python 代码与可视化，通过 Google AI Edge Eloquent 实现完全离线的语音听写和文本编辑。另外，LiteRT-LM CLI 新增 serve 命令，可创建行业兼容的本地端点，驱动完全本地的 AI 工具和智能体。

原文 · 未翻译

Bringing Gemma 4 12B to your Laptop: Unlocking Local, Agentic Workflows with Google AI Edge

Facebook

Twitter

Mail

Google DeepMind’s latest open model, Gemma 4 12B, is designed to bring agentic, multimodal intelligence directly to your laptop. By combining the model's strengths with the Google AI Edge stack, you can immediately get hands-on to build and experiment locally, on everyday machines (see model card for spec requirement).This model-runtime combination unlocks powerful on-device capabilities, from autonomous data processing and generating rich visual insights, to building fully functional webpages and executing everyday tool use. You can start interacting with Gemma 4 12B across Google AI Edge right now:

Explore Gemma with Google AI Edge Gallery, our local AI showcase app, now available on macOS. With the 12B model you can generate and execute scripts on the fly for tasks such as data analysis.

The Google AI Edge Eloquent on-device, voice dictation app is now available on macOS. We added the ability to interactively polish and rewrite text through voice commands, entirely on-device, powered by the new Gemma 4 12B model.

LiteRT-LM can now serve local, industry compatible endpoints directly from your terminal via the new serve command in the LiteRT-LM CLI. When used with Gemma 4 12B, this is a highly capable and efficient option to power fully-local agentic tools, harnesses, and workflows.

Coding with Google AI Edge Gallery on MacOS

The Google AI Edge Gallery app, now available on macOS, showcases Gemma 4 12B’s coding capability, allowing you to extract meaningful insights from your data right on your device. Through a seamless interface, you can simply describe your analytical goals in natural language. In the example below, we asked the model to “use a python program to render a chart png to compare the top 10 girl names born in 2024 vs 2025” given two text files containing the data. In response, the model dynamically generates Python code, executes it locally, and converts raw data into beautiful, easy-to-grasp visualizations and insights.

Google Developers Blog（RSS）

75导出 Markdown

在笔记本电脑上运行 Gemma 4 12B：借助 Google AI Edge 解锁本地智能体工作流

2026-06-03 00:00·30天前

阅读原文· developers.googleblog.com

AI 摘要

原文 · 保持原样，未翻译

Bringing Gemma 4 12B to your Laptop: Unlocking Local, Agentic Workflows with Google AI Edge

Facebook

Twitter

Mail

在笔记本电脑上运行 Gemma 4 12B：借助 Google AI Edge 解锁本地智能体工作流

在笔记本电脑上运行 Gemma 4 12B：借助 Google AI Edge 解锁本地智能体工作流

Import the Gemma 4 12B model as "gemma4-12b" litert-lm import --from-huggingface-repo=litert-community/gemma-4-12B-it-litert-lm gemma-4-12B-it.litertlm gemma4-12b # Start the OpenAI-compatible server litert-lm serve

Import the Gemma 4 12B model as "gemma4-12b" litert-lm import --from-huggingface-repo=litert-community/gemma-4-12B-it-litert-lm gemma-4-12B-it.litertlm gemma4-12b # Start the OpenAI-compatible server litert-lm serve

Import the Gemma 4 12B model as "gemma4-12b" litert-lm import --from-huggingface-repo=litert-community/gemma-4-12B-it-litert-lm gemma-4-12B-it.litertlm gemma4-12b # Start the OpenAI-compatible server litert-lm serve

Import the Gemma 4 12B model as "gemma4-12b" litert-lm import --from-huggingface-repo=litert-community/gemma-4-12B-it-litert-lm gemma-4-12B-it.litertlm gemma4-12b # Start the OpenAI-compatible server litert-lm serve