# Google发布Gemini 3.5 Flash：在2026年I/O大会上推出一款更快速、更经济的AI智能体与编码模型

- 来源：MarkTechPost（RSS）
- 作者：Michal Sutter
- 发布时间：2026-05-20 15:12
- AIHOT 分数：68
- AIHOT 链接：https://aihot.virxact.com/items/cmperngt3008xslbg31w7eyix
- 原文链接：https://www.marktechpost.com/2026/05/20/google-introduces-gemini-3-5-flash-at-i-o-2026-a-faster-and-cheaper-model-for-ai-agents-and-coding

## AI 摘要

Google在2026年I/O大会上发布了新一代模型Gemini 3.5 Flash。该模型在代码生成和AI智能体任务等基准测试中性能超越了此前的旗舰版本，同时运行速度提升四倍，推理成本降低一半。这一更新显著增强了模型在实际应用中的效率与经济性，特别面向需要快速响应和复杂任务处理的开发者场景。

## 正文

Google just released Gemini 3.5 Flash at Google I/O May, 2026. It is the first Gemini 3.5 model. The series combines frontier intelligence with action. Google calls it a major leap for intelligent agents. The Flash tier has historically been faster and cheaper. 3.5 Flash outperforms Gemini 3.1 Pro on challenging benchmarks. The previous premium tier has now been surpassed.

What the Benchmarks Say

Gemini 3.5 Flash scores 76.2% on Terminal-Bench 2.1. That benchmark tests coding performance. It scores 1656 Elo on GDPval-AA. That measures real-world agentic task performance. It scores 83.6% on MCP Atlas. MCP Atlas measures scaled tool-use reliability. It scores 84.2% on CharXiv Reasoning. That benchmark tests multimodal understanding.

Gemini 3.5 Flash is 4x faster on output tokens. Tasks often complete at less than half the cost. Official pricing is $1.50 per million input tokens. Output tokens cost $9.00 per million. Cached input is priced at $0.15 per million.

The context window is 1,048,576 input tokens. Maximum output is 65,536 tokens. Supported inputs are text, image, audio, and video. The knowledge cutoff is January 2026. Dynamic thinking is on by default. The model auto-allocates more compute for harder problems.

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/#frontier-intelligence

Built for Agentic and Long-Horizon Tasks

Here ‘Agentic’ means the model plans, calls tools, and iterates. It completes multi-step goals, not single questions. ‘Long-horizon’ means that loop runs for extended periods. Google introduced Managed Agents in the Gemini API. One API call spins up a full agent. It reasons, uses tools, and executes code. The environment runs inside an isolated Linux container. Files and state persist across follow-up calls. This enables seamless multi-turn agent sessions.

Previously, managing agent state and environments was manual. The Managed Agents API abstracts that infrastructure entirely.

The Antigravity Ecosystem

Google Antigravity is its agent-first development platform. It takes ideas to production-ready apps. Antigravity 2.0 is a new standalone desktop app. It orchestrates multiple agents running in parallel. Dynamic subagents handle parallelized workflows. Scheduled tasks enable background automation. Integrations cover Google AI Studio, Android, and Firebase.

The Antigravity CLI is for terminal-based developers. It creates agents instantly, without a GUI. Google encourages Gemini CLI users to migrate now. The Antigravity SDK gives programmatic access to the harness. You can define custom agent behaviors with it. Host agents on infrastructure of your choice.

Real-World Enterprise Deployments

According to Google, several enterprise partners are already running 3.5 Flash. Shopify runs subagents in parallel for data analysis. It powers more accurate merchant growth forecasts globally. Macquarie Bank is piloting it for customer onboarding. The model reasons over complex 100+ page documents. It retrieves information and makes reliable recommendations.

Salesforce is integrating 3.5 Flash into Agentforce. It automates enterprise tasks using multiple subagents. Subagents retain context across complex, multi-turn tool calling. Ramp uses it for smarter OCR on invoices. It pairs multimodal understanding with historical pattern reasoning. Xero deploys agents for complex, multi-week workflows. One example is gathering supplier data for 1099 forms. Databricks uses agentic workflows for real-time data monitoring. The model diagnoses issues and proposes fixes for engineers.

Check out the Technical details. Also, feel free to follow us on Twitter and don’t forget to join our 150k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.? Connect with us

The post Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and Coding appeared first on MarkTechPost.