Google AI (@GoogleAI) · April 30 · http://x.com/i/article/2049546144930275328
# The Agentic Era: Unveiling Eighth Generation TPUs
A decade in the making, the chips for the agentic era have arrived.
At @GoogleCloud's Next '26 event last week, we unveiled our eighth-generation TPUs (the specialized computer chips we build for AI). These chips were specifically designed to handle the two biggest challenges in AI today: training the AI and serving the AI.
So… what exactly does that mean? Let’s break it down:
## TPU 8t: Training the AI
Before an AI can help you write an email or plan a trip, it has to "learn" from massive amounts of data. In the past, this could take months of expensive computer time.
With TPU 8t, we’ve made that process significantly faster through three key advancements.
- More power: It is roughly 3x more powerful than our previous generation of TPUs.
- More efficiency: We’ve cleared the "traffic jams" that usually slow down AI training. By making data move 10x faster from storage to the chips, we ensure the system is always working at full speed, never sitting idle.
- Optimized scaling: In a system this size, parts eventually fail. TPU 8t is designed to automatically detect and reroute around hardware issues at large scale. This ensures that 97% of the resources are spent on productive work, preventing crashes that used to waste days of training time.
As a result, training that used to take months now takes only weeks, meaning researchers can experiment and innovate at speed.
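To make the 97% figure concrete, here is a minimal back-of-the-envelope sketch. It treats the 97% as a simple "goodput" fraction (productive time divided by total time). The 30-day run length, the 85% comparison value, and the function names are illustrative assumptions, not numbers or APIs from the announcement.

```python
def productive_hours(total_hours: float, goodput: float) -> float:
    """Hours spent on useful training work for a given goodput fraction."""
    if not 0.0 <= goodput <= 1.0:
        raise ValueError("goodput must be between 0 and 1")
    return total_hours * goodput


def lost_hours(total_hours: float, goodput: float) -> float:
    """Hours lost to failures, restarts, and idle time."""
    return total_hours - productive_hours(total_hours, goodput)


if __name__ == "__main__":
    run_hours = 30 * 24  # hypothetical 30-day training run

    # 0.97 is the figure quoted above; 0.85 is an assumed comparison point.
    for goodput in (0.97, 0.85):
        lost = lost_hours(run_hours, goodput)
        print(f"goodput {goodput:.0%}: about {lost:.1f} hours lost")
```

Under these assumptions, a 30-day run at 97% goodput loses under a day to interruptions, while the hypothetical 85% case loses several days.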
## TPU 8i: Serving the AI (Agents)
If the "8t" is for teaching, the "8i" is for doing. We built this chip specifically for "AI agents," the kind of AI that doesn't just chat with you but actually acts for you (for example, booking a flight or managing a calendar).
To take action, an AI needs to "think" and "reason" through multiple steps very quickly, which TPU 8i enables through these advancements:
- Better thinking: We tripled the chip’s internal memory so it can handle more complex logic.
- More cost-effective: It offers 80% better performance for every dollar spent. For a business, that means you can help nearly twice as many customers (about 1.8x) without increasing your tech budget.
- Lower latency: At the chip level, we have integrated a new engine that reduces latency by an additional 5x.
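The cost claim above is simple arithmetic, sketched below. It assumes the number of customers served scales linearly with performance per dollar, which is a simplification. The function names and the 1,000-customer baseline are hypothetical.

```python
def customers_served(baseline_customers: float,
                     perf_per_dollar_gain: float,
                     budget_ratio: float = 1.0) -> float:
    """Customers served after a perf-per-dollar gain.

    perf_per_dollar_gain: 0.80 means 80% better performance per dollar.
    budget_ratio: new budget divided by old budget (1.0 = unchanged).
    """
    return baseline_customers * (1.0 + perf_per_dollar_gain) * budget_ratio


def budget_ratio_needed(target_multiple: float,
                        perf_per_dollar_gain: float) -> float:
    """Budget multiple needed to serve target_multiple times the customers."""
    return target_multiple / (1.0 + perf_per_dollar_gain)


if __name__ == "__main__":
    # Same budget, 80% better performance per dollar.
    print(customers_served(1000, 0.80))
    # Budget multiple needed to serve exactly 2x the customers.
    print(round(budget_ratio_needed(2.0, 0.80), 3))
```

Taken at face value, 80% better performance per dollar gives about 1.8x the capacity on a flat budget, which is close to double. Reaching exactly 2x would take roughly 11% more spend under this linear model.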
## Powering the Next Decade
Whether it's a scientist training a new medical model or a business getting some much-needed customer support help, these chips provide the raw power needed to make that future a reality.