Google has introduced its eighth-generation TPUs, taking a dual-chip approach for the first time with specialized architectures for training (TPU 8T) and inference (TPU 8I). The TPU 8T is optimized for large-scale pre-training and delivers nearly three times the raw computing power of the previous generation. With JAX and Pathways, Google's training is no longer limited to a single data center: it can be distributed across multiple sites and scaled across more than 1 million TPUs globally, which Google says gives it the ability to create the largest training cluster in the world. Both chips are also more energy efficient, delivering up to two times better performance per watt.
Deep Dive
Google Introduces its 8th-Gen AI Chip, the TPU 8T
And we've been investing for today and for the future.
In 2022, we were spending $31 billion annually in CapEx.
This year, we expect that number to be about six times that.
Approximately 180 to 190 billion dollars.
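As a quick sanity check on the figures above, the arithmetic can be sketched as follows. The variable names are illustrative only; the inputs are the numbers quoted in the talk.

```python
# Figures as stated in the talk (billions of US dollars).
capex_2022 = 31          # annual CapEx in 2022
multiplier = 6           # "about six times that"
stated_range = (180, 190)

# Projected CapEx implied by the multiplier.
projected = capex_2022 * multiplier

# Check that the implied figure falls inside the quoted range.
in_range = stated_range[0] <= projected <= stated_range[1]
print(f"Implied CapEx: ${projected}B, within stated range: {in_range}")
```

The multiplier and the quoted range agree with each other: six times the 2022 figure lands inside the 180 to 190 billion window.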
A key part of this investment is our custom silicon.
A decade ago, we announced our very first commercial tensor processing unit or TPU on this I/O stage.
Since then, we have transformed how the industry builds for AI.
We recently announced our eighth generation of TPUs at Cloud Next.
For the first time, we have taken a dual chip approach with specialized architectures for training and inference, TPU 8T and 8I.
While they may look similar, they're actually pretty different.
8T is optimized for large-scale pre-training and it's nearly three times the raw computing power of our previous generation.
We have taken a fundamentally different approach with our training infrastructure.
With JAX and Pathways, our training is no longer constrained by the limits of a single massive data center.
Instead, we can now seamlessly distribute training across multiple sites, scaling across more than 1 million TPUs globally.
This gives us the ability to create the largest training cluster in the world.
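To make the idea of distributing training across devices more concrete, here is a minimal sketch of data-parallel training in JAX. It is not Google's training setup and does not use Pathways or model multi-site orchestration: it only assumes a single process and shards a toy linear-regression batch across whatever devices `jax.devices()` reports (possibly just one CPU). The model, data, and names such as `train_step` are hypothetical.

```python
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Build a 1-D device mesh over all locally visible devices (assumption:
# a single process; real multi-site setups are far more involved).
devices = np.array(jax.devices())
mesh = Mesh(devices, axis_names=("data",))

# Shard the batch along the "data" axis; replicate the parameters.
batch_sharding = NamedSharding(mesh, P("data"))
replicated = NamedSharding(mesh, P())

def loss_fn(w, x, y):
    # Mean squared error of a toy linear model.
    return jnp.mean((x @ w - y) ** 2)

@jax.jit
def train_step(w, x, y):
    # One gradient-descent step; the compiler inserts any cross-device
    # communication needed to average over the sharded batch.
    loss, grad = jax.value_and_grad(loss_fn)(w, x, y)
    return w - 0.1 * grad, loss

# Toy data: batch size is a multiple of the device count so it shards evenly.
n = 8 * len(devices)
x = jax.random.normal(jax.random.PRNGKey(0), (n, 4))
y = x @ jnp.array([1.0, -2.0, 0.5, 3.0])

x = jax.device_put(x, batch_sharding)
y = jax.device_put(y, batch_sharding)
w = jax.device_put(jnp.zeros(4), replicated)

w, first_loss = train_step(w, x, y)
for _ in range(200):
    w, loss = train_step(w, x, y)

print("first loss:", float(first_loss), "final loss:", float(loss))
```

The same `train_step` runs unchanged whether one device or many are available; only the mesh changes. That separation between the program and the device layout is the property the talk attributes, at much larger scale, to JAX and Pathways.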
To give you a live sense of what the speed feels like, here's a prompt on an upcoming Flash model as if it were running on 8I.
I'll ask it to create a Chrome Dino game, push submit. The response is generated in real time.
As you watch, take a look at the tokens per second in the top right corner. It almost took longer to write out the request, and the game is pretty fun, too.
>> [applause] >> In addition to speed, we're also thinking about scaling sustainably.
Both chips are more energy efficient, delivering up to two times better performance per watt. I bet Timmy TPU will be ready to teraflop right into bed after I/O.











