Sunday, April 20, 2025
HomeAIThe first Google TPU for the age of inference

The first Google TPU for the age of inference

Published on

spot_img


Today at Google Cloud Next 25, we’re introducing Ironwood, our seventh-generation Tensor Processing Unit (TPU) — our most performant and scalable custom AI accelerator to date, and the first designed specifically for inference. For more than a decade, TPUs have powered Google’s most demanding AI training and serving workloads, and have enabled our Cloud customers to do the same. Ironwood is our most powerful, capable and energy efficient TPU yet. And it’s purpose-built to power thinking, inferential AI models at scale.

Ironwood represents a significant shift in the development of AI and the infrastructure that powers its progress. It’s a move from responsive AI models that provide real-time information for people to interpret, to models that provide the proactive generation of insights and interpretation. This is what we call the “age of inference” where AI agents will proactively retrieve and generate data to collaboratively deliver insights and answers, not just data.

Ironwood is built to support this next phase of generative AI and its tremendous computational and communication requirements. It scales up to 9,216 liquid cooled chips linked with breakthrough Inter-Chip Interconnect (ICI) networking spanning nearly 10 MW. It is one of several new components of Google Cloud AI Hypercomputer architecture, which optimizes hardware and software together for the most demanding AI workloads. With Ironwood, developers can also leverage Google’s own Pathways software stack to reliably and easily harness the combined computing power of tens of thousands of Ironwood TPUs.

Here’s a closer look at how these innovations work together to take on the most demanding training and serving workloads with unparalleled performance, cost and power efficiency.



Source link

Latest articles

The bonkers high-tech Denza Z9 GT crabwalks and I’ve just driven one

According to its German designer Wolfgang Egger, the Denza Z9 GT has been...

ETtech Explainer: How Trump tariffs hit Nvidia’s China business, and stock

The ongoing trade tensions between the United States and China have escalated once...

Nvidia | Caught in the tech cold war

By the time Nvidia disclosed in a regulatory filing that the U.S. government...

There’s Something Horrifying in Your Toothpaste

Image by Getty / FuturismAlarming new research has found that toothpastes are often...

More like this

The bonkers high-tech Denza Z9 GT crabwalks and I’ve just driven one

According to its German designer Wolfgang Egger, the Denza Z9 GT has been...

ETtech Explainer: How Trump tariffs hit Nvidia’s China business, and stock

The ongoing trade tensions between the United States and China have escalated once...

Nvidia | Caught in the tech cold war

By the time Nvidia disclosed in a regulatory filing that the U.S. government...