The AI Infrastructure (2)

This is the second article in the series (the first article).

Planet-Scale Computers: Brewing the Backbone of AI

The internet and cloud services we use daily run on what can best be described as a planet-scale coffee machine. Workloads are distributed across sprawling data center campuses much like streams of coffee pouring into countless mugs. These facilities are purpose-built to house computing, storage, and networking infrastructure. At their core, they deliver the essential “ingredients” both machines and people need: reliable power, cooling, shelter, and security.

And just as a perfect brew depends on balancing water, beans, and heat, a data center depends on balancing power in and heat out. Almost all the input energy eventually turns into waste heat—like steam rising off a hot cup—so managing that thermal load is the barista’s art of data center design.
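
Since nearly all of the IT power draw ends up as heat, the airflow a rack needs follows from the basic heat-transport relation Q = ṁ·cp·ΔT. A back-of-envelope sketch, where the 10 kW rack load and 12 °C air temperature rise are illustrative assumptions, not vendor figures:

```python
# Back-of-envelope: how much air is needed to carry away a rack's heat.
CP_AIR = 1005.0   # J/(kg*K), specific heat of air at room temperature
RHO_AIR = 1.2     # kg/m^3, approximate air density at ~20 C

def airflow_m3_per_s(heat_w: float, delta_t_k: float) -> float:
    """Volumetric airflow needed to remove heat_w watts of heat
    with a delta_t_k temperature rise from inlet to exhaust."""
    mass_flow = heat_w / (CP_AIR * delta_t_k)  # kg/s
    return mass_flow / RHO_AIR                 # m^3/s

# Assumed: a 10 kW rack, air heated by 12 C as it passes through.
flow = airflow_m3_per_s(10_000.0, delta_t_k=12.0)
print(f"{flow:.2f} m^3/s of air per rack")  # roughly 0.69 m^3/s
```

Run the numbers for a denser AI rack (50-100 kW) and the required airflow becomes impractical, which is why the industry is moving toward liquid cooling.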

☕ As I sip, I imagine the cloud from a user’s perspective: a single giant coffee pot that never runs out. Behind the scenes, however, it’s a complex assembly line, grinding, brewing, filtering, and pouring at planetary scale.

Source: Nvidia

From Office Blocks to Hyperscale Espresso Machines

Back in the 1990s, data centers resembled office blocks with oversized HVAC units—think instant coffee dispensers. Fast forward to today, and demand from video streaming, social media, and AI has driven facilities to require 50x the power density of an office. That’s the jump from a drip filter to a high-pressure espresso machine.

This transformation required a complete redesign of cooling: from comfort cooling (like blowing across your mug to cool it) to precision-engineered systems like chilled water loops, direct-to-chip liquid cooling, and airflow sculpting.

The stakes? A single failure can spill coffee all over the counter. Power faults may only affect one cup, but cooling failures? They can burn the whole brew—taking out entire server halls at once.

To measure resilience, engineers rely on frameworks like the Uptime Institute’s Tiers—a kind of roast scale for reliability.

Source: Prasa

The Reliability Roast: Data Center Tiers

  • Tier I – Light Roast (Basic Capacity): 99.671% uptime (~28.8 hrs downtime/year). Simple brew: a grinder (UPS), hot water (backup generator), and one pot. Good enough for small cafés.
  • Tier II – Medium Roast (Redundant Components): 99.741% uptime (~22 hrs downtime/year). Adds backup grinders, pumps, and kettles. If one fails, you can still pour.
  • Tier III – Dark Roast (Concurrently Maintainable): 99.982% uptime (~1.6 hrs downtime/year). Multiple independent brewing lines. You can service one espresso machine while the other still pulls shots—no customer goes without coffee.
  • Tier IV – Reserve Blend (Fault Tolerant): 99.995% uptime (~26 min downtime/year). Full 2N redundancy. Every grinder, kettle, and filter duplicated. Even if one whole station fails, the brew keeps flowing.
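
The downtime figures above are just arithmetic on the availability percentage (the percentages are the Uptime Institute’s published tier values):

```python
# Downtime per year implied by an availability percentage.
HOURS_PER_YEAR = 8760.0  # 365 days

def downtime_hours(availability_pct: float) -> float:
    """Hours of downtime per year at the given availability."""
    return (1.0 - availability_pct / 100.0) * HOURS_PER_YEAR

for tier, pct in [("I", 99.671), ("II", 99.741),
                  ("III", 99.982), ("IV", 99.995)]:
    print(f"Tier {tier:>3}: {downtime_hours(pct):5.1f} h/year")
```

Note how each step up the roast scale cuts downtime by an order of magnitude or so, while roughly doubling the infrastructure you have to buy.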

☕ Another sip. Like coffee choices, your tier depends on taste and tolerance. Small shops may survive with a light roast (Tier I), but global cafés—airlines, SaaS, gaming, AI services, the Copilots and Cursors of the world—need the strongest, darkest roast: Tier III or IV.

Anatomy of an AI Data Center

A modern AI data center isn’t a single machine; it’s a café ecosystem:

  • Mechanical yard – the steam kettles (chillers and cooling towers).
  • Electrical yard – the grinders and power meters (generators and switchgear).
  • Server halls – rows of espresso machines (compute/storage/network racks).
  • Networking zones – the baristas coordinating all the pour-overs (cluster and long-haul connectivity).
  • Maintenance areas – the workbench where broken tools are fixed.

Source: The datacenter as a computer

Take Google’s Council Bluffs campus in Iowa: from above, it looks like a sprawling coffee roastery. Giant “boilers” (substations), “water reservoirs” (cooling tanks), and multiple brewing stations (server halls) all working together to serve billions of daily sips of cloud services.


Power Distribution: Grinding the Beans

Every cup of coffee begins with ground beans. Every data center begins with high-voltage electricity.

  • At the utility substation, high-voltage electricity (110 kV+) is “ground down” to medium voltage (<50 kV).
  • Unit substations step it down further, like adjusting the grind size for the perfect brew.
  • UPS systems act as filters, ensuring no grit or interruptions sneak into the shot.
  • Diesel generators or flywheels are the backup kettles, kicking in if the main boiler runs dry.
  • Finally, PDUs distribute the brew (power) to every cup (rack).
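
The step-down chain above can be sketched as a toy model that tracks how much of a 1 MW utility feed actually reaches the racks. The voltages and per-stage efficiencies here are illustrative assumptions, not figures from any real facility:

```python
# Toy model of the power path from utility feed to rack.
# Each stage: (name, output voltage in V, efficiency) -- all assumed.
STAGES = [
    ("utility substation",      13_800, 0.995),  # HV -> MV transformer
    ("unit substation",            480, 0.99),   # MV -> LV transformer
    ("UPS (double-conversion)",    480, 0.95),   # filtering costs energy
    ("PDU",                        415, 0.99),   # final distribution
]

def delivered_power_kw(feed_kw: float) -> float:
    """Power left for the racks after each stage's conversion losses."""
    p = feed_kw
    for name, volts, eff in STAGES:
        p *= eff
        print(f"{name:>24} ({volts:>6} V): {p:8.1f} kW")
    return p

delivered_power_kw(1000.0)  # a 1 MW feed (assumed)
```

Roughly 7% of the feed is lost before the first server sees a watt—one reason facility designers obsess over UPS topology and transformer efficiency.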

☕ Another sip—because just like espresso, every watt must be precise. Too coarse, and the flavor (or uptime) suffers. Too fine, and the system clogs under heat.

Cooling: Keeping the Brew at the Right Temperature



Coffee too hot burns your tongue. Data centers too hot burn their chips. Cooling is the careful temperature dance:

  • Raised floors: like saucers catching overflow, directing chilled air into the right cups.


  • Chillers: dual chambers—cold and hot—just like keeping milk cold while frothing steam.


  • Cooling towers: the latte art of physics, evaporating water to shed heat gracefully.


  • In-rack cooling: pouring ice directly into your mug—efficient, but risky if it spills.
  • In-row cooling: baristas with fans in between machines, catching steam before it escapes.


  • Cold plates: think of them as custom-designed coffee sleeves, absorbing the burn right at the chip (CPU/GPU) before handing it to the customer.
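
A rough sketch of why cold plates work: water absorbs far more heat per unit of flow than air, so a thin trickle through a cold plate can do the job of a roaring fan. The 1 kW chip load and 10 °C coolant rise below are assumed for illustration:

```python
# Back-of-envelope: water flow needed by a direct-to-chip cold plate.
CP_WATER = 4186.0   # J/(kg*K), specific heat of water
RHO_WATER = 997.0   # kg/m^3, water density at ~25 C

def coolant_flow_lpm(heat_w: float, delta_t_k: float) -> float:
    """Litres per minute of water needed to absorb heat_w watts
    with a delta_t_k temperature rise across the cold plate."""
    mass_flow = heat_w / (CP_WATER * delta_t_k)   # kg/s
    return mass_flow / RHO_WATER * 1000.0 * 60.0  # L/min

# Assumed: a 1 kW chip, coolant warming by 10 C through the plate.
print(f"{coolant_flow_lpm(1000.0, 10.0):.2f} L/min")  # under 1.5 L/min
```

Compare that to the hundreds of litres of air per second the same kilowatt would need, and the industry’s shift to liquid cooling for AI accelerators stops looking optional.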

☕ Each method is like a different brewing technique—drip, French press, espresso. Different flavors, same goal: keep the coffee (or compute) consistent.

Closing Thoughts (and the Last Sip)

We’ve just walked through the logistics of brewing planet-scale coffee: how data centers distribute power, remove heat, and keep uptime high. Like the perfect café, everything must work in harmony—from grinding beans to pouring the final cup.

In the next article of Code to Chips with Coffee, we’ll look at what’s really driving the insatiable caffeine appetite of AI workloads—why these facilities are pulling shots 24/7, and why each new AI model is demanding a bigger cup.

Until then, sip slowly and remember: every chat, stream, or AI training run is really just another cup brewed from the world’s largest coffeehouse. ☕🌍💻


By Vikas Sachdeva
