Breaking

AI & Machine Learning

Feature

The Blackwell Supply Constraint Is Still the Most Important Variable in AI Infrastructure Delivery

The conversation about AI infrastructure bottlenecks has evolved considerably over the past 18 months. The transformer shortage, the grid interconnection

Akash Sharma
20 May 2026
5 min read
AI & Machine Learning
World

The conversation about AI infrastructure bottlenecks has evolved considerably over the past 18 months. The transformer shortage, the grid interconnection queue, and the construction workforce gap have each received extensive analysis as physical constraints binding the AI buildout. The original constraint, the one that defined 2023 and 2024, has received proportionally less attention in 2026 because Blackwell hardware is now shipping at volume and because hyperscaler AI infrastructure spending is projected to exceed $600 billion this year.

That appearance is misleading. The supply constraint has not disappeared. Instead, it has migrated up the hardware generation stack from H100 to GB200 to GB300, while TSMC’s CoWoS advanced packaging capacity remains oversubscribed through at least 2026. DigiTimes reporting on TSMC CoWoS capacity constraints The infrastructure buildout that hyperscalers are committing to has not outrun the supply constraint. It has expanded fast enough that the supply constraint continues chasing it.

Why CoWoS Capacity Still Controls the Market

CoWoS, or Chip-on-Wafer-on-Substrate, is the advanced packaging technology that enables high-bandwidth memory to sit directly adjacent to GPU logic on the same substrate, providing the memory bandwidth that Blackwell’s compute throughput requires. Without CoWoS packaging, Nvidia cannot assemble the Blackwell GPU modules that data center operators need. Without HBM3e memory stacked in those modules, the assembled hardware cannot deliver the performance that frontier AI training and inference workloads require.

Nvidia has reportedly booked over 50% of TSMC’s projected CoWoS capacity for 2026, with an estimated 800,000 to 850,000 wafers reserved, ensuring that while competitors scramble for remaining slots, Nvidia maintains priority access to the packaging capacity the AI market leader requires. That reservation leaves approximately 40 to 50% of CoWoS capacity for AMD’s MI series, Intel’s Gaudi platform, Google’s TPU packaging, and every other AI accelerator that depends on the same packaging technology. The competition for that remaining capacity is intense, and the outcome determines which non-Nvidia AI hardware platforms can scale in 2026.

Why the Constraint Is Structural

The Blackwell supply constraint is not a temporary disruption that resolves when TSMC brings its next CoWoS capacity increment online. It is a structural feature of a market where demand for the most capable AI hardware consistently exceeds the supply of the specialised packaging, memory, and wafer capacity needed to produce it, and where Nvidia’s dominant market position gives it first claim on whatever supply becomes available. The infrastructure market that plans around the assumption that Blackwell constraints will ease before the next hardware generation creates its own constraints is planning on a timeline that Nvidia’s product roadmap does not support. The operators who have built their procurement strategies around perpetual supply tightness at the frontier are the ones whose infrastructure programmes will deliver on schedule.

The HBM3e Bottleneck That Defines the Memory Stack

High-bandwidth memory is the second binding constraint in the Blackwell supply chain, operating in parallel with the CoWoS packaging constraint rather than in sequence with it. Micron’s high-bandwidth memory capacity sold out through calendar year 2026, and SK Hynix and Samsung are similarly fully allocated, with HBM supply fully committed through 2026 including HBM3e. The memory allocation is not simply a matter of production volume. It is a matter of stacking yield and qualification complexity. GB200 NVL72 racks require 192 gigabytes of HBM3e per GPU.

GB300 Ultra racks increase that to 288 gigabytes per GPU. SK Hynix, which has secured primary supplier status for Nvidia’s Blackwell Ultra and Rubin platforms, partners with TSMC’s 12nm process for the logic base die of its HBM4 architecture, creating a supply chain where Nvidia’s memory supply depends on the same TSMC capacity that its logic and packaging depend on, concentrating the constraint risk at a single foundry.

The practical consequence for data center operators is a delivery timeline for GB300 Ultra rack deployments that is governed by memory allocation at SK Hynix and Samsung as much as by GPU logic production at TSMC. An operator who has secured a commitment for GB300 Ultra racks in Q3 2026 is holding a delivery commitment that depends on three separate supply chains, each individually constrained, all converging on a single product integration at Nvidia’s assembly partners. Nvidia’s own management has indicated that demand will outrun supply until late 2026, with partners committed to expanding CoWoS lanes and memory suppliers pledging fresh HBM3e capacity, but the ramp timelines remain tight. That characterisation is a statement about a supply constraint that will ease but has not yet eased, and the delivery timelines for the AI infrastructure being planned and financed in mid-2026 reflect that persistent tightness.

The Competitive Dynamic That Blackwell Dominance Creates

Nvidia’s ability to lock over half of TSMC’s CoWoS capacity through 2027 creates a specific competitive dynamic that extends beyond Nvidia’s own market position. The alternative AI accelerator companies that are attempting to compete with Nvidia, AMD, Intel, and the hyperscaler custom silicon programmes, are all competing for access to the same packaging and memory supply that Nvidia has first call on. When AMD needs CoWoS capacity for its MI350 series or Intel needs it for Gaudi 3, they are bidding for the roughly 40 to 50% of capacity that Nvidia has not already reserved. That constraint is structural at the current scale of AI infrastructure deployment and cannot be resolved by AMD or Intel committing more capital to their product roadmaps, because the bottleneck is at TSMC’s packaging lines, not in AMD’s or Intel’s design capabilities.

Why Alternative AI Hardware Still Faces Supply Constraints

The CoWoS constraint also affects the Google-Blackstone TPU cloud venture that launched yesterday, May 19. Google’s TPU hardware uses a different packaging architecture than Nvidia’s CoWoS-based approach, using its own advanced packaging developed in partnership with TSMC through the SoIC stacking technology. The TPU architecture’s different packaging requirements give the Google-Blackstone venture a potential supply chain advantage if CoWoS constraints persist, because the venture’s hardware supply is not directly competing with Nvidia for the same constrained packaging capacity. Whether that packaging architecture advantage translates into a competitive supply chain position depends on whether Google can secure adequate SoIC capacity and HBM supply for its TPU scale targets, which faces its own allocation competition at TSMC and the memory manufacturers.

The supply chain dynamics that govern Blackwell delivery also govern the delivery of every competing AI accelerator architecture, and the operators evaluating alternative silicon commitments need to assess supply chain viability with the same rigour they apply to performance benchmarks — a dynamic we examined in depth in our analysis of why the Google-Blackstone TPU venture is the most direct challenge to Nvidia’s infrastructure dominance yet.

The GB300 Ultra Transition That Adds Complexity

The transition from GB200 to GB300 Ultra, which is underway in 2026, adds a specific layer of complexity to the supply constraint that operators planning AI data center deployments need to incorporate into their infrastructure roadmaps. GB300 Ultra delivers materially better inference performance per rack than GB200 and is already becoming the default specification for new hyperscaler AI compute deployments. Data center racks that once operated at 30 to 40 kilowatts now operate in the hundreds of kilowatts, with designs approaching the megawatt range because tightly coupled Blackwell clusters require synchronised power delivery and cooling. A GB300 Ultra rack operating at 1 megawatt per rack requires substantially different power delivery, cooling, and structural engineering than a GB200 rack operating at 120 kilowatts per rack, forcing facilities built to GB200 specifications to confront design compatibility challenges when customers want to deploy GB300 Ultra.

Hardware Obsolescence Is Accelerating

The transition also affects the secondary market for GB200 hardware. Operators who deployed GB200 racks in 2025 and early 2026 at significant capital cost are watching those assets transition to secondary status faster than their depreciation schedules assumed, creating the same hardware obsolescence dynamic that the H100 to Blackwell transition created for the operators who were earliest to deploy at scale. The private credit funds that financed GB200 deployments on 24-month depreciation assumptions are managing collateral whose commercial premium is eroding at the pace of Nvidia’s hardware roadmap rather than at the pace of their depreciation models— a risk our analysis of the private credit bet on GPU infrastructure examines in full.

The supply chain complexity that has sustained Blackwell constraints through 2026 is the same complexity that will sustain Rubin constraints when Nvidia’s next architecture generation begins ramping in 2026 and 2027. The operators and investors who built their plans on the assumption that Blackwell supply constraints were temporary are discovering that the constraint is structural, not cyclical, because Nvidia’s product roadmap will always be generating demand that exceeds supply capability for the newest and most capable hardware generation.

Topics

Akash Sharma

Kiara Mandavia is the Content Manager at Compute Forecast, a publication covering the data centre industry. She brings a background in technology and editorial strategy, with a focus on making complex infrastructure trends accessible and meaningful for industry audiences. Her work explores the business, innovation, and sustainability stories shaping how the world builds and scales its digital foundations. At Compute Forecast, Kiara leads feature stories, industry analysis, and thought leadership content that keeps readers ahead of the curve in a rapidly evolving sector.

[simple-author-box]

COMPUTE WEEKLY

The briefing that 40,000+ tech leaders read every Monday. Sharp, fast, essential.

Download Now

Building an AI Startup Without Owning GPUs

Not owning GPUs has become the default, deliberate strategy for building an AI company — not a compromise founders accept reluctantly. H100 rental rates fell 64-75% in fifteen months, a dense ecosystem of neoclouds and inference-as-a-service providers now lets startups skip infrastructure entirely, and credit programs can fund a company’s first year before a founder writes a check

Cerebras Systems

AI & Machine Learning

The chip that makes Nvidia nervous. Cerebras’ Wafer Scale Engine is rewriting the rules of AI inference at scale.

Faster

0 x

YoY Revenue

0 x

Transistors

0 T

Market Pulse

NVDA

$924.60

-2.11%

MSFT

$421.30

-2.94%

AMZN

$192.80

-4.87%

AMD

$924.60

-2.40%

TSMC

$924.60

-2.32%

Indicative only · Not financial advice

Upcoming Events

SEP

The AI Infrastructure Race (India)

WEBINAR · ONLINE

The AI Infrastructure Race: Won on Power, Land and Trust — Not Capital

MAY

AI Infrastructure Summit

DUBAI · IN PERSON

MEA’s premier AI infrastructure event.

JUN

0 0

Compute Forecast Summit

SINGAPORE · IN PERSON

Our flagship APAC event. Early bird open.

Latest Moves

Live

Ecolab Deepens Cooling Strategy With $4.75B CoolIT Acquisition

Ecolab is making one of its biggest moves yet into AI infrastructure after completing its $4.75 billion acquisition of liquid cooling specialist CoolIT Systems

Pure DC and AVK Deploy Europe’s First 110 MW Data Center Microgrid in Dublin

The Pure DC Dublin microgrid has made history as Europe’s first large-scale on-site data center microgrid, launched in partnership with power solutions provider AVK at Pure DC’s campus in Ireland.

Pace Digitek Partners With MEGMEET to Expand AI Data Center Power Business

India’s AI infrastructure ecosystem continues to mature as domestic technology manufacturers move beyond traditional telecommunications and industrial markets toward high-growth digital infrastructure opportunities

Follow Compute Forecast

11K followers

1200 followers

Companies to Watch

CoreWeave

Neo Cloud · $19B · IPO Watch

Cerebras Systems

AI Hardware · $4.25B · Pre-IPO

G42

G42

Sovereign AI · Abu Dhabi

Humain

Saudi AI · $40B Fund

Latest Podcast

EP . 041

AI Capex, Cloud Margins & the Nuclear Bet

48 MIN · 25 APR 2026

Breaking

AI & Machine Learning

Feature

The Blackwell Supply Constraint Is Still the Most Important Variable in AI Infrastructure Delivery

The conversation about AI infrastructure bottlenecks has evolved considerably over the past 18 months. The transformer shortage, the grid interconnection

Akash Sharma
20 May 2026
5 min read

847 SHARES

0
SHARES

Topics

[simple-author-box]

COMPUTE WEEKLY

The briefing that 40,000+ tech leaders read every Monday. Sharp, fast, essential.

Free Report

Global AI Infrastructure Outlook 2026

The briefing that 40,000+ tech leaders read every Monday. Sharp, fast, essential.

Download Free

Cerebras Systems

AI & Machine Learning

The chip that makes Nvidia nervous. Cerebras’ Wafer Scale Engine is rewriting the rules of AI inference at scale.

Faster

0 x

YoY Revenue

0 x

Transistors

0 T

Market Pulse

NVDA

$924.60

+2.4%

MSFT

$421.30

+1.1%

AMZN

$192.80

-0.6%

NVDA

$924.60

+2.4%

NVDA

$924.60

+2.4%

Indicative only · Not financial advice

Upcoming Events

MAY

0 0

DCD Global — London

LONDON · IN PERSON

World’s largest DC event. CF is media partner.

MAY

AI Infrastructure Summit

DUBAI · IN PERSON

MEA’s premier AI infrastructure event.

JUN

0 0

Compute Forecast Summit

SINGAPORE · IN PERSON

Our flagship APAC event. Early bird open.

Latest Moves

Live

Sam Altman

OpenAI appoints new Chief Infrastructure Officer to lead $100B DC programme

27 APR · OPENAI

Sam Altman

OpenAI appoints new Chief Infrastructure Officer to lead $100B DC programme

27 APR · OPENAI

Sam Altman

OpenAI appoints new Chief Infrastructure Officer to lead $100B DC programme

27 APR · OPENAI

Follow Compute Forecast

18.4K followers

12.1K followers

9.3K subscribers

41 episodes

Companies to Watch

CoreWeave

Neo Cloud · $19B · IPO Watch

Cerebras Systems

AI Hardware · $4.25B · Pre-IPO

G42

G42

Sovereign AI · Abu Dhabi

Humain

Saudi AI · $40B Fund

Latest Podcast

EP . 041

AI Capex, Cloud Margins & the Nuclear Bet

48 MIN · 25 APR 2026

The Blackwell Supply Constraint Is Still the Most Important Variable in AI Infrastructure Delivery

Why CoWoS Capacity Still Controls the Market

Why the Constraint Is Structural

The HBM3e Bottleneck That Defines the Memory Stack

The Competitive Dynamic That Blackwell Dominance Creates

Why Alternative AI Hardware Still Faces Supply Constraints

The GB300 Ultra Transition That Adds Complexity

Hardware Obsolescence Is Accelerating

More from AI Infrastructure

COMPUTE WEEKLY

Building an AI Startup Without Owning GPUs

Cerebras Systems

$924.60

$421.30

$192.80

$924.60

$924.60

The Blackwell Supply Constraint Is Still the Most Important Variable in AI Infrastructure Delivery

More from AI Infrastructure

COMPUTE WEEKLY

Global AI Infrastructure Outlook 2026

Cerebras Systems

$924.60

$421.30

$192.80

$924.60

$924.60