Agentic AI Is About to Multiply Data Center Power Demand in Ways Nobody Has Modelled

Akash Sharma
28 May 2026

The power models underpinning every major grid plan, capacity commitment, and power purchase agreement signed in the AI infrastructure buildout were built around two workload types: training and inference. Training is predictable, sustained, and concentrated. Inference is bursty, distributed, and latency-sensitive. Every watt-hour projection, every interconnection request, every utility load forecast submitted to a grid operator in 2024 and 2025 was built on some combination of those two demand profiles.

Agentic AI is neither. And it is arriving at scale in 2026 across every major enterprise platform simultaneously.

What Makes Agentic AI Power Demand Fundamentally Different

A chat-based inference workload begins when a user submits a query and ends when the model returns a response. The compute event is discrete, bounded, and paced by human interaction. A user types. The model processes. The user reads. The model waits. The duty cycle of that workload is a small fraction of continuous operation because human interaction speed is the rate-limiting step. The entire era of inference infrastructure planning was built around that rhythm.
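To make the duty-cycle point concrete, here is a minimal sketch in Python. The timings are hypothetical figures chosen for illustration, not measurements from any deployment, and `duty_cycle` is a helper name introduced here.

```python
def duty_cycle(compute_seconds: float, idle_seconds: float) -> float:
    """Fraction of wall-clock time spent computing within one cycle."""
    total = compute_seconds + idle_seconds
    if total <= 0:
        raise ValueError("total cycle time must be positive")
    return compute_seconds / total


# Assumed chat turn: 2 s of model compute, 38 s of the user typing and reading.
chat = duty_cycle(compute_seconds=2.0, idle_seconds=38.0)

# Assumed agent step: 2 s of model compute, 0.5 s waiting on a tool call.
agent = duty_cycle(compute_seconds=2.0, idle_seconds=0.5)

print(f"chat duty cycle:  {chat:.0%}")   # 5%
print(f"agent duty cycle: {agent:.0%}")  # 80%
```

Under these assumed timings the same two seconds of compute occupy 5% of a human-paced cycle but 80% of a machine-paced one, which is the gap between the two rhythms.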

An agentic workload breaks that rhythm entirely. When an enterprise deploys an agent to manage a workflow, the agent does not wait for human prompting. It initiates actions, calls tools, queries databases, spawns sub-agents, evaluates outputs, and loops until it reaches a completion condition. A single agent task can chain dozens of model calls, each generating its own compute event, without any human in the loop between them. S&P Global’s 451 Research found that agentic systems consume significantly more IT capacity than chat-based systems precisely because they break free of human pacing and launch multiple prompts that cascade into other agents simultaneously.
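The loop described above can be sketched as follows. This is an illustrative skeleton only: `run_agent`, `RunStats`, and the step counts are hypothetical, and the model and tool calls are stand-in counters rather than requests to any real framework.

```python
from dataclasses import dataclass


@dataclass
class RunStats:
    model_calls: int = 0
    tool_calls: int = 0


def run_agent(steps_needed: int, max_steps: int = 50) -> RunStats:
    """Loop until a (hypothetical) completion condition is met.

    Each iteration makes one model call to choose the next action and one
    tool call to carry it out. No human input occurs between iterations.
    """
    stats = RunStats()
    completed = 0
    while completed < steps_needed and stats.model_calls < max_steps:
        stats.model_calls += 1  # stand-in for a real model request
        stats.tool_calls += 1   # stand-in for a database query or API call
        completed += 1
    return stats


stats = run_agent(steps_needed=15)
print(stats.model_calls)  # 15
```

The `max_steps` cap is a common safeguard in a loop of this kind: without it, an agent that never reaches its completion condition would keep issuing model calls indefinitely.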

The Compounding Effect Nobody Has Priced Into Infrastructure Plans

The power demand implication of this architecture shift is not linear. It is multiplicative. A single enterprise deploying ten agents running simultaneously, each chaining fifteen model calls per task completion, generates the equivalent compute load of 150 concurrent inference sessions rather than ten. When that enterprise scales from ten agents to a thousand, the compute load scales to the equivalent of 15,000 concurrent inference sessions from a single customer’s deployment.
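The arithmetic behind those figures is a single multiplication. The sketch below reproduces it, assuming (as the comparison above does) that each chained model call costs about as much compute as one chat-style inference request. The function name is introduced here for illustration.

```python
def model_calls_per_round(agents: int, calls_per_task: int) -> int:
    """Model calls generated when every agent completes one task."""
    return agents * calls_per_task


for agents in (10, 1_000):
    total = model_calls_per_round(agents, calls_per_task=15)
    print(f"{agents:>5} agents x 15 calls = {total:,} model calls")
```

With fifteen calls per task, ten agents produce 150 model calls per round of tasks and a thousand agents produce 15,000.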

The infrastructure that was provisioned to serve that enterprise’s inference workload, sized against the assumption that human interaction speed would govern compute consumption, is now being asked to serve a workload that operates at machine speed with no natural pause points. The agentic data center operating at no-human-in-loop pace, as CIO magazine described it in March 2026, requires infrastructure that treats every fan speed, fluid pressure point, and network state as real-time telemetry rather than periodic monitoring. That is a fundamentally different operational and power profile from the inference cluster it is replacing.

The Grid Plans Were Written for a Different Workload

The IEA projects global data center electricity consumption will reach approximately 945 terawatt-hours by 2030. That projection is built on the workload mix that existed when the model was calibrated: predominantly training and inference, with agentic AI as an emerging but unquantified category. A research paper on AI workload power profiles published in April 2026 explicitly noted that agentic AI frameworks are introducing more dynamic resource utilisation behaviours that make load modelling at any scale more difficult. The researchers flagged this as a compounding uncertainty on top of already significant infrastructure planning challenges.

The utility load forecasts that data center operators submitted to support interconnection requests in 2024 and 2025 did not model agentic workloads at scale because agentic workloads at scale did not exist yet. Those forecasts are now the basis for grid investment plans that utilities are executing against. The capacity being built today to serve AI workloads in 2027 and 2028 was sized against demand projections that did not include the workload type that will account for a rapidly growing share of enterprise AI compute by the time that capacity comes online.

The Infrastructure That Agentic AI Actually Needs

The power demand profile of agentic AI is not just higher than inference. It is differently shaped in ways that create specific infrastructure challenges beyond raw capacity. Agentic swarms generate massive east-west traffic between agents negotiating tasks: server-to-server communication that inference clusters were not designed to accommodate at the required bandwidth and latency. Agentic AI creates a power demand profile that nobody designed data centers for, and the problem compounds as agent architectures mature, because each generation of agent frameworks adds more inter-agent communication, more tool calling, and more state persistence than the one before it.
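One way to see why east-west traffic grows faster than agent count is to count the possible agent-to-agent links. The sketch below assumes a full mesh in which any agent may talk to any other. That is an upper bound chosen for illustration, since real agent frameworks often use hierarchies or brokers that need far fewer links.

```python
def full_mesh_links(agents: int) -> int:
    """Distinct agent pairs if every agent can message every other agent."""
    if agents < 0:
        raise ValueError("agent count cannot be negative")
    return agents * (agents - 1) // 2


for agents in (10, 100, 1_000):
    print(f"{agents:>5} agents -> {full_mesh_links(agents):,} possible links")
```

Under the full-mesh assumption, ten agents have 45 possible links and a thousand agents have 499,500, so the number of potential communication paths grows roughly with the square of the agent count.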

The operators building AI infrastructure in 2026 and 2027 who factor agentic workload characteristics into their power provisioning, cooling design, and network architecture now will have assets that serve the workload mix of 2028 and 2029 without expensive retrofits. The operators who are building for the inference and training profiles that dominated the capacity planning conversations of 2024 are building infrastructure whose utilisation assumptions will be tested by the first wave of enterprise agentic deployments at scale, which is already underway and accelerating faster than any of the models predicted.

Akash Sharma

Kiara Mandavia is the Content Manager at Compute Forecast, a publication covering the data centre industry. She brings a background in technology and editorial strategy, with a focus on making complex infrastructure trends accessible and meaningful for industry audiences. Her work explores the business, innovation, and sustainability stories shaping how the world builds and scales its digital foundations. At Compute Forecast, Kiara leads feature stories, industry analysis, and thought leadership content that keeps readers ahead of the curve in a rapidly evolving sector.
