AI Compute Shortage Challenges ‘Bubble’ Narrative
OpenAI, Anthropic constraints and rising GPU prices signal demand is outpacing available compute
Georgina Mackie
WASHINGTON, April 21, 2026 – Artificial intelligence companies are running out of compute capacity as demand surges, contradicting claims of an AI bubble, said Vlad Galabov, senior director at Omdia, Tuesday at Data Center World.
Major AI firms are already hitting limits. OpenAI scaled back its video-generation tool Sora earlier this year to redirect compute to core services as usage spiked.
Token demand, the units of text processed by AI models, jumped from about 6 million tokens per minute in October 2025 to roughly 15 billion by March 2026.
At Anthropic, users of Claude Code reported hitting usage caps faster than expected in April. The company said addressing compute shortages tied to AI agents was a top priority.
Google CEO Sundar Pichai warned as early as November 2025 that compute limits were already constraining growth.
Meanwhile, xAI is scaling aggressively, expanding from more than 200,000 GPUs to 555,000 and targeting 1 million to train its latest models.
GPU prices are rising despite increased supply. The cost of Nvidia compute time has risen about 48 percent in recent months, reflecting persistent shortages.
The “AI bubble” narrative confuses stock market behavior with real infrastructure demand, Galabov said.
“There are two different concepts when I think about the aspect of an AI bubble. On one hand, you have the actual demand for chips. On the other hand, you have the stock market,” he said.
Arguments that more efficient models will reduce compute needs are not holding, Galabov said.
AI workloads are becoming more complex, shifting to multi-step reasoning and agent-driven tasks that require more compute.
AI agents are accelerating demand by triggering repeated model calls for a single task.
“The demand really is there. The demand is not limited by the population of the world. It is limited by how many bots we decide to spin,” Galabov said.
Supply remains limited by infrastructure bottlenecks, including power availability, grid constraints, shortages of memory and other components, and limited semiconductor manufacturing capacity.
“The demand is not high because there is a bubble. The demand is high because we’re using it,” Galabov said.
Companies could generate more revenue if they had access to additional compute, he added.

Member discussion