Author Warns of Looming Compute Shortage

Kerman Kohli argues in a May 20, 2026 Substack essay that global compute demand is growing exponentially and risks outpacing supply. Kohli writes that AI labs tend to double compute year-over-year and that infrastructure and application workloads can grow 10-100x every 2-3 years, a pattern he describes as "parabolic." He cites market signals such as Micron earnings trends to illustrate how analysts model flat memory demand beyond 2027, and he notes that SK hynix opened its M15X plant in Cheongju in February 2026 (as reported in his piece). Editorial analysis: Kohli frames these observations as the opening of a "war for compute," and discusses capex and memory dynamics for the semiconductor and data-center ecosystem.
What happened
Kerman Kohli published an essay titled "We're going to run out of compute" on May 20, 2026, arguing that aggregate compute demand is accelerating far faster than commonly modelled. Kohli reports that AI research labs are roughly doubling compute year-over-year and that infrastructure and application workloads can increase 10-100x every 2-3 years, which he describes as a parabolic trajectory. He references Micron's recent earnings cadence and analyst estimates to show expectations of flat memory revenue beyond FY2027, and he notes that SK hynix opened its M15X plant in Cheongju in February 2026 in the context of increased supply-side activity.
Technical details
Per Kohli's essay, the core claim is demand growth rather than a single technical bottleneck; he links the pressure to both large-scale training workloads and rapidly expanding production workloads in applications. The piece does not present new benchmarking or primary measurements; it synthesizes industry anecdotes, market charts, and public corporate milestones.
Editorial analysis
Industry context: Companies and labs that rapidly scale training and serving fleets tend to drive compounding demand for GPUs, memory, interconnect, and power. Observed patterns in comparable periods of hardware scarcity show lead times, bidding pressure, and accelerated capex for fabs and data centers, which can amplify market volatility.
What to watch
Indicators an observer can track include GPU spot pricing and lead times, memory utilization and pricing, fab ramp schedules (for example, new plants like M15X), and public capex guidance from major cloud and chip vendors. Kohli does not provide an operational plan from any company; his essay is a synthesis of trends and market signals.
Scoring Rationale
The essay raises a notable operational risk for ML practitioners: sustained, faster-than-expected compute growth can affect cost and scheduling. The piece is an opinion synthesis rather than new data, so its importance is notable but not industry-shaking.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

