Big Tech Confronts AI Token Cost Wall

Gizmodo reports several major technology companies are confronting sharply rising costs from AI "token" usage. Per Gizmodo, Amazon told employees, "Please don't use AI just for the sake of using AI," and Uber capped employee token spending at $1,500 per month after exhausting its annual AI budget. Gizmodo reports OpenAI CEO Sam Altman called token usage "a huge issue" at a recent event and cites an April post estimating agents can consume 1,000x more tokens than other systems. Gizmodo also reports GitHub is testing token-based billing and that some developers route work to smaller or bespoke models to cut fees. Independent coverage from TechCrunch and Tom's Hardware corroborates the squeeze, with a Goldman Sachs report cited by Tom's Hardware estimating agents could raise token demand roughly 24-fold.
What happened
Gizmodo reports that multiple large technology firms are reacting to surging AI "token" costs. Per Gizmodo, Amazon told employees, "Please don't use AI just for the sake of using AI," and Uber limited employee AI spending to $1,500 per month after exhausting its annual AI budget. Gizmodo reports OpenAI CEO Sam Altman described token usage as "a huge issue" at a recent event, and cites an April post estimating agents can use 1,000x more tokens than other AI systems. Gizmodo also reports GitHub is trialing token-based billing and that some developers route workloads to smaller or bespoke models, such as Chipotle's customer-service bot Pepper, to reduce costs.
Why agentic workflows raise costs
Two technical drivers compound the problem: agent-style workflows string many model calls together, and per-token billing scales linearly with call volume. Richer multimodal or agentic behavior therefore tends to multiply inference calls and token counts. Independent reporting corroborates the strain: TechCrunch describes an industry scramble to manage runaway AI costs, and Tom's Hardware cites a Goldman Sachs report estimating agents could increase token demand roughly 24-fold, naming Uber and Microsoft among companies feeling the pressure.
Industry response
For product teams and ML engineers, token costs now shape engineering and procurement decisions even when larger models are functionally superior. Common responses include benchmarking cost-per-task, caching, shortening contexts, routing bulk workloads to smaller or open models, and adding billing guardrails. Reporting frames the shift as a commercial constraint that could accelerate interest in alternative pricing and delivery models, including subscription, tiered throughput, and self-hosted stacks.
What to watch
Track pricing experiments from major model providers (per-token versus per-request versus subscription), enterprise billing controls implemented by cloud and platform vendors, and adoption signals for lightweight open models and specialized inference endpoints. Tooling for token accounting, agent orchestration, and client-side caching will materially affect cost-optimization strategies.
Key Points
- 1Token-based pricing is creating measurable adoption friction: per Gizmodo, Uber capped staff AI spend at $1,500/month and Amazon urged employees not to use AI "just for the sake of using AI."
- 2Agentic workflows sharply amplify token consumption - Gizmodo cites a 1,000x estimate and Tom's Hardware a 24x Goldman Sachs figure - pushing teams toward smaller models, caching, and routing.
- 3Cost-per-task engineering (prompt efficiency, model routing, billing controls) is becoming a first-order concern as providers like GitHub test token-based billing.
Scoring Rationale
Rising token costs directly affect day-to-day engineering, model selection, and procurement, and the story is well-corroborated across Gizmodo, TechCrunch, and Tom's Hardware (citing Goldman Sachs). It is a useful, broadly relevant practitioner explainer rather than a new capability or frontier release, supporting a solid mid-range score.
Sources
Public references used for this report.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems