Corporations Rein In AI Usage Over Token Costs

Executives at multiple large companies have begun restricting internal AI usage after sharply higher cloud-model token bills, reporting by The Wall Street Journal and CNET shows. Amazon removed an employee-created AI leaderboard called "KiroRank" and deprecated the beta dashboard, an Amazon spokesperson told CNET. Business Insider reports Uber COO Andrew Macdonald said it is "getting harder to justify" AI spending after internal usage exhausted part of the company's 2026 budget. Fortune, citing The Verge, reports Microsoft has canceled many internal Claude Code subscriptions. Notebookcheck, citing Gizmodo and Axios, reports an anonymous consultant said a client accidentally spent about $500 million in a single month on model usage. Notebookcheck also cites an OpenAI report claiming productivity gains of "an hour a day" and an MIT study of 350 deployments that found 95% failed to turn a profit.
What happened
Several large companies have limited internal AI usage after token-based bills grew substantially. Reporting by The Wall Street Journal summarizes a broader trend of corporate rationing as usage and costs surged. Amazon removed an employee-created AI leaderboard called "KiroRank" and deprecated the beta dashboard, an Amazon spokesperson told CNET, saying the tool was not an approved metric. Business Insider reports Uber COO Andrew Macdonald said it is "getting harder to justify" AI spending after internal usage consumed significant portions of its 2026 token budget. Fortune, citing The Verge, reports Microsoft canceled many internal Claude Code subscriptions for employees. Aggregated reporting in Notebookcheck, citing Gizmodo and Axios, describes an anonymous consultant saying one client spent roughly $500 million in a single month on model usage.
Technical details
Editorial analysis - technical context: Token-based pricing charges for model inputs and outputs, making high-volume or agentic workflows far more expensive than lightweight API calls. Reporting across outlets highlights Claude Code and other agent-focused products as particularly token-intensive. Industry coverage notes that internal leaderboards and gamified metrics raised token consumption without proving commensurate product value in many cases.
Context and significance
Public coverage frames the pullback as a response to cost shock rather than a single-company failure. Notebookcheck cites an OpenAI report that quantified productivity improvements at roughly "an hour a day," while the same aggregation cites an MIT study of 350 public deployments that found 95% did not achieve profitability or performance goals. Reporting by Fortune, CNET, and the WSJ connects those mixed ROI signals to changes in procurement and internal governance: companies are deprecating unvetted dashboards, restricting subscriptions to high-cost models, and re-evaluating how AI usage is measured.
For practitioners
Editorial analysis: Companies and platform teams should expect more scrutiny on per-query costs and model-selection policies. Industry reporting indicates a sharper emphasis on routing queries to cheaper models or cached results and on reducing agentic churn that multiplies token consumption. These developments affect cost modeling for production ML systems, vendor negotiations for model access, and the design of observability tooling that tracks token consumption at function-level granularity rather than raw usage leaderboards.
What to watch
- •Whether large cloud providers or model vendors introduce more granular, cost-optimized routing features or new pricing tiers; reporting to date highlights demand from executives for "smart routers" that choose cheaper models where feasible, a comment attributed to Marc Benioff in Fortune.
- •Corporate procurement moves: follow reporting from WSJ and FT for changes to enterprise contracts, and from outlets like The Verge and CNET for product-level subscription changes such as the Claude Code cancellations.
- •Evidence of sustained ROI: be alert for objective case studies or vendor dashboards that link token spend to measurable business outcomes, since multiple outlets report mixed productivity signals today.
Limitations of the reporting
The sources above report on deprecations, executive comments, and anecdotal overspend. None of the scraped coverage provides a comprehensive, audited dataset of corporate token spending at scale. Several cited figures-such as the $500 million monthly spend-derive from anonymous accounts reported via aggregators; those numbers are significant but not independently verified in the public reporting.
Scoring Rationale
Notable business impact across major tech firms makes this story material for practitioners managing production AI budgets and vendor contracts. It affects procurement and engineering priorities but does not represent a fundamental technological breakthrough.
Practice with real Ride-Hailing data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Ride-Hailing problems


