Back to News
Market Impact: 0.46

The AI Token Pricing Crisis Behind OpenAI and Anthropic’s Revenue Race

NYTUBERMSFTNVDAGOOGLRAMP
Artificial IntelligenceTechnology & InnovationCorporate EarningsCorporate Guidance & OutlookPrivate Markets & VentureIPOs & SPACsCompany FundamentalsFiscal Policy & Budget
The AI Token Pricing Crisis Behind OpenAI and Anthropic’s Revenue Race

OpenAI reportedly generated $5.7 billion of Q1 2026 revenue, but the article argues that Anthropic’s run-rate and growth trajectory are stronger, with Q2 revenue projected at $10.9 billion versus $4.8 billion in Q1. The bigger issue is enterprise token-cost strain: Uber burned its 2026 AI budget in four months, Microsoft is winding down most Claude Code usage, and GitHub is shifting to usage-based billing, all of which could pressure AI demand growth. The piece frames valuation and IPO math as increasingly dependent on whether cheaper infrastructure can catch up to surging usage.

Analysis

The market is starting to price a subtle but important shift: AI demand is intact, but the margin structure of that demand is moving from software-like to utility-like. That favors the platform owner with custom silicon and the cheapest inference path, while pressuring the pure-play model vendors whose growth now depends on customers tolerating volatile, hard-to-predict spend. In the near term, the winners are not necessarily the “best model” providers, but the lowest-cost distributors of acceptable intelligence at scale. The second-order effect is budget gatekeeping. Once enterprise AI spend becomes visible enough to force line-item review, usage-based pricing tends to trigger central procurement, vendor rationalization, and lower-seat utilization even if end users still want the tools. That creates a lagged deceleration risk for lab revenue over the next 1-2 quarters: usage can remain high in pilots while finance teams quietly cap expansion, which is often when revenue surprises peak and subsequent growth inflects down. For infrastructure, the setup is constructive but not linear. Cheaper inference hardware should ultimately expand total token consumption, but the intermediate phase is volatile because every cost-down innovation is met by immediate demand expansion. That means GPU leaders can still work, yet the path is more cyclical than the secular AI narrative implies; incremental pricing power may be limited if hyperscalers and model vendors pass savings through to preserve share. The contrarian miss is that this is less a demand problem than a billing-model problem. If customers move from unrestricted agentic usage to quota-managed workflows, headline adoption metrics can stay strong while monetization mix deteriorates. That is bearish for premium coding-assistant economics and bullish for lower-cost stack winners that can bundle AI inside broader enterprise contracts.