Google has secretly ordered an aggressive infrastructure expansion to double AI serving capacity every six months, targeting roughly a 1,000x increase within four to five years to support inference-heavy models such as Gemini 3 Pro; this directive, presented by infrastructure VP Amin Vahdat, contrasts with CEO Sundar Pichai’s public warnings about “irrationality” in the AI market. The plan relies on vertical integration and custom silicon—most notably the new Ironwood TPU (claimed 10x peak performance vs. v5p and ~2x performance-per-watt vs. the prior generation) and Arm-based Axion CPUs—plus software–hardware co-design to try to deliver exponentially more capability without a proportional rise in cost; Alphabet has lifted 2025 capex to $93 billion with a further increase planned for 2026, amid a broader industry spend of roughly $380 billion. For investors, the move signals Google is betting on a capital-intensive war of attrition to secure inference cost leadership—potentially creating a durable advantage over Nvidia-reliant rivals but exposing the company to margin pressure, energy-usage risks (the Jevons paradox) and the danger that hardware depreciation will outpace short-term monetization if AI revenue growth fails to materialize.
Google has issued an internal mandate to double AI serving capacity every six months, targeting roughly a 1,000-fold increase within four to five years to support inference-heavy models such as Gemini 3 Pro; this directive from Infrastructure VP Amin Vahdat contrasts with CEO Sundar Pichai’s public warnings about “irrationality.” The company has raised 2025 capital expenditures to $93 billion and signaled a significant further increase in 2026, framing the program as an existential bet to avoid underinvesting. Execution relies on vertical integration and custom silicon: the new Ironwood TPU is presented as offering ~10x peak performance versus the v5p and ~2x performance-per-watt versus the prior Trillium generation, while Arm-based Axion CPUs will offload general workloads and software–hardware co-design with DeepMind is intended to squeeze further gains. The strategic shift from training to continuous inference amplifies steady-state compute demand and makes energy, thermal limits, and deployment cadence the binding constraints. The move increases the stakes in a capital-intensive industry where the Big Four are spending roughly $380 billion on infrastructure; Google argues its balance sheet lets it outlast competitors, but risks include the Jevons paradox (efficiency driving higher demand), potential margin compression in a price war, and hardware depreciation outpacing monetization if inference features fail to generate revenue at scale.
AI-powered research, real-time alerts, and portfolio analytics for institutional investors.
Request a DemoOverall Sentiment
mixed
Sentiment Score
0.00
Ticker Sentiment