Insiders say the future of AI will be smaller and cheaper than you think

AI industry insiders forecast a shift from massive general-purpose LLMs toward smaller, task-specific AI agents that are cheaper to build and can run locally; firms cited include Superhuman, Mozilla and ARM. HSBC’s analysis highlights OpenAI’s large-scale capital challenge—the company claims $20 billion in revenue, has reportedly committed $1.4 trillion to data-center expansion, and even with potential $200 billion-plus revenues by 2030 would still need roughly $207 billion more funding—while investors such as IBM Ventures (a $500 million AI fund) are backing niche enterprise plays like Not Diamond’s model-router. The takeaway for allocators: structural opportunities in fit-for-purpose models and model-routing services, but sizeable financing risk remains for mega-scale LLM strategies.

Analysis

Market structure: Winners are edge/agent enablers (ARM, on-device inference silicon, niche SaaS with vertical data moats) and orchestration/software (model routers, app-store-like agent platforms); losers are scale‑dependent, capex‑heavy LLM plays (large portions of GOOGL/OpenAI economic thesis). If compact models routinely use 10–100x fewer parameters for many tasks, GPU-hour demand for those workloads could drop 30–60% over 24–36 months, shifting pricing power to software IP and domain data owners. Risk assessment: Tail risks include regulatory clamps on data use or antitrust actions that break platform bundling (6–24 months), and a funding shock that forces consolidation of expensive LLM players (12–36 months). Hidden dependency: many “small” agents still route to giant pretrained models/APIs today, creating concentration risk in a few API providers; a failure/price shock at those providers would cascade to agent startups quickly. Trade implications: Prefer long exposure to ARM (edge IP) and IBM (enterprise AI tooling/model-routing) with 6–12 month horizons and tactical hedges against hyperscaler weakness (GOOGL). Consider pair trades (long ARM, short GOOGL) and options: buy-call spreads on ARM or 9–12 month LEAPs; buy put spreads on GOOGL sized to 0.5–1% of portfolio to cap downside while collecting time decay. Contrarian angles: Consensus underappreciates that base LLMs may remain indispensable for pretraining and transfer learning—meaning hyperscalers retain a backstop revenue stream and Nvidia-like hardware winners may be underowned. Also, fragmentation raises integration/security costs that benefit established enterprise vendors (IBM) and could slow pure-play agent monetization, creating opportunities to sell early exuberance and buy durable incumbents.

AllMind

AllMind

Insiders say the future of AI will be smaller and cheaper than you think

Analysis

AllMind AI Terminal

Market Sentiment

Key Decisions for Investors