China’s DeepSeek prices new V4 AI model at 97% below OpenAI’s GPT-5.5

DeepSeek cut API prices sharply, taking input cache-hit costs to one-tenth of prior levels and offering an additional 75% discount on V4-Pro through May 5; the minimum input cost is now about US$0.14 per million tokens, while V4-Pro can be as low as US$0.0036 per million input tokens. The move makes DeepSeek-V4 far cheaper than OpenAI’s GPT-5.5 and could intensify price competition across the AI model market. Usage on OpenRouter jumped, with DeepSeek V4-Pro hitting 13.6 billion tokens on April 25, nearly 4x the prior day.

Analysis

This is less about a single model price cut and more about a structural reset in unit economics for agentic workloads. If DeepSeek can undercut incumbents by orders of magnitude while maintaining near-frontier performance on developer-relevant tasks, the competitive moat shifts away from raw model quality and toward distribution, workflow integration, and enterprise trust. That tends to compress margins first at the low end of the market, then forces incumbents to subsidize usage via bundling, which is negative for standalone API monetization across the stack. The second-order winner is the infrastructure layer that can absorb volume growth cheaply, especially domestic compute ecosystems and orchestration platforms. Lower inference prices expand the addressable market for always-on agents, but that also increases token consumption per workflow; the net effect is likely higher aggregate demand for GPUs, networking, and inference software even as per-token prices fall. The near-term loser is any vendor exposed to usage-based pricing with weak differentiation, because procurement teams will now anchor on DeepSeek-level economics in renewal conversations. The market is probably underestimating how fast this becomes an enterprise budgeting issue rather than a model race. In the next 1-2 quarters, the key catalyst is whether other Chinese labs follow with parallel cuts; if they do, it reinforces a deflationary spiral in API pricing and pressures global peers to defend share. The main contrarian risk is that ultra-low prices can be a loss-leader strategy that widens adoption but does not translate into durable pricing power, meaning the real monetization may accrue to the pick-and-shovel layer rather than the model providers themselves.

AllMind

AllMind

China’s DeepSeek prices new V4 AI model at 97% below OpenAI’s GPT-5.5

Analysis

AllMind AI Terminal

Market Sentiment

Key Decisions for Investors