Market Impact: 0.8

Inside NVIDIA Blackwell Ultra: The Chip Powering the AI Factory Era | NVIDIA Technical Blog

NVDA, TSM
Artificial Intelligence, Technology & Innovation, Product Launches, Company Fundamentals, Cybersecurity & Data Privacy, Infrastructure & Defense

NVIDIA's new Blackwell Ultra GPU uses a dual-reticle design with 208 billion transistors and delivers 15 PetaFLOPS of NVFP4 performance, a 7.5x increase over Hopper GPUs. The architecture integrates 288 GB of HBM3e memory (3.6x that of the H100), enhanced NVLink, and accelerated attention layers to support the training and deployment of multi-trillion-parameter AI models. NVIDIA positions these advances as the basis for "AI factories": higher compute efficiency, lower operating costs, and a wider range of feasible large-scale AI applications, which together change the economics of AI infrastructure.
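As a quick sanity check on the ratios quoted above, the following Python sketch back-computes the Hopper baselines they imply. The variable names are illustrative, and the only inputs are figures stated in the summary; the derived baselines are arithmetic consequences, not numbers quoted from NVIDIA.

```python
# Figures quoted in the summary above.
blackwell_ultra_nvfp4_pflops = 15.0
speedup_over_hopper = 7.5
blackwell_ultra_hbm_gb = 288.0
memory_ratio_vs_h100 = 3.6

# Baselines implied by those ratios (derived here, not quoted from NVIDIA).
implied_hopper_pflops = blackwell_ultra_nvfp4_pflops / speedup_over_hopper
implied_h100_memory_gb = blackwell_ultra_hbm_gb / memory_ratio_vs_h100

print(f"Implied Hopper baseline: {implied_hopper_pflops:.1f} PFLOPS")
print(f"Implied H100 memory: {implied_h100_memory_gb:.0f} GB")
```

The implied memory baseline of about 80 GB matches the H100's capacity. The summary does not say which precision format the Hopper compute baseline uses, so the 7.5x figure should be read as a cross-format comparison rather than a like-for-like one.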

Analysis

NVIDIA's announcement of the Blackwell Ultra GPU details a significant architectural and performance leap over its predecessor, the Hopper series. The new chip, built on TSMC's 4NP process, uses a dual-reticle design to integrate 208 billion transistors and delivers 15 PetaFLOPS of NVFP4 compute, a 7.5x increase over the H100/H200 GPUs. This is complemented by 288 GB of on-package HBM3e memory (3.6x the H100's 80 GB) and a doubling of NVLink interconnect bandwidth to 1.8 TB/s. These enhancements are more than incremental: they are intended to change the economics of AI infrastructure by keeping larger models resident in on-package memory and by scaling multi-trillion-parameter models efficiently across NVLink-connected GPUs, which reduces latency and improves tokens-per-watt efficiency. The NVFP4 precision format and a 2x acceleration in attention-layer processing target key bottlenecks in large language model inference. Critically, the platform maintains full backward compatibility with the CUDA ecosystem, which reinforces NVIDIA's competitive moat by giving its large developer base a low-friction transition and strengthens its position as the foundational provider for production-scale "AI factories".
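The memory figures above can be put in perspective with a rough weight-footprint estimate. The Python sketch below is a back-of-the-envelope calculation under stated assumptions, not NVIDIA's sizing method: it counts weights only (no KV cache, activations, or runtime overhead), treats 1 GB as 10^9 bytes, and assumes 4 bits per weight plus one 8-bit scale factor shared by each block of 16 weights. The function names and the default scale-overhead parameters are hypothetical and can be changed.

```python
import math

HBM_PER_GPU_GB = 288.0  # on-package HBM3e capacity quoted in the article


def weight_footprint_gb(num_params, bits_per_weight=4.0,
                        scale_bits_per_block=8.0, block_size=16):
    """Estimate weight storage in GB (1 GB = 1e9 bytes), weights only.

    The per-block scale overhead is an assumption of this sketch.
    """
    effective_bits = bits_per_weight + scale_bits_per_block / block_size
    return num_params * effective_bits / 8 / 1e9


def min_gpus_for_weights(num_params, hbm_gb=HBM_PER_GPU_GB):
    """Smallest GPU count whose combined HBM holds the weights alone."""
    return math.ceil(weight_footprint_gb(num_params) / hbm_gb)


for params in (300e9, 1e12, 2e12):
    size_gb = weight_footprint_gb(params)
    gpus = min_gpus_for_weights(params)
    print(f"{params / 1e9:.0f}B params: {size_gb:.1f} GB, at least {gpus} GPU(s)")
```

Under these assumptions, a 300-billion-parameter model needs roughly 169 GB and fits in one GPU's memory, while a one-trillion-parameter model needs roughly 563 GB and must span at least two GPUs. That is why the interconnect bandwidth matters alongside the memory capacity for multi-trillion-parameter deployments.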
