DCBBS Blueprints for NVIDIA Vera Rubin NVL72
Built to Scale from 5MW to 1GW
- Complete 1,152-GPU NVIDIA Vera Rubin NVL72 Scalable Unit per 5MW power envelope, multiplied to scale from a single unit to gigawatt-class AI factories
- 331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, coherently accessible across the NVLink fabric
- Industry-leading DLC-2 direct liquid cooling, from direct-to-chip cold plates through 1MW cooling towers, sized for 227 kW per rack
- Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
- Supporting NVIDIA’s latest reference architecture integrating NVIDIA Context Memory Storage Platform, NVIDIA Spectrum™-X Ethernet, and NVIDIA Quantum-X800 InfiniBand Platform
- Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management
* Physical GPU memory

DCBBS Blueprints for NVIDIA Vera Rubin NVL4
Native FP64 Performance for Converged HPC and AI Infrastructure
- Complete 1,152-GPU NVIDIA Vera Rubin NVL4 Scalable, multiplied to scale from a single scalable unit to gigawatt HPC and AI deployments
- 331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, with over 4 TB of coherent memory per NVL4 node
- Industry-leading DLC-2 direct liquid cooling sized for 360 kW per rack, cooled by 1.8 MW in-row CDUs in 2+1 redundancy
- Full-stack single-vendor solution spanning compute, storage, networking, power, cooling, and site infrastructure
- Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
- Optimized HPC and AI Compute fabric aligned with NVIDIA's reference architecture, featuring NVIDIA Quantum-X800 InfiniBand
- Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management
* Physical GPU memory

NVIDIA Vera Rubin NVL72 SuperCluster
Supermicro is engineering its NVIDIA Vera Rubin NVL72 with new DCBBS liquid-cooling components to fully support the power and thermal envelope at rack and cluster scale. This includes the manufacturing of optimized NVIDIA MGX racks, in-rack or in-row CDU, RDHx and L2A sidecar to streamline production and deployment of the rack-scale AI supercomputer at scale. The Vera Rubin NVL72 operates as a single rack-scale accelerator, unifying six co-designed chips — Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X — to deliver 3.6 Exaflops of inference, 75TB of fast memory, and 1.6 PB/s of HBM4 bandwidth, targeting up to 10x the throughput per watt and one-tenth the token cost compared to NVIDIA Blackwell.

2U NVIDIA HGX Rubin NVL8 System
The 2U HGX Rubin NVL8 system provides the densest and most flexible HGX platform — and the first HGX platform to offer greater flexibility in CPU selections including NVIDIA Vera CPUs alongside next-generation AMD and Intel x86 processors. Built on the NVIDIA MGX rack architecture with Supermicro’s blind mate busbar and manifold for tool-free rack integration, it gives customers the freedom to pair eight Rubin GPUs with the CPU platform that best fits their workload and software stack.
2U Liquid-cooled System
For NVIDIA HGX Rubin NVL8

Front I/O Liquid-cooled system designed for the NVIDIA MGX rack architecture with Supermicro’s blindmate busbar

NVIDIA Vera CPU Rack
Supermicro’s air-cooled and 100% direct liquid-cooled rack solutions powered by the NVIDIA Vera CPU — a fully integrated system designed to handle the emerging requirements of agentic AI at scale as well as the most demanding HPC simulation workloads. Air-cooled configurations support up to 64 Vera CPUs in a single 48U rack, while the liquid-cooled configuration contains 256 liquid cooled NVIDIA Vera CPUs with 22,528 cores. With up to 300TB/s of aggregate memory bandwidth, Vera CPU racks are purpose-built to provide predictable throughput required for reinforcement learning and agentic AI workloads, while also being able to expose additional threads when concurrency is required. Engineered as a complete infrastructure stack to maximize density, efficiency, and deployability for both AI and HPC data centers, configurations are supported by Supermicro’s DLC-2 second-generation advanced direct liquid cooling technology for near-total heat capture and rack-level integration, allowing higher core densities per rack than traditional CPUs.

Contact Supermicro today to design your next-generation AI data center.