Skip to main content

DCBBS Blueprints for NVIDIA Vera Rubin NVL72

Built to Scale from 5MW to 1GW

  • Complete 1,152-GPU NVIDIA Vera Rubin NVL72 Scalable Unit per 5MW power envelope, multiplied to scale from a single unit to gigawatt-class AI factories
  • 331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, coherently accessible across the NVLink fabric
  • Industry-leading DLC-2 direct liquid cooling, from direct-to-chip cold plates through 1MW cooling towers, sized for 227 kW per rack
  • Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
  • Supporting NVIDIA’s latest reference architecture integrating NVIDIA Context Memory Storage Platform, NVIDIA Spectrum™-X Ethernet, and NVIDIA Quantum-X800 InfiniBand Platform
  • Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management

* Physical GPU memory

End-to-end solution spanning compute, networking, storage, power and cooling for streamlined deployment
Supermicro DCBBS DLC-2 liquid cooling stack with in-rack or in-row CDU, RDHx, and L2A sidecar options
NVIDIA Context Memory Storage Platform and High Performance Storage integrated for long-context and agentic AI workloads
Dedicated Supermicro team manages deployment from site survey through commissioning and ongoing support
Learn More About DCBBS
DCBBS & NVIDIA Vera Rubin NVL72 scalable unit – 1152 GPUs total

DCBBS Blueprints for NVIDIA Vera Rubin NVL4

Native FP64 Performance for Converged HPC and AI Infrastructure

  • Complete 1,152-GPU NVIDIA Vera Rubin NVL4 Scalable, multiplied to scale from a single scalable unit to gigawatt HPC and AI deployments
  • 331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, with over 4 TB of coherent memory per NVL4 node
  • Industry-leading DLC-2 direct liquid cooling sized for 360 kW per rack, cooled by 1.8 MW in-row CDUs in 2+1 redundancy
  • Full-stack single-vendor solution spanning compute, storage, networking, power, cooling, and site infrastructure
  • Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
  • Optimized HPC and AI Compute fabric aligned with NVIDIA's reference architecture, featuring NVIDIA Quantum-X800 InfiniBand
  • Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management

* Physical GPU memory

End-to-end solution spanning compute, networking, storage, power and cooling for streamlined deployment
Supermicro DCBBS DLC-2 liquid cooling stack with in-row CDUs in 2+1 redundancy, sized for 360 kW per rack
FP64 double-precision performance for simulation combined with Rubin-generation Al throughput for HPC-AI convergence
Dedicated Supermicro team manages deployment from site survey through commissioning and ongoing support
Learn More About DCBBS
DCBBS & NVIDIA Vera Rubin NVL4 scalable unit – 1152 GPUs total

NVIDIA Vera Rubin NVL72 SuperCluster

Supermicro is engineering its NVIDIA Vera Rubin NVL72 with new DCBBS liquid-cooling components to fully support the power and thermal envelope at rack and cluster scale. This includes the manufacturing of optimized NVIDIA MGX racks, in-rack or in-row CDU, RDHx and L2A sidecar to streamline production and deployment of the rack-scale AI supercomputer at scale. The Vera Rubin NVL72 operates as a single rack-scale accelerator, unifying six co-designed chips — Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X — to deliver 3.6 Exaflops of inference, 75TB of fast memory, and 1.6 PB/s of HBM4 bandwidth, targeting up to 10x the throughput per watt and one-tenth the token cost compared to NVIDIA Blackwell.

NVIDIA Vera Rubin NVL72 Unifies 72 Rubin GPUs and 36 Vera CPUs in a rack through the latest NVIDIA NVLink-C2C and NVLink 6
Power-efficient Scale-out and Scale-across Connectivity using NVIDIA Quantum-X800 InfiniBand or Spectrum™-X Ethernet Ethernet
Extreme co-design of Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField®-4 and Spectrum-X
Supermicro DCBBS DLC-2 Liquid-cooling Optimized for Supermicro NVIDIA MGX racks, in-rack or in-row CDU, RDHx and L2A sidecar

2U NVIDIA HGX Rubin NVL8 System

The 2U HGX Rubin NVL8 system provides the densest and most flexible HGX platform — and the first HGX platform to offer greater flexibility in CPU selections including NVIDIA Vera CPUs alongside next-generation AMD and Intel x86 processors. Built on the NVIDIA MGX rack architecture with Supermicro’s blind mate busbar and manifold for tool-free rack integration, it gives customers the freedom to pair eight Rubin GPUs with the CPU platform that best fits their workload and software stack.

Up to 9 systems and 72 GPUs in a rack
Supports the new NVIDIA Vera CPUs, and next-Gen x86 CPUs
Extreme co-design of Rubin GPU, NVLink 6, ConnectX-9, BlueField®-4 and Spectrum-X
DLC-2 98+% Heat Capture with DCBBS L2A Sidecar Option

NVIDIA Vera CPU Rack

Supermicro’s air-cooled and 100% direct liquid-cooled rack solutions powered by the NVIDIA Vera CPU — a fully integrated system designed to handle the emerging requirements of agentic AI at scale as well as the most demanding HPC simulation workloads. Air-cooled configurations support up to 64 Vera CPUs in a single 48U rack, while the liquid-cooled configuration contains 256 liquid cooled NVIDIA Vera CPUs with 22,528 cores. With up to 300TB/s of aggregate memory bandwidth, Vera CPU racks are purpose-built to provide predictable throughput required for reinforcement learning and agentic AI workloads, while also being able to expose additional threads when concurrency is required. Engineered as a complete infrastructure stack to maximize density, efficiency, and deployability for both AI and HPC data centers, configurations are supported by Supermicro’s DLC-2 second-generation advanced direct liquid cooling technology for near-total heat capture and rack-level integration, allowing higher core densities per rack than traditional CPUs.

Up to 256 NVIDIA Vera CPUs, 22,528 cores, and 45,056 threads per rack
Up to 300TB/s aggregate memory bandwidth and up to 400TB memory capacity
100% direct liquid-cooling, fully factory-integrated 48U MGX rack
NVIDIA BlueField®-4 DPUs and Spectrum-X™ Ethernet for HPC-scale networking
Resources
Ready to Build the Future of AI?

Contact Supermicro today to design your next-generation AI data center.

Contact Us

Certain products may not be available in your region