Supermicro Solutions Featuring NVIDIA Vera Rubin

DCBBS Blueprints for NVIDIA Vera Rubin NVL72

Built to Scale from 5MW to 1GW

Complete 1,152-GPU NVIDIA Vera Rubin NVL72 Scalable Unit per 5MW power envelope, multiplied to scale from a single unit to gigawatt-class AI factories
331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, coherently accessible across the NVLink fabric
Industry-leading DLC-2 direct liquid cooling, from direct-to-chip cold plates through 1MW cooling towers, sized for 227 kW per rack
Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
Supporting NVIDIA’s latest reference architecture integrating NVIDIA Context Memory Storage Platform, NVIDIA Spectrum™-X Ethernet, and NVIDIA Quantum-X800 InfiniBand Platform
Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management

* Physical GPU memory
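
The scaling arithmetic behind the figures above can be sketched in a few lines of Python. This is an illustrative calculation only: the `ScalableUnit` class and the helper names are hypothetical, the per-unit values are copied from the list above, and the sketch assumes that each Scalable Unit uses its full 5MW envelope and that units scale linearly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalableUnit:
    """Per-unit figures quoted above for the Vera Rubin NVL72 Scalable Unit."""
    gpus: int = 1152        # GPUs per Scalable Unit
    power_mw: int = 5       # power envelope per Scalable Unit, in MW
    hbm4_tb: int = 331      # physical HBM4 GPU memory, in TB
    lpddr5x_tb: int = 864   # LPDDR5X CPU memory, in TB


def units_for_power(site_mw: int, unit: ScalableUnit = ScalableUnit()) -> int:
    """Whole Scalable Units that fit in a site power budget (assumes linear scaling)."""
    return site_mw // unit.power_mw


def site_totals(site_mw: int, unit: ScalableUnit = ScalableUnit()) -> dict:
    """Aggregate GPU count and memory for a site of the given size."""
    n = units_for_power(site_mw, unit)
    return {
        "units": n,
        "gpus": n * unit.gpus,
        "hbm4_tb": n * unit.hbm4_tb,
        "lpddr5x_tb": n * unit.lpddr5x_tb,
    }


if __name__ == "__main__":
    for mw in (5, 100, 1000):  # one unit, a mid-size site, and 1GW
        print(mw, "MW ->", site_totals(mw))
```

Under these assumptions, a 1GW site works out to 200 Scalable Units and 230,400 GPUs.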

End-to-end solution spanning compute, networking, storage, power and cooling for streamlined deployment

Supermicro DCBBS DLC-2 liquid cooling stack with in-rack or in-row CDU, RDHx, and L2A sidecar options

NVIDIA Context Memory Storage Platform and High Performance Storage integrated for long-context and agentic AI workloads

Dedicated Supermicro team manages deployment from site survey through commissioning and ongoing support

Learn More About DCBBS

DCBBS & NVIDIA Vera Rubin NVL72 scalable unit – 1152 GPUs total

DCBBS Blueprints for NVIDIA Vera Rubin NVL4

Native FP64 Performance for Converged HPC and AI Infrastructure

Complete 1,152-GPU NVIDIA Vera Rubin NVL4 Scalable Unit, multiplied to scale from a single Scalable Unit to gigawatt HPC and AI deployments
331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, with over 4 TB of coherent memory per NVL4 node
Industry-leading DLC-2 direct liquid cooling sized for 360 kW per rack, cooled by 1.8 MW in-row CDUs in 2+1 redundancy
Full-stack single-vendor solution spanning compute, storage, networking, power, cooling, and site infrastructure
Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
Optimized HPC and AI Compute fabric aligned with NVIDIA's reference architecture, featuring NVIDIA Quantum-X800 InfiniBand
Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management

* Physical GPU memory
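
As a rough illustration of the cooling figures above, the sketch below estimates how many 360 kW racks one group of 1.8 MW in-row CDUs could serve. It rests on assumptions that are not stated on this page: that "2+1" means two active CDUs plus one standby, that the full rack load is rejected to liquid, and that no derating margin applies. The function names are hypothetical.

```python
def racks_per_cdu_group(rack_kw: float = 360.0,
                        cdu_kw: float = 1800.0,
                        active_cdus: int = 2) -> int:
    """Racks that the active CDUs alone can cool (standby unit not counted)."""
    usable_kw = cdu_kw * active_cdus
    return int(usable_kw // rack_kw)


def survives_single_cdu_failure(racks: int,
                                rack_kw: float = 360.0,
                                cdu_kw: float = 1800.0,
                                total_cdus: int = 3) -> bool:
    """True if the remaining CDUs still cover the heat load after one CDU fails."""
    remaining_kw = (total_cdus - 1) * cdu_kw
    return racks * rack_kw <= remaining_kw


if __name__ == "__main__":
    n = racks_per_cdu_group()
    print("racks per 2+1 CDU group:", n)
    print("tolerates one CDU failure:", survives_single_cdu_failure(n))
```

With these inputs, a 2+1 group covers 10 racks and still carries the load if any one CDU is lost. Actual sizing depends on site design and should be confirmed with Supermicro.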

End-to-end solution spanning compute, networking, storage, power and cooling for streamlined deployment

Supermicro DCBBS DLC-2 liquid cooling stack with in-row CDUs in 2+1 redundancy, sized for 360 kW per rack

FP64 double-precision performance for simulation combined with Rubin-generation AI throughput for HPC-AI convergence

Dedicated Supermicro team manages deployment from site survey through commissioning and ongoing support

DCBBS & NVIDIA Vera Rubin NVL4 scalable unit – 1152 GPUs total

NVIDIA Vera Rubin NVL72 SuperCluster

Supermicro is engineering its NVIDIA Vera Rubin NVL72 with new DCBBS liquid-cooling components to fully support the power and thermal envelope at rack and cluster scale. This includes manufacturing optimized NVIDIA MGX racks, in-rack or in-row CDUs, RDHx, and L2A sidecars to streamline production and deployment of the rack-scale AI supercomputer. The Vera Rubin NVL72 operates as a single rack-scale accelerator that unifies six co-designed chips (Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X) to deliver 3.6 exaflops of inference performance, 75TB of fast memory, and 1.6 PB/s of HBM4 bandwidth. It targets up to 10x the throughput per watt and one-tenth the token cost of NVIDIA Blackwell.

NVIDIA Vera Rubin NVL72 Unifies 72 Rubin GPUs and 36 Vera CPUs in a rack through the latest NVIDIA NVLink-C2C and NVLink 6

Power-efficient Scale-out and Scale-across Connectivity using NVIDIA Quantum-X800 InfiniBand or Spectrum™-X Ethernet

Extreme co-design of Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField®-4 and Spectrum-X

Supermicro DCBBS DLC-2 Liquid-cooling Optimized for Supermicro NVIDIA MGX racks, in-rack or in-row CDU, RDHx and L2A sidecar
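
The rack-level figures quoted above can be divided down to rough per-GPU values. The sketch below is simple arithmetic on the numbers on this page (72 GPUs, 3.6 exaflops of inference, 1.6 PB/s of HBM4 bandwidth). The dictionary and function names are hypothetical, and the results are derived averages rather than published per-GPU specifications.

```python
# Rack-level figures for the Vera Rubin NVL72 as quoted on this page.
NVL72_RACK = {
    "gpus": 72,
    "inference_exaflops": 3.6,
    "hbm4_bandwidth_pb_s": 1.6,
}


def per_gpu_averages(rack: dict = NVL72_RACK) -> dict:
    """Divide rack totals evenly across GPUs (1 exa = 1000 peta, 1 PB = 1000 TB)."""
    gpus = rack["gpus"]
    return {
        "inference_pflops": rack["inference_exaflops"] * 1000 / gpus,
        "hbm4_bandwidth_tb_s": rack["hbm4_bandwidth_pb_s"] * 1000 / gpus,
    }


if __name__ == "__main__":
    for name, value in per_gpu_averages().items():
        print(f"{name}: {value:.1f}")
```

This gives roughly 50 petaflops of inference and about 22 TB/s of HBM4 bandwidth per GPU, assuming the rack totals are spread evenly.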

2U NVIDIA HGX Rubin NVL8 System

The 2U HGX Rubin NVL8 system is the densest and most flexible HGX platform, and the first to offer a choice of CPUs: NVIDIA Vera CPUs or next-generation AMD and Intel x86 processors. Built on the NVIDIA MGX rack architecture with Supermicro’s blind mate busbar and manifold for tool-free rack integration, it gives customers the freedom to pair eight Rubin GPUs with the CPU platform that best fits their workload and software stack.

Up to 9 systems and 72 GPUs in a rack

Supports the new NVIDIA Vera CPUs and next-generation x86 CPUs

Extreme co-design of Rubin GPU, NVLink 6, ConnectX-9, BlueField®-4 and Spectrum-X

DLC-2 98+% Heat Capture with DCBBS L2A Sidecar Option

2U Liquid-cooled System

For NVIDIA HGX Rubin NVL8

Supermicro 2-OU Liquid-cooled Front I/O System for NVIDIA HGX B300 8-GPU — Front I/O Liquid-cooled system designed for the NVIDIA MGX rack architecture with Supermicro’s blindmate busbar

NVIDIA Vera CPU Rack

Supermicro offers air-cooled and 100% direct liquid-cooled rack solutions powered by the NVIDIA Vera CPU. Each is a fully integrated system designed to handle the emerging requirements of agentic AI at scale as well as the most demanding HPC simulation workloads. Air-cooled configurations support up to 64 Vera CPUs in a single 48U rack, while the liquid-cooled configuration contains 256 liquid-cooled NVIDIA Vera CPUs with 22,528 cores. With up to 300TB/s of aggregate memory bandwidth, Vera CPU racks are purpose-built to provide the predictable throughput required for reinforcement learning and agentic AI workloads, and can expose additional threads when concurrency is required. Engineered as a complete infrastructure stack to maximize density, efficiency, and deployability for both AI and HPC data centers, the configurations are supported by Supermicro’s DLC-2 second-generation advanced direct liquid cooling technology for near-total heat capture and rack-level integration, allowing higher core densities per rack than traditional CPUs.

Up to 256 NVIDIA Vera CPUs, 22,528 cores, and 45,056 threads per rack

Up to 300TB/s aggregate memory bandwidth and up to 400TB memory capacity

100% direct liquid-cooling, fully factory-integrated 48U MGX rack

NVIDIA BlueField®-4 DPUs and Spectrum-X™ Ethernet for HPC-scale networking
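
The rack totals above are internally consistent, which a short calculation makes visible. The sketch below only divides the quoted liquid-cooled rack figures by the CPU count. The constant and function names are hypothetical, and because the bandwidth and capacity figures are quoted as "up to" values, the per-CPU results are upper bounds rather than specifications.

```python
# Liquid-cooled Vera CPU rack figures as quoted on this page.
CPUS_PER_RACK = 256
CORES_PER_RACK = 22_528
THREADS_PER_RACK = 45_056
MEM_BANDWIDTH_TB_S = 300   # "up to" aggregate memory bandwidth
MEM_CAPACITY_TB = 400      # "up to" memory capacity


def per_cpu_figures() -> dict:
    """Derive per-CPU values by dividing the rack totals by the CPU count."""
    cores_per_cpu, remainder = divmod(CORES_PER_RACK, CPUS_PER_RACK)
    if remainder:
        raise ValueError("core count does not divide evenly across CPUs")
    return {
        "cores_per_cpu": cores_per_cpu,
        "threads_per_core": THREADS_PER_RACK // CORES_PER_RACK,
        "mem_bandwidth_tb_s": MEM_BANDWIDTH_TB_S / CPUS_PER_RACK,
        "mem_capacity_tb": MEM_CAPACITY_TB / CPUS_PER_RACK,
    }


if __name__ == "__main__":
    for name, value in per_cpu_figures().items():
        print(f"{name}: {value}")
```

The totals resolve to 88 cores and 2 threads per core for each CPU, with up to about 1.17 TB/s of memory bandwidth and about 1.56 TB of memory per CPU.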

Ready to Build the Future of AI?

Contact Supermicro today to design your next-generation AI data center.
