Datasheet
NVIDIA Blackwell
The engine of the new industrial revolution.
Breaking Barriers in Accelerated Computing
Key Offerings
> NVIDIA GB200 NVL72
> NVIDIA HGX B200

The NVIDIA Blackwell architecture introduces groundbreaking advancements for generative AI and accelerated computing. The incorporation of the second-generation Transformer Engine, alongside the faster and wider NVIDIA NVLink™ interconnect, propels the data center into a new era, with orders of magnitude more performance compared to the previous architecture generation. Further advances in NVIDIA Confidential Computing technology raise the level of security for real-time LLM inference at scale without performance compromise. And Blackwell's new decompression engine, combined with Spark RAPIDS™ libraries, delivers unparalleled database performance to fuel data analytics applications. Blackwell's multiple advancements build upon generations of accelerated computing technologies to define the next chapter of generative AI with unparalleled performance, efficiency, and scale.
NVIDIA GB200 NVL72
Powering the New Era of Computing
Unlocking Real-Time Trillion-Parameter Models
NVIDIA GB200 NVL72 connects 36 Grace CPUs and 72 Blackwell GPUs in an NVIDIA
NVLink-connected, liquid-cooled, rack-scale design. Acting as a single, massive GPU, it
delivers 30X faster real-time trillion-parameter large language model (LLM) inference.
The GB200 Grace Blackwell Superchip is a key component of the NVIDIA GB200
NVL72, connecting two high-performance NVIDIA Blackwell GPUs and an NVIDIA
Grace CPU with the NVLink-C2C interconnect.
Real-Time LLM Inference
GB200 NVL72 introduces cutting-edge capabilities and a second-generation
Transformer Engine, which enables FP4 AI. This advancement is made possible
with a new generation of Tensor Cores, which introduce new microscaling formats,
giving high accuracy and greater throughput. Additionally, the GB200 NVL72 uses
NVLink and liquid cooling to create a single, massive 72-GPU rack that can overcome
communication bottlenecks.
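The microscaling idea behind FP4 can be pictured with a toy sketch: values are grouped into small blocks that each share one scale factor, so very low-precision elements still cover a wide dynamic range. The block size, value grid, and rounding below are illustrative assumptions, not the actual Blackwell Tensor Core format.

```python
import numpy as np

# Toy block-wise "microscaling" quantization (illustrative only, not the real
# Blackwell FP4 format): each block of 32 values shares one scale factor, and
# elements are rounded to a small FP4-style grid of representable magnitudes.
FP4_GRID = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])  # assumed E2M1-like magnitudes
BLOCK = 32

def quantize_mx_fp4(x):
    blocks = x.reshape(-1, BLOCK)
    scale = np.abs(blocks).max(axis=1, keepdims=True) / FP4_GRID.max()  # one scale per block
    scale = np.where(scale == 0, 1.0, scale)
    diff = np.abs(blocks / scale)[..., None] - FP4_GRID        # distance to each grid value
    codes = np.argmin(np.abs(diff), axis=-1)                    # nearest representable value
    return np.sign(blocks) * FP4_GRID[codes], scale

x = np.random.randn(4 * BLOCK).astype(np.float32)
q, s = quantize_mx_fp4(x)
print("max abs reconstruction error:", np.abs((q * s).reshape(-1) - x).max())
```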
GPT-MoE-1.8T Real-Time Throughput
[Chart] Output tokens per second per GPU: HGX H100 = 3.5, GB200 NVL72 = 116 (30X).
Projected performance, subject to change. LLM inference and energy efficiency: token-to-token latency (TTL) = 50 milliseconds (ms) real time, first token latency (FTL) = 5s, 32,768 input/1,024 output, NVIDIA HGX™ H100 scaled over InfiniBand (IB) versus GB200 NVL72.

NVIDIA GB200 NVL72 Key Features
> 36 NVIDIA Grace™ CPUs
> 72 NVIDIA Blackwell GPUs
> Up to 17 terabytes (TB) of LPDDR5X memory with error-correction code (ECC)
> Supports up to 13.5TB of HBM3E
> Up to 30.5TB of fast-access memory
> NVLink domain: 130 terabytes per second (TB/s) of low-latency GPU communication
Massive-Scale Training
GB200 NVL72 includes a faster second-generation Transformer Engine featuring 8-bit
floating point (FP8) precision, which enables a remarkable 4X faster training for large
language models at scale. This breakthrough is complemented by fifth-generation NVLink, which provides 1.8 terabytes per second (TB/s) of GPU-to-GPU interconnect, along with InfiniBand networking and NVIDIA Magnum IO™ software.
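In software, the hardware Transformer Engine is typically driven through NVIDIA's Transformer Engine library. The snippet below is a minimal sketch of running one layer under FP8 autocast; it assumes the transformer-engine package and an FP8-capable GPU, and exact recipe arguments can vary by library version.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# HYBRID recipe: E4M3 for forward activations/weights, E5M2 for gradients.
fp8_recipe = recipe.DelayedScaling(fp8_format=recipe.Format.HYBRID)

layer = te.Linear(4096, 4096, bias=True).cuda()          # Tensor Core friendly sizes
x = torch.randn(16, 4096, device="cuda", requires_grad=True)

with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)                                          # GEMM runs in FP8 on supported GPUs
y.float().sum().backward()                                # gradients follow the same FP8 recipe
```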
GPT-MoE-1.8T Model Training Speedup
[Chart] Speedup over H100: HGX H100 = 1X, GB200 NVL72 = 4X.
Training GPT-MoE-1.8T: 4,096x HGX H100 scaled over IB versus 456x GB200 NVL72 scaled over IB. Cluster size: 32,768.
Data Processing
Databases play critical roles in handling, processing, and analyzing large volumes of
data for enterprises. GB200 NVL72 takes advantage of the high-bandwidth-memory
performance, NVLink-C2C, and dedicated decompression engines in the NVIDIA
Blackwell architecture to speed up key database queries by 18X compared to CPU,
delivering a 5X better TCO.
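As a rough illustration of the kind of workload this accelerates, the sketch below runs a join-plus-aggregation on the GPU with RAPIDS cuDF; the tables and column names are made up for the example, and in practice the inputs would typically be loaded from compressed Parquet or ORC files that the decompression engine can unpack.

```python
import cudf

# Minimal GPU join with RAPIDS cuDF (illustrative tables and column names).
orders = cudf.DataFrame({
    "customer_id": [1, 2, 2, 3],
    "amount": [10.0, 20.0, 35.0, 5.0],
})
customers = cudf.DataFrame({
    "customer_id": [1, 2, 3],
    "region": ["east", "west", "east"],
})

# Hash join and group-by aggregation both execute on the GPU.
joined = orders.merge(customers, on="customer_id", how="inner")
print(joined.groupby("region")["amount"].sum())
```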
Database Join Query
[Chart] Queries per second: x86 CPUs = 5, HGX H100 = 15, GB200 NVL72 = 90 (18X).
Projected performance, subject to change. Database join query throughput comparing GB200 NVL72, 72x H100, and 72 x86 CPUs.
Energy-Efficient Infrastructure
Liquid-cooled GB200 NVL72 racks reduce a data center’s carbon footprint and energy
consumption. Liquid cooling increases compute density, reduces the amount of
floor space used, and facilitates high-bandwidth, low-latency GPU communication
with large NVLink domain architectures. Compared to the NVIDIA H100 air-cooled
infrastructure, GB200 NVL72 delivers 25X more performance at the same power
while reducing water consumption.
Energy Efficiency
[Chart] Relative energy efficiency: H100 GPU = 1X, GB200 NVL72 = 25X.
Projected performance, subject to change. Energy savings for 65 racks of eight-way HGX H100 air-cooled versus one rack of GB200 NVL72 liquid-cooled with equivalent performance on GPT-MoE-1.8T real-time inference throughput.
NVIDIA HGX B200
Propelling the Data Center Into a New Era of
Accelerated Computing
The NVIDIA HGX™ B200 propels the data center into a new era of accelerated computing and generative AI, integrating NVIDIA Blackwell GPUs with a high-speed
interconnect to accelerate AI performance at scale. As a premier accelerated scale-
up x86 platform with up to 15X faster real-time inference performance, 12X lower
cost, and 12X less energy use, HGX B200 is designed for the most demanding AI, data
analytics, and high-performance computing (HPC) workloads.
Real-Time Inference for the Next Generation of Large Language Models
HGX B200 achieves up to 15X higher inference performance over the previous NVIDIA Hopper™ generation for massive models such as GPT-MoE-1.8T. The second-generation Transformer Engine uses custom Blackwell Tensor Core technology combined with TensorRT™-LLM and NVIDIA NeMo™ framework innovations to accelerate inference for LLMs and mixture-of-experts (MoE) models.

HGX B200 Key Features
> 8 NVIDIA Blackwell GPUs
> Up to 1.4 terabytes (TB) of HBM3E memory
> 1,800GB/s NVLink between GPUs via NVSwitch™ chip
> 15X faster real-time LLM inference
> 3X faster training performance

GPT-MoE-1.8T Real-Time Throughput
[Chart] Output tokens per second per GPU: HGX H100 = 3.5, HGX B200 = 58 (15X).
Projected performance, subject to change. Token-to-token latency (TTL) = 50ms real time, first token latency (FTL) = 5s, input sequence length = 32,768, output sequence length = 1,028, 8x eight-way HGX H100 GPUs air-cooled versus 1x eight-way HGX B200 air-cooled, per-GPU performance comparison.
Next-Level Training Performance
The second-generation Transformer Engine, featuring FP8 and new precisions,
enables a remarkable 3X faster training for large language models like GPT MoE
1.8T. This breakthrough is complemented by fifth-generation NVLink with 1.8TB/s
of GPU-to-GPU interconnect, NVSwitch chip, InfiniBand networking, and NVIDIA
Magnum IO software. Together, these ensure efficient scalability for enterprises
and extensive GPU computing clusters.
GPT-MoE-1.8T Model Training Speedup
[Chart] Speedup over H100: HGX H100 = 1X, HGX B200 = 3X.
Projected performance, subject to change. 32,768-GPU scale; 4,096x eight-way HGX H100 air-cooled cluster with 400G IB network versus 4,096x eight-way HGX B200 air-cooled cluster with 400G IB network.
Sustainable Computing
By adopting sustainable computing practices, data centers can lower their carbon
footprints and energy consumption while improving their bottom line. The goal
of sustainable computing can be realized with efficiency gains using accelerated
computing with HGX. For LLM inference performance, HGX B200 improves energy
efficiency by 12X and lowers costs by 12X compared to the Hopper generation.
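The rack counts in the footnote below line up with the headline figure; a quick check, assuming roughly comparable per-rack power and cost:

```python
# Racks needed for equivalent inference performance (from the footnote below).
hgx_h100_racks = 100
hgx_b200_racks = 8
print(hgx_h100_racks / hgx_b200_racks)   # 12.5 -> roughly the quoted 12X energy and TCO reduction
```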
12X Lower Energy Use and TCO
[Chart] Relative total cost of ownership and energy use (lower is better): HGX B200 delivers 12X lower TCO and 12X lower energy use than HGX H100.
Projected performance, subject to change. Token-to-token latency (TTL) = 50ms real time, first token latency (FTL) = 5s, input sequence length = 32,768, output sequence length = 1,028, 8x eight-way HGX H100 GPUs air-cooled versus 1x eight-way HGX B200 air-cooled, per-GPU performance comparison. TCO and energy savings for 100 racks of eight-way HGX H100 air-cooled versus 8 racks of eight-way HGX B200 air-cooled with equivalent performance.
Technical Specifications¹

                                 GB200 NVL72                              HGX B200
Blackwell GPUs | Grace CPUs      72 | 36                                  8 | 0
CPU Cores                        2,592 Arm Neoverse V2 cores              -
Total FP4 Tensor Core            1,440 petaFLOPS                          144 petaFLOPS
Total FP8/FP6 Tensor Core        720 petaFLOPS/petaOPS                    72 petaFLOPS/petaOPS
Total Fast Memory                Up to 30TB                               Up to 1.4TB
Total Memory Bandwidth           Up to 576TB/s                            Up to 62TB/s
Total NVLink Bandwidth           130TB/s                                  14.4TB/s

Individual Blackwell GPU Specifications
FP4 Tensor Core                  20 petaFLOPS                             18 petaFLOPS
FP8/FP6 Tensor Core              10 petaFLOPS                             9 petaFLOPS
INT8 Tensor Core                 10 petaOPS                               9 petaOPS
FP16/BF16 Tensor Core            5 petaFLOPS                              4.5 petaFLOPS
TF32 Tensor Core                 2.5 petaFLOPS                            2.2 petaFLOPS
FP32                             80 teraFLOPS                             75 teraFLOPS
FP64/FP64 Tensor Core            40 teraFLOPS                             37 teraFLOPS
GPU Memory | Bandwidth           186GB HBM3e | 8TB/s                      180GB HBM3e | 7.7TB/s
Multi-Instance GPU (MIG)         7
Decompression Engine             Yes
Decoders                         7 NVDEC², 7 NVJPG
Max Thermal Design Power (TDP)   Configurable up to 1,200W                Configurable up to 1,000W
Interconnect                     5th-generation NVLink: 1.8TB/s; PCIe Gen5: 128GB/s
Server Options                   NVIDIA GB200 NVL72 partner and           NVIDIA HGX B200 partner and
                                 NVIDIA-Certified Systems™ with 72 GPUs   NVIDIA-Certified Systems with 8 GPUs

1. Preliminary specifications, subject to change. All Tensor Core numbers except FP64 with sparsity.
2. Supported formats provide these speed-ups over H100 Tensor Core GPUs: 2X H.264, 1.25X HEVC, 1.25X VP9. AV1 support is new to Blackwell GPUs.
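The per-system totals in the table follow from the per-GPU column and the GPU counts; a small cross-check (figures taken from the table above, with sparsity as noted in footnote 1):

```python
# Cross-check "total" rows against per-GPU figures and GPU counts from the table.
systems = {
    "GB200 NVL72": {"gpus": 72, "fp4_pflops": 20, "hbm_bw_tbs": 8.0},
    "HGX B200":    {"gpus": 8,  "fp4_pflops": 18, "hbm_bw_tbs": 7.7},
}
for name, s in systems.items():
    total_fp4 = s["gpus"] * s["fp4_pflops"]
    total_bw = s["gpus"] * s["hbm_bw_tbs"]
    print(f"{name}: {total_fp4} petaFLOPS FP4, {total_bw:.0f} TB/s memory bandwidth")
# GB200 NVL72: 72 x 20 = 1,440 petaFLOPS and 72 x 8 = 576 TB/s, matching the table.
# HGX B200: 8 x 18 = 144 petaFLOPS and 8 x 7.7 = 61.6, rounding to the table's 62 TB/s.
```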
AI Superchip
Blackwell-architecture GPUs pack 208 billion transistors and are manufactured using a custom-built TSMC 4NP process. All Blackwell products feature two reticle-limited dies connected by a 10 terabyte per second (TB/s) chip-to-chip interconnect in a unified single GPU.

2nd-Generation Transformer Engine
The second-generation Transformer Engine uses custom Blackwell Tensor Core technology combined with NVIDIA TensorRT-LLM and NeMo Framework innovations to accelerate inference and training for large language models (LLMs) and mixture-of-experts (MoE) models.

NVLink and NVLink Switch
The fifth generation of NVIDIA NVLink interconnect can scale up to 576 GPUs to unleash accelerated performance for multi-trillion-parameter AI models. The NVIDIA NVLink Switch chip enables 130TB/s of GPU bandwidth in one 72-GPU NVLink domain (NVL72) and delivers 4X bandwidth efficiency with NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)™ FP8 support.

RAS Engine
Blackwell adds intelligent resiliency with a dedicated reliability, availability, and serviceability (RAS) engine to identify potential faults early on and minimize downtime. NVIDIA's AI-powered predictive-management capabilities continuously monitor thousands of data points across hardware and software for overall health to predict and intercept sources of downtime and inefficiency.

Secure AI
Blackwell includes NVIDIA Confidential Computing, which protects sensitive data and AI models from unauthorized access with strong hardware-based security. Blackwell is the first TEE-I/O capable GPU in the industry, providing the most performant confidential compute solution with TEE-I/O capable hosts and inline protection over NVIDIA NVLink.

Decompression Engine
Blackwell's decompression engine and ability to access massive amounts of memory in the NVIDIA Grace CPU over a high-speed link (900 gigabytes per second (GB/s) of bidirectional bandwidth) accelerate the full pipeline of database queries for the highest performance in data analytics and data science, with support for the latest compression formats such as LZ4, Snappy, and Deflate.
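The 130TB/s NVLink-domain figure follows directly from the per-GPU NVLink bandwidth and the GPU count, as a one-line check shows:

```python
# 72 Blackwell GPUs x 1.8 TB/s of fifth-generation NVLink bandwidth per GPU.
print(72 * 1.8)   # 129.6, quoted as ~130 TB/s of aggregate bandwidth in one NVL72 domain
```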
NVIDIA AI Enterprise
NVIDIA AI Enterprise is the end-to-end software platform that brings generative AI
into reach for every enterprise, providing the fastest and most efficient runtime for
generative AI foundation models. It includes NVIDIA NIM™ inference microservices,
AI frameworks, libraries, and tools that are certified to run on common data center
platforms and mainstream NVIDIA-Certified Systems integrated with NVIDIA
GPUs. Part of NVIDIA AI Enterprise, NVIDIA NIM is a set of easy-to-use inference
microservices for accelerating the deployment of foundation models on any cloud
or data center and helping to keep your data secure. Enterprises that run their
businesses on AI rely on the security, support, manageability, and stability provided
by NVIDIA AI Enterprise to ensure a smooth transition from pilot to production.
Together with the NVIDIA Blackwell GPUs, NVIDIA AI Enterprise not only simplifies
the building of an AI-ready platform but also accelerates time to value.
Learn about AI workload workflows with NVIDIA AI Enterprise via
NVIDIA Launchpad’s hands-on labs.
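As an illustration of how a NIM inference microservice is typically consumed, the sketch below sends a chat request to a locally deployed NIM container; the host, port, and model name are assumptions about a typical deployment (NIM services generally expose an OpenAI-compatible REST API), not details from this datasheet.

```python
import requests

# Query a locally deployed NVIDIA NIM inference microservice (assumed endpoint
# and model name; adjust to whatever NIM container is actually running).
resp = requests.post(
    "http://localhost:8000/v1/chat/completions",
    json={
        "model": "meta/llama3-8b-instruct",
        "messages": [{"role": "user",
                      "content": "Summarize the Blackwell architecture in one sentence."}],
        "max_tokens": 128,
    },
    timeout=60,
)
print(resp.json()["choices"][0]["message"]["content"])
```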
Ready to Get Started?
To learn more about NVIDIA Blackwell, visit:
nvidia.com/blackwell
© 2024 NVIDIA Corporation and affiliates. All rights reserved. NVIDIA, the NVIDIA logo, Grace, HGX, Hopper,
Magnum IO, MGX, NeMo, NVIDIA-Certified Systems, NVLink, NVSwitch, Scalable Hierarchical Aggregation and
Reduction Protocol (SHARP), and TensorRT are trademarks and/or registered trademarks of NVIDIA Corporation
and affiliates in the U.S. and other countries. Other company and product names may be trademarks of the
respective owners with which they are associated. 3384703. Dec24