GPU

NVIDIA H200 SXM

Edit@4 months ago

Intergrated Memory(VRAM)
Capacity

141 GB

(HBM3e )

Bandwidth

4800 GB/s

685 Token/s

Vector Compute
FP64
33.45 T
FP32
66.91 T
FP16
X
BF16
X
INT32
INT8
X

NVIDIA H200 SXM General-Purpose Float-Point performance (Vector Performance / Scalar Performance)

FP64: 33.45 TFLOPS

FP32: 66.91 TFLOPS

Matirx Compute
FP64
66.91 T
133.82 T
FP32
X
FP16
990 T
1980 T
FP8
1979 T
3958 T
TF32
494 T
988 T
BF16
990 T
1980 T
INT16
X
INT8
1979 T
3958 T
INT4
X

NVIDIA H200 SXM AI performance (Tensor Performance / Matrix Performance)

FP64: 66.91 TFLOPS, with sparsity: 133.82 TFLOPS

FP16: 990 TFLOPS, with sparsity: 1980 TFLOPS

FP8: 1979 TFLOPS, with sparsity: 3958 TFLOPS

TF32: 494 TFLOPS, with sparsity: 988 TFLOPS

BF16: 990 TFLOPS, with sparsity: 1980 TFLOPS

INT8: 1979 TOPS, with sparsity: 3958 TOPS

Hardware Specs
NVIDIA H200 SXM is a 5nm chip, has 80000 million transistors, launched by NVIDIA at 2024. It has 141 GB built-in(On-Board/On-Chip) memory with bandwidth up to 4800 GB/s. It has 16896 general-purpose ALUs(CUDA cores/Shader cores) and 528 matrix cores(Tensor cores) .
Process Node
5 nm
Launch Year
2024

Vector(CUDA) Cores
16896
Matrix(Tensor) Cores
528
Core Frequency
1590 ~ 1980 MHz
Cache
50MB