A2 Datasheet
[Figure: Inference Speedup. Comparisons of one NVIDIA A2 Tensor Core GPU versus a dual-socket Xeon Gold 6330N CPU.]

System Configuration: CPU: HPE DL380 Gen10 Plus, 2S Xeon Gold 6330N @2.2GHz, 512GB DDR4
Computer Vision: EfficientDet-D0 (COCO, 512x512) | TensorRT 8.2, Precision: INT8, BS:8 (GPU) | OpenVINO 2021.4, Precision: INT8, BS:8 (CPU)
NLP: BERT-Large (Sequence length: 384, SQuAD: v1.1) | TensorRT 8.2, Precision: INT8, BS:1 (GPU) | OpenVINO 2021.4, Precision: INT8, BS:1 (CPU)
Text-to-Speech: Tacotron2 + Waveglow end-to-end pipeline (input length: 128) | PyTorch 1.9, Precision: FP16, BS:1 (GPU) | PyTorch 1.9, Precision: FP32, BS:1 (CPU)
SECOND-GENERATION RT CORES
Learn more
To learn more about the NVIDIA A2 Tensor Core GPU, visit nvidia.com/a2.
© 2022 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, Triton, NVIDIA-Certified Systems, and NGC are trademarks and/or registered
trademarks of NVIDIA Corporation in the U.S. and other countries. All other trademarks and copyrights are the property of their respective owners. MAR22