Transcription of NVIDIA A100 | Tensor Core GPU
1 NVIDIA a100 Tensor CORE GPU | | 2021 6 | 1 NVIDIA a100 Tensor Core GPU AI HPC NVIDIA NVIDIA Volta a100 20 a100 GPU GPU MIG NVIDIA a100 Tensor Core a100 80GB GPU 2TB/s a100 NVIDIA NGC AI AI HPC NVIDIA a100 Tensor CORE GPU NVIDIA a100 Tensor CORE GPU SXM4 PCIE A10 0 40GB PCIeA10 0 80GB PCIeA10 0 40GB SXMA10 0 80GB TFLOPSFP64 Tensor TFLOPST ensor Float 32 TF32 156 TFLOPS | 312 TFLOPS*BFLOAT16 Tensor Core312 TFLOPS | 624 TFLOPS*FP16 Tensor Core312 TFLOPS | 624 TFLOPS*INT8 Tensor Core624 TOPS| 1248 TOPS*GPU 40GB HBM280GB HBM2e40GB HBM280GB HBM2eGPU 1555GB/s1935GB/s1555GB/s2039GB/s TDP 250W300W400W400W GPU 7 MIG @ 5GB 7 MIG @ 10GB 7 MIG @ 5GB 7 MIG @ 10GB PCIeSXM 2 GPU NVIDIA NVLink 600GB/s**PCIe 64GB/sNVLink 600GB/sPCIe 64GB/s 1 8 GPU NVIDIA NVIDIA HGX A10 0 4 8 16 GPU NVIDIA 8 GPU NVIDIA DGX A10 0* ** SXM4
2 GPU HGX a100 PCIe GPU NVLink GPU NVIDIA AMPERE MIG a100 GPU NVLink GPU a100 a100 IT GPU Tensor CORE NVIDIA a100 312 teraFLOPS TFLOPS Tensor FLOPS Tensor TOPS NVIDIA Volta GPU 20 NVLINKA100 NVIDIA NVLink NVIDIA NVSwitch 16 a100 GPU 600GB/s NVLink a100 SXM GPU HGX a100 PCIe GPU NVLink 2 GPU HBM2E 80GB (HBM2e) a100 2TB/s GPU DRAM 95% a100 GPU MIG a100 GPU GPU MIG IT GPU GPU GPU AI a100 Tensor Core AI NVIDIA a100 Tensor CORE GPU | | 2021 6 | 2A100 80 GBFP16A100 40 GBFP1601X2X3X 1000 - 1XV100FP160 7X3X 3 AI DLRM HugeCTR DLRM = FP16 | NVIDIA a100 80GB = 48 | NVIDIA a100 40GB = 32 | NVIDIA
3 V100 32GB = 32 a100 80 GBA100 40GB050X100X150X250X200X - 245X CPU1X249X CPU 249 AI BERT-LARGE BERT-Large | CPU 6240 GHz = FP32 = 128 | V100 NVIDIA Tensor -RT (TRT) = INT8 = 256 | a100 40GB 80GB = 256 = INT8 a100 80 GBA100 40GB01X2X - 1X1 25X a100 40GB AI RNN-T (1/7) MIG MLPerf RNN-T TensorRT = LibriSpeech = FP16 01X2X3X4X5X9X8X7X6X - V100 32GB1XA100 40 GBA100 80GB8X4X a100 40GB 2 | GPU-BDB TPCx-BB GPU-BDB TPCx-BB | 10TB 30 ETL ML NLP | V100 32GB RAPIDS/Dask | a100 40GB a100 80GB RAPIDS/Dask/BlazingSQLA100 80 GBA100 40GB01X2X - 1X1 8X HPC Quantum Espresso CNT10 POR8 Quantum Espresso = FP64 V1002017P100 201601X2X3X4X7X5X11X10X9X8X6X1X2XV100201 83XV10020194XA100202011X HPC 11 HPC P100 Amber [PME-Cellulose_NVE]
4 Chroma [szscl21_24_128] GROMACS [ADH Dodec] MILC [Apex Medium] NAMD [stmv_nve_cuda] PyTorch (BERT-Large Fine Tuner] Quantum Espresso [AUSURF112-jR] FP32 [make_blobs (160000 x 64 : 10)] TensorFlow [ResNet-50] VASP 6 [Si Huge] | CPU 4 NVIDIA P100 V100 a100 GPU GPU NVIDIA a100 Tensor Core GPU 2021 NVIDIA Corporation. NVIDIA NVIDIA DGX HGX NGC NVIDIA NVLink NVSwitch Volta NVIDIA Corporation 2021 6 NVIDIA a100 Tensor Core GPU NVIDIA HPC 2000 a100 2000 GPU HPCAMBERAMBERHPCGAUSSIANGAUSSIANHPCOpenF OAMOpenFOAMHPCHPCANSYS FluentANSYS FluentHPCGROMACSGROMACSHPCHPCVASPVASPHPC A ltair nanoFluidXAltair nanoFluidXHPCDS SIMULIA AbaqusDS SIMULIA AbaqusHPCNAMDNAMDWRFWRFHPCHPCA ltair ultraFluidXAltair ultraFluidX)