GPU
Edit@4 months ago
FP64: 1.52 TFLOPS
FP32: 48.66 TFLOPS
FP16: 97.32 TFLOPS
FP16: 194.64 TFLOPS, with sparsity: 389.28 TFLOPS
FP8: 389.28 TFLOPS, with sparsity: 778.57 TFLOPS
BF16: 194.64 TFLOPS, with sparsity: 389.28 TFLOPS
INT8: 389.28 TOPS, with sparsity: 778.57 TOPS
INT4: 778.57 TOPS, with sparsity: 1557.14 TOPS