Read news on cutlass with our app.
Read more in the app
Fp8 runs ~100 tflops faster when the kernel name has "cutlass" in it
FP8 is ~100 tflops faster when the kernel name has "cutlass" in it