cutlass

Read news on cutlass with our app.

Read more in the app

Fp8 runs ~100 tflops faster when the kernel name has "cutlass" in it

FP8 is ~100 tflops faster when the kernel name has "cutlass" in it