Get the latest tech news

FP8 is ~100 tflops faster when the kernel name has "cutlass" in it


None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of fp8

fp8

Photo of ~100

~100

Photo of cutlass

cutlass

Related news:

News photo

LLVM/Clang 20.1 Released With AMX-AVX512, AMX-FP8, AVX10.2, AMD GFX950 & Much More

News photo

LLVM 20 Feature Development Wraps Up With AMX-AVX512, AMX-FP8, AVX10.2 & AMD GFX950