Read news on flashattention with our app.
Read more in the app
I rebuilt FlashAttention in Triton to understand the performance archaeology
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention