Get the latest tech news

Linux Lazy Unmap Flush "LUF" Reducing TLB Shootdowns By 97%, Faster AI LLM Performance


SK has been working on a Linux kernel feature dubbed Lazy Unmap Flush 'LUF' to defer TLB flushes until folios have been unmapped and freed are eventually allocated again.

The end result is what is most interesting and important: the LUF patches yielded TLB shootdown interrupts being reduced by around 97%. The most recent Lazy Unmap Flush patches were stressed for a week by running an AI LLM inference workload with 140GB of memory to prove its stability. The huge reduction in TLB shootdown interrupts and several percent gain to the Llama.cpp AI runtime as an example are quite promising.

Get the Android app

Or read this on Phoronix

Read more on:

Photo of Performance

Performance

Photo of luf

luf

Photo of linux lazy

linux lazy

Related news:

News photo

NVIDIA GeForce RTX 5080/5090 Performance With Neat Video 6 On Linux

News photo

Linux 6.13 Performance For 250Hz vs. 1000Hz Timer Frequency Comparison

News photo

Healthy Competition With GCC 15 vs. LLVM Clang 20 Performance On AMD Zen 5