Get the latest tech news

Optimization of 32-bit Unsigned Division by Constants on 64-bit Targets


Granlund and Montgomery proposed an optimization method for unsigned integer division by constants [3]. Their method (called the GM method in this paper) was further improved in part by works such as [1] and [7], and is now adopted by major compilers including GCC, Clang, Microsoft Compiler, and Apple Clang. However, for example, for x/7, the generated code is designed for 32-bit CPUs and therefore does not fully exploit 64-bit capabilities. This paper proposes an optimization method for 32-bit unsigned division by constants targeting 64-bit CPUs. We implemented patches for LLVM/GCC and achieved speedups of 1.67x on Intel Xeon w9-3495X (Sapphire Rapids) and 1.98x on Apple M4 (Apple M-series SoC) in the microbenchmark described later. The LLVM patch has already been merged into llvm:main [6], demonstrating the practical applicability of the proposed method.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of optimization

optimization

Photo of constants

constants

Photo of Unsigned Division

Unsigned Division

Related news:

News photo

The gold standard of optimization: A look under the hood of RollerCoaster Tycoon

News photo

RADV Driver Lands Another Optimization: "Missing In RADV For A Very Long Time"

News photo

Perlsecret – Perl secret operators and constants