Get the latest tech news

Modern Minimal Perfect Hashing: A Survey


Given a set $S$ of $n$ keys, a perfect hash function for $S$ maps the keys in $S$ to the first $m \geq n$ integers without collisions. It may return an arbitrary result for any key not in $S$ and is called minimal if $m = n$. The most important parameters are its space consumption, construction time, and query time. Years of research now enable modern perfect hash functions to be extremely fast to query, very space-efficient, and scale to billions of keys. Different approaches give different trade-offs between these aspects. For example, the smallest constructions get within 0.1% of the space lower bound of $\log_2(e)$ bits per key. Others are particularly fast to query, requiring only one memory access. Perfect hashing has many applications, for example to avoid collision resolution in static hash tables, and is used in databases, bioinformatics, and stringology. Since the last comprehensive survey in 1997, significant progress has been made. This survey covers the latest developments and provides a starting point for getting familiar with the topic. Additionally, our extensive experimental evaluation can serve as a guide to select a perfect hash function for use in applications.

View a PDF of the paper titled Modern Minimal Perfect Hashing: A Survey, by Hans-Peter Lehmann and 6 other authors Years of research now enable modern perfect hash functions to be extremely fast to query, very space-efficient, and scale to billions of keys. Additionally, our extensive experimental evaluation can serve as a guide to select a perfect hash function for use in applications.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of survey

survey

Related news:

News photo

Survey shows Gmail users would gladly sacrifice features for more privacy

News photo

Teenage students often use AI to do homework, a survey finds. This is the impact on their grades

News photo

A Survey of AI Agent Protocols