Get the latest tech news

RE#: High performance derivative-based regular expression matching (2024)


We present a tool and theory RE# for regular expression matching that is built on symbolic derivatives, does not use backtracking, and, in addition to the classical operators, also supports complement, intersection and lookarounds. We develop the theory formally and show that the main matching algorithm has input-linear complexity both in theory as well as experimentally. We apply thorough evaluation on popular benchmarks that show that RE# is over 71% faster than the next fastest regex engine in Rust on the baseline, and outperforms all state-of-the-art engines on extensions of the benchmarks often by several orders of magnitude.

View a PDF of the paper titled RE#: High Performance Derivative-Based Regex Matching with Intersection, Complement and Lookarounds, by Ian Erik Varatalu and Margus Veanes and Juhan-Peep Ernits View PDFHTML (experimental) Abstract:We present a tool and theory RE# for regular expression matching that is built on symbolic derivatives, does not use backtracking, and, in addition to the classical operators, also supports complement, intersection and lookarounds. From: Juhan Ernits [ view email][v1] Tue, 30 Jul 2024 00:51:03 UTC (712 KB)

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Derivative

Derivative

Photo of high performance

high performance

Related news:

News photo

QuestDB 9.0 Released For High Performance, Time-Series Database

News photo

Cmapv2: A high performance, concurrent map

News photo

Dbus-Broker 37 Released For High Performance & Reliable D-Bus