Sound static analysis of regular expressions for vulnerabilities to denial of service attacks

Francesco Parolini, Antoine Miné

doi:10.1016/j.scico.2023.102960

Abstract

Modern programming languages often provide functions to manipulate regular expressions in their standard libraries. If these libraries support advanced features, the matching algorithm has an exponential worst-case time complexity: for some so-called vulnerable regular expressions, an attacker can craft ad hoc strings that force the matcher to exhibit exponential behavior and thereby perform a Regular Expression Denial of Service (ReDoS) attack. In this paper, we introduce a framework based on a tree semantics to statically identify ReDoS vulnerabilities. In particular, we put forward an algorithm to extract an overapproximation of the set of words that are dangerous for a regular expression, effectively catching all possible attacks. We have implemented the analysis in a tool called rat. Testing it on a dataset of 74,669 regular expressions, we observed that in 99.78% of the instances the analysis terminates in less than one second. We compared rat to seven other ReDoS detectors and found that our tool is faster, often by orders of magnitude, than most other tools. While raising a low number of false positives, rat is the only ReDoS detector that does not report false negatives.
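To make the exponential behavior mentioned in the abstract concrete, here is a minimal Python sketch of a toy backtracking matcher. It is an illustration only: it is not the tree semantics or the analysis implemented in rat, and both the pattern `(a+)+$` and the function name `match_nested_plus` are assumptions chosen for this example. The matcher is hard-wired to that single pattern and counts how often it tests the end-of-input anchor, which serves as a rough proxy for the work a backtracking engine performs.

```python
def match_nested_plus(word: str) -> tuple[bool, int]:
    """Toy backtracking matcher for the fixed pattern (a+)+$ (illustrative only).

    Returns (matched, end_checks), where end_checks counts how many times
    the matcher tests the end-of-input anchor before it stops.
    """
    end_checks = 0

    def outer(pos: int) -> bool:
        nonlocal end_checks
        # Length of the run of 'a' characters starting at pos.
        run = 0
        while pos + run < len(word) and word[pos + run] == "a":
            run += 1
        # The inner a+ is greedy: try the longest group first, then backtrack.
        for k in range(run, 0, -1):
            nxt = pos + k
            # Option 1: start another iteration of the outer +.
            if outer(nxt):
                return True
            # Option 2: stop iterating and test the `$` anchor.
            end_checks += 1
            if nxt == len(word):
                return True
        return False

    return outer(0), end_checks


if __name__ == "__main__":
    # A run of 'a' followed by a character that can never match forces
    # the matcher to try every way of splitting the run into groups.
    for n in (4, 8, 12, 16):
        matched, checks = match_nested_plus("a" * n + "!")
        print(n, matched, checks)
```

In this toy model, an input of n copies of `a` followed by `!` is rejected only after 2^n - 1 anchor tests, so each extra `a` roughly doubles the work. A matching input such as `aaaa` succeeds after a single test. This asymmetry between accepted and rejected inputs is what an attacker exploits when crafting a ReDoS string.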

Journal: Science of Computer Programming
Publication Date: Jul 1, 2023
License type: publisher-specific-oa

Similar Papers

Sound Static Analysis of Regular Expressions for Vulnerabilities to Denial of Service Attacks
Francesco Parolini ... Antoine Miné
01 Jan 2021

Dynamically Reconfigurable Architecture with Atomic Configuration Updates for Flexible Regular Expressions Matching in FPGA
Vlastimil Košař ... Jan Korenek
01 Aug 2016

Exploring regular expression usage and context in Python
Carl Chapman ... Kathryn T Stolee
18 Jul 2016

CICERO: A Domain-Specific Architecture for Efficient Regular Expression Matching
Daniele Parravicini ... Emanuele Del Sozzo
ACM Transactions on Embedded Computing Systems | VOL. 20
17 Sep 2021
