Static Analysis Problems Research Articles

Context-free language (CFL) reachability is a standard approach in static analyses, where the analysis question (e.g., is there a dataflow from x to y?) is phrased as a language reachability problem on a graph G with respect to a CFL L. However, CFLs lack the expressiveness needed for high analysis precision. On the other hand, common formalisms for context-sensitive languages are too expressive, in the sense that the corresponding reachability problem becomes undecidable. Are there useful context-sensitive language-reachability models for static analysis? In this paper, we introduce Multiple Context-Free Language (MCFL) reachability as an expressive yet tractable model for static program analysis. MCFLs form an infinite hierarchy of mildly context-sensitive languages parameterized by a dimension d and a rank r. Larger d and r yield progressively more expressive MCFLs, offering tunable analysis precision. We showcase the utility of MCFL reachability by developing a family of MCFLs that approximate interleaved Dyck reachability, a common but undecidable static analysis problem. Given the increased expressiveness of MCFLs, one natural question pertains to their algorithmic complexity, i.e., how fast can MCFL reachability be computed? We show that the problem takes $O(n^{2d+1})$ time on a graph of n nodes when r = 1, and $O(n^{d(r+1)})$ time when r > 1. Moreover, we show that when r = 1, even the simpler membership problem has a lower bound of $n^{2d}$ based on the Strong Exponential Time Hypothesis, while reachability for d = 1 has a lower bound of $n^{3}$ based on the combinatorial Boolean Matrix Multiplication Hypothesis. Thus, for r = 1, our algorithm is optimal within a factor n for all levels of the hierarchy based on the dimension d (and fully optimal for d = 1). We implement our MCFL reachability algorithm and evaluate it by underapproximating interleaved Dyck reachability for a standard taint analysis for Android. When combined with existing overapproximate methods, MCFL reachability discovers all tainted information on 8 out of 11 benchmarks, while it has remarkable coverage (confirming 94.3% of the reachable pairs reported by the overapproximation) on the remaining 3. To our knowledge, this is the first report of high and provable coverage for this challenging benchmark set.
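
For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of plain CFL reachability using the textbook worklist approach, instantiated with a single-parenthesis Dyck grammar. It is not the MCFL algorithm from the paper. The function name `cfl_reach`, the edge labels `open` and `close`, and the helper nonterminal `T` are assumptions made for illustration only.

```python
from collections import defaultdict, deque

def cfl_reach(nodes, edges, epsilon, unary, binary):
    """Textbook worklist CFL reachability (illustrative sketch only).

    edges:   iterable of (u, label, v) terminal edges
    epsilon: nonterminals A with a production A -> eps
    unary:   pairs (A, B) for productions A -> B
    binary:  triples (A, B, C) for productions A -> B C
    Returns the set of derived facts (u, symbol, v).
    """
    facts = set()
    work = deque()
    succ = defaultdict(set)  # (u, symbol) -> {v : (u, symbol, v) is a fact}
    pred = defaultdict(set)  # (v, symbol) -> {u : (u, symbol, v) is a fact}

    def add(u, sym, v):
        if (u, sym, v) not in facts:
            facts.add((u, sym, v))
            succ[(u, sym)].add(v)
            pred[(v, sym)].add(u)
            work.append((u, sym, v))

    for u, label, v in edges:
        add(u, label, v)
    for a in epsilon:
        for n in nodes:
            add(n, a, n)

    while work:
        u, b, v = work.popleft()
        for a, rhs in unary:
            if rhs == b:
                add(u, a, v)
        for a, left, right in binary:
            if left == b:   # (u, left, v) followed by (v, right, w)
                for w in list(succ[(v, right)]):
                    add(u, a, w)
            if right == b:  # (w, left, u) followed by (u, right, v)
                for w in list(pred[(u, left)]):
                    add(w, a, v)
    return facts

# Dyck grammar S -> eps | S S | open S close, with the last production
# split into S -> open T and T -> S close (T is a hypothetical helper symbol).
dyck_facts = cfl_reach(
    nodes=[0, 1, 2, 3],
    edges=[(0, "open", 1), (1, "close", 2), (2, "open", 3)],
    epsilon=["S"],
    unary=[],
    binary=[("S", "S", "S"), ("S", "open", "T"), ("T", "S", "close")],
)
```

In this toy graph, node 2 is Dyck-reachable from node 0 because the path spells a balanced string, whereas node 3 is not, since the path to it ends with an unmatched opening parenthesis. Interleaved Dyck reachability, which the abstract targets, would require two such languages to hold simultaneously on the same path, which is what makes it undecidable in general.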

Pointer analysis is one of the fundamental problems in static program analysis. Given a set of pointers, the task is to produce a useful over-approximation of the memory locations that each pointer may point to at runtime. The most common formulation is Andersen’s Pointer Analysis (APA), defined as an inclusion-based set of m pointer constraints over a set of n pointers. Scalability is extremely important, as points-to information is a prerequisite to many other components in the static-analysis pipeline. Existing algorithms solve APA in $O(n^{2} \cdot m)$ time, while it has been conjectured that the problem has no truly sub-cubic algorithm, with a proof so far having remained elusive. It is also well-known that APA can be solved in $O(n^{2})$ time under certain sparsity conditions that hold naturally in some settings. Besides these simple bounds, the complexity of the problem has remained poorly understood. In this work we draw a rich fine-grained and parallel complexity landscape of APA, and present upper and lower bounds. First, we establish an $O(n^{3})$ upper bound for general APA, improving over $O(n^{2} \cdot m)$ as n = O(m). Second, we show that even on-demand APA (“may a specific pointer a point to a specific location b?”) has an $\Omega(n^{3})$ (combinatorial) lower bound under standard complexity-theoretic hypotheses. This formally establishes the long-conjectured “cubic bottleneck” of APA, and shows that our $O(n^{3})$-time algorithm is optimal. Third, we show that under mild restrictions, APA is solvable in $\tilde{O}(n^{\omega})$ time, where $\omega < 2.373$ is the matrix-multiplication exponent. It is believed that $\omega = 2 + o(1)$, in which case this bound becomes quadratic. Fourth, we show that even under such restrictions, even the on-demand problem has an $\Omega(n^{2})$ lower bound under standard complexity-theoretic hypotheses, and hence our algorithm is optimal when $\omega = 2 + o(1)$. Fifth, we study the parallelizability of APA and establish lower and upper bounds: (i) in general, the problem is P-complete and hence unlikely parallelizable, whereas (ii) under mild restrictions, the problem is parallelizable. Our theoretical treatment formalizes several insights that can lead to practical improvements in the future.
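
To make the inclusion-based formulation concrete, here is a minimal sketch of Andersen-style analysis as a naive fixpoint iteration over the four usual constraint forms (address-of, copy, load, store). It is not the $O(n^{3})$ algorithm from the paper and makes no attempt at efficiency. The function name `andersen`, the argument layout, and the variable names in the example are assumptions made for illustration only.

```python
from collections import defaultdict

def andersen(address_of, copy, load, store):
    """Naive fixpoint for inclusion-based points-to analysis (sketch only).

    Every argument is a list of (lhs, rhs) pairs:
      address_of: lhs = &rhs
      copy:       lhs = rhs
      load:       lhs = *rhs
      store:      *lhs = rhs
    Returns a dict mapping each pointer to its points-to set.
    """
    pts = defaultdict(set)
    for a, b in address_of:
        pts[a].add(b)

    def include(dst, src):
        """Enforce pts[src] as a subset of pts[dst]; report whether pts[dst] grew."""
        before = len(pts[dst])
        pts[dst] |= pts[src]
        return len(pts[dst]) > before

    changed = True
    while changed:
        changed = False
        for a, b in copy:                # a = b
            changed |= include(a, b)
        for a, b in load:                # a = *b
            for c in list(pts[b]):
                changed |= include(a, c)
        for a, b in store:               # *a = b
            for c in list(pts[a]):
                changed |= include(c, b)
    return {p: set(s) for p, s in pts.items()}

# Hypothetical program: p = &a; q = &b; *p = q; r = *p; s = p
points_to = andersen(
    address_of=[("p", "a"), ("q", "b")],
    copy=[("s", "p")],
    load=[("r", "p")],
    store=[("p", "q")],
)
```

In this example the store through `p` makes `a` point to `b`, and the subsequent load through `p` propagates that to `r`. An on-demand query of the kind mentioned in the abstract would ask only whether, say, `b` is in `points_to["r"]`, without requiring all sets to be materialized.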


Articles published on Static Analysis Problems

Program Analysis via Multiple Context Free Language Reachability

Penalty 4-Node Quadrilateral Element Formulation for Axisymmetric Couple Stress Problems.

Languages Generated by Conjunctive Query Fragments of FC[REG]

Continuum and Discrete Analytical Methods for Vibration and Buckling Eigenvalues Shape Sensitivities

Vibration of solid and thin-walled slender structures made of soft materials by high-order beam finite elements

Numerical investigation on the premature and extended contact behaviour of engineering thermoplastic gears and its effect in gear kinematics

Multiple context-free path querying by matrix multiplication

The Fine-Grained Complexity of CFL Reachability

Collapse capacity of masonry domes under horizontal loads: A static limit analysis approach

A finite difference method for the static limit analysis of masonry domes under seismic loads

Calculational design of a regular model checker by abstract interpretation

The fine-grained and parallel complexity of Andersen’s pointer analysis

Satisfiability of the Mu-Calculus with Arithmetic Constraints (original title in Russian)

Mu-Calculus Satisfiability with Arithmetic Constraints

Cognitive Regionology: The Experience of Modeling Regional Socio-Economic Processes

Systemizing Interprocedural Static Analysis of Large-scale Systems Code with Graspan

Generalized model of software code`s static analysis based on machine learning for vulnerabilitys search

Extended layerwise method for laminated piezoelectric and composite plates with delaminations, cracks or debonding of a piezoelectric patch

Rigid block modelling of historic masonry structures using mathematical programming: a unified formulation for non-linear time history, static pushover and limit equilibrium analysis

ABOUT WAVELET-BASED COMPUTATIONAL BEAM ANALYSIS WITH THE USE OF DAUBECHIES SCALING FUNCTIONS
