Nondeterministic Choice Research Articles

Regular expressions (regexes) are ubiquitous in modern software. There is a variety of implementation techniques for regex matching, which can be roughly categorized as (1) relying on backtracking search, or (2) being based on finite-state automata. The implementations that use backtracking are often chosen due to their ability to support advanced pattern-matching constructs. Unfortunately, they are known to suffer from severe performance problems. For some regular expressions, the running time for matching can be exponential in the size of the input text. In order to provide stronger guarantees of matching efficiency, automata-based regex matching is the preferred choice. However, even these regex engines may exhibit severe performance degradation for some patterns. The main reason for this is that regexes used in practice are not exclusively built from the classical regular constructs, i.e., concatenation, nondeterministic choice and Kleene's star. They involve additional constructs that provide succinctness and convenience of expression. The most common such construct is bounded repetition (also called counting), which describes the repetition of the pattern a fixed number of times. In this paper, we propose a new algorithm for the efficient matching of regular expressions that involve bounded repetition. Our algorithms are based on a new model of automata, which we call nondeterministic bit vector automata (NBVA). This model is chosen to be expressively equivalent to nondeterministic counter automata with bounded counters, a very natural model for expressing patterns with bounded repetition. We show that there is a class of regular expressions with bounded repetition that can be matched in time that is independent from the repetition bounds. Our algorithms are general enough to cover the vast majority of challenging bounded repetitions that arise in practice. We provide an implementation of our approach in a regex engine, which we call BVA-Scan. We compare BVA-Scan against state-of-the-art regex engines on several real datasets.

Read full abstract

AbstractInternet‐scale distributed systems often replicate data at multiple geographic locations to provide low latency and high availability, despite node and network failures. According to the CAP theorem, low latency and high availability can only be achieved at the cost of accepting weak consistency. The conflict‐free replicated data type (CRDT) is a framework that provides a principled approach to maintaining eventual consistency among data replicas. CRDTs have been notoriously difficult to design and implement correctly. Subtle deep bugs lie in the complex and tedious handling of all possible cases of conflicting data updates. We argue that the CRDT design should be formally specified and model checked, to uncover deep bugs which are beyond human reasoning. The implementation further needs to be systematically tested. On the one hand, the testing needs to inherit the exhaustive nature of the model checking and ensures the coverage of testing. On the other hand, the testing is expected to find coding errors which cannot be detected by design level verification. Toward the challenges above, we propose the model‐checking‐driven explorative testing (MET) framework. At the design level, MET uses TLA+ to specify and model check CRDT designs. At the implementation level, MET conducts model‐checking‐driven explorative testing, in the sense that the test cases are automatically generated from the model‐checking traces. The system execution is controlled to proceed deterministically, following the model‐checking trace. The explorative testing systematically controls and permutes all nondeterministic choices of message reorderings. We apply MET in our practical development of CRDTs. The bugs in both designs and implementations of CRDTs are found. As for bugs which can be found by traditional testing techniques, MET greatly reduces the cost of fixing the bugs. Moreover, MET can find subtle deep bugs which cannot be found by existing techniques at a reasonable cost. Based on our practical use of MET, we discuss how MET provides us with sufficient confidence in the correctness of our CRDT designs and implementations.Conflict‐free replicated data type (CRDT) is a framework that provides a principled approach to maintaining eventual consistency among data replicas in distributed systems. CRDTs have been notoriously difficult to design and implement correctly. We propose model‐checking‐driven explorative testing (MET) framework for dealing with such problem. We apply MET in our practical development of CRDTs. MET successfully finds subtle deep bugs and provides us with sufficient confidence in the correctness of our CRDT designs and implementations.

Read full abstract

Nondeterministic Choice Research Articles

Related Topics

Articles published on Nondeterministic Choice

Backward Responsibility in Transition Systems Using General Power Indices

Positive Almost-Sure Termination: Complexity and Proof Rules

Quantum Bisimilarity via Barbs and Contexts: Curbing the Power of Non-deterministic Observers

Sleptsov nets are Turing-complete

Optimizing Reachability Probabilities for a Restricted Class of Stochastic Hybrid Automata via Flowpipe Construction

Model Checking of Possibilistic Linear-Time Properties Based on Generalized Possibilistic Decision Processes

Towards Abstraction-based Probabilistic Program Analysis

Hypernormalisation in an abstract setting

Regular Expression Matching using Bit Vector Automata

Implementation relations and testing for cyclic systems: Adding probabilities

Model‐checking‐driven explorative testing of CRDT designs and implementations

Using the past for resolving the future

Step-Indexed Logical Relations for Countable Nondeterminism and Probabilistic Choice

Kantorovich-Rubinstein quasi-metrics III: Spaces of sublinear and superlinear previsions

The Right Kind of Non-Determinism: Using Concurrency to Verify C Programs with Underspecified Semantics

Minimization and Canonization of GFG Transition-Based Automata

Specification of systems with parameterised events: An institution-independent approach

Modeling Method to Abstract Collective Behavior of Smart IoT Systems in CPS.

Towards Automated Discovery of God-Like Folk Algorithms for Rubik’s Cube

Proof Theory of Skew Non-Commutative MILL

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Nondeterministic Choice Research Articles

Related Topics

Articles published on Nondeterministic Choice

Backward Responsibility in Transition Systems Using General Power Indices

Positive Almost-Sure Termination: Complexity and Proof Rules

Quantum Bisimilarity via Barbs and Contexts: Curbing the Power of Non-deterministic Observers

Sleptsov nets are Turing-complete

Optimizing Reachability Probabilities for a Restricted Class of Stochastic Hybrid Automata via Flowpipe Construction

Model Checking of Possibilistic Linear-Time Properties Based on Generalized Possibilistic Decision Processes

Towards Abstraction-based Probabilistic Program Analysis

Hypernormalisation in an abstract setting

Regular Expression Matching using Bit Vector Automata

Implementation relations and testing for cyclic systems: Adding probabilities

Model‐checking‐driven explorative testing of CRDT designs and implementations

Using the past for resolving the future

Step-Indexed Logical Relations for Countable Nondeterminism and Probabilistic Choice

Kantorovich-Rubinstein quasi-metrics III: Spaces of sublinear and superlinear previsions

The Right Kind of Non-Determinism: Using Concurrency to Verify C Programs with Underspecified Semantics

Minimization and Canonization of GFG Transition-Based Automata

Specification of systems with parameterised events: An institution-independent approach

Modeling Method to Abstract Collective Behavior of Smart IoT Systems in CPS.

Towards Automated Discovery of God-Like Folk Algorithms for Rubik’s Cube

Proof Theory of Skew Non-Commutative MILL