On optimizing syntactic pattern recognition using tries and AI-based heuristic-search strategies

G Badr,B.J Oommen

doi:10.1109/tsmcb.2005.861860

Abstract

This paper deals with the problem of estimating, using enhanced artificial-intelligence (AI) techniques, a transmitted string X* by processing the corresponding string Y, which is a noisy version of X*. It is assumed that Y contains substitution, insertion, and deletion (SID) errors. The best estimate X+ of X* is defined as that element of a dictionary H that minimizes the generalized Levenshtein distance (GLD) D (X, Y) between X and Y, for all X epsilon H. In this paper, it is shown how to evaluate D (X, Y) for every X epsilon H simultaneously, when the edit distances are general and the maximum number of errors is not given a priori, and when H is stored as a trie. A new scheme called clustered beam search (CBS) is first introduced, which is a heuristic-based search approach that enhances the well-known beam-search (BS) techniques used in AI. The new scheme is then applied to the approximate string-matching problem when the dictionary is stored as a trie. The new technique is compared with the benchmark depth-first search (DFS) trie-based technique (with respect to time and accuracy) using large and small dictionaries. The results demonstrate a marked improvement of up to 75% with respect to the total number of operations needed on three benchmark dictionaries, while yielding an accuracy comparable to the optimal. Experiments are also done to show the benefits of the CBS over the BS when the search is done on the trie. The results also demonstrate a marked improvement (more than 91%) for large dictionaries.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On optimizing syntactic pattern recognition using tries and AI-based heuristic-search strategies

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)

Lead the way for us

Journal: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)	Publication Date: Jun 1, 2006
Citations: 42

Similar Papers

Enhancing Trie-Based Syntactic Pattern Recognition Using AI Heuristic Search Strategies
Ghada Badr ... B John Oommen
-
Ghada Badr, et. al.Ghada Badr ... B John Oommen
01 Jan 2004
01 Jan 2004

An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors
Yuan Fu Liao ... Sen Chia Chang
-
Yuan Fu Liao, et. al. Yuan Fu Liao ... Sen Chia Chang
01 Jan 2007
01 Jan 2007

A novel look-ahead optimization strategy for trie-based approximate string matching
Ghada Badr ... B John Oommen
Pattern Analysis and Applications | VOL. 9
Ghada Badr, et. al.Ghada Badr ... B John Oommen
26 Aug 2006
Pattern Analysis and Applications | VOL. 9

Insertion and deletion correcting DNA barcodes based on watermarks.
David Kracht ... Steffen Schober
BMC Bioinformatics | VOL. 16
David Kracht, et. al.David Kracht ... Steffen Schober
18 Feb 2015
BMC Bioinformatics | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On optimizing syntactic pattern recognition using tries and AI-based heuristic-search strategies

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)