Approximate String Matching Algorithm Research Articles

Background: Patient-Centered Medical Home (PCMH) adoption is an important strategy to help improve primary care quality within Health Resources and Services Administration (HRSA) community health centers (CHCs), but evidence of its effect thus far remains mixed. A limitation of previous evaluations has been the inability to account for the proportion of CHC delivery sites that are designated medical homes.

Methods: Retrospective cross-sectional study using the HRSA Uniform Data System (UDS) and certification files from the National Committee for Quality Assurance (NCQA) and the Joint Commission (JC). Datasets were linked through geocoding and an approximate string-matching algorithm. Using beta logistic regression, predicted probability scores were regressed onto 11 clinical performance measures at 10% increments in site-level designation.

Results: The geocoding and approximate string-matching algorithm identified 2615 of the 6851 (41.8%) delivery sites included in the analyses as having been designated through the NCQA and/or JC. In total, 74.7% (n = 777) of the 1039 CHCs that met the inclusion criteria for the analysis managed at least one NCQA- and/or JC-designated site. A proportional increase in site-level designation showed a positive association with adherence scores for the majority of indicators, but primarily among CHCs that designated at least 50% of their delivery sites. Once this threshold was achieved, there was a stepwise percentage point increase in adherence scores, ranging from 1.9 to 11.8% improvement, depending on the measure.

Conclusion: Geocoding and approximate string-matching techniques offer a more reliable and nuanced approach for monitoring the association between site-level PCMH designation and clinical performance within HRSA’s CHC delivery sites. Our findings suggest that transformation does in fact matter, but that its effect may not appear until half of the delivery sites become designated. There also appears to be a continued stepwise increase in adherence scores once this threshold is achieved.
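The abstract does not say which string-matching algorithm was used for the linkage step. As a rough illustration only, the sketch below links site names from one file to another using a normalized Levenshtein similarity. The function names, the example site names, and the 0.85 threshold are all hypothetical, and the geocoding step is not modeled.

```python
def levenshtein(a, b):
    """Edit distance (insertions, deletions, substitutions) via dynamic programming."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def similarity(a, b):
    """Similarity in [0, 1]; 1.0 means identical after lowercasing and trimming."""
    a, b = a.lower().strip(), b.lower().strip()
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def link_sites(site_names, certified_names, threshold=0.85):
    """Map each site name to its best-scoring certified name, if above the threshold.

    The threshold is an assumed value for illustration, not one from the study.
    """
    links = {}
    for name in site_names:
        best = max(certified_names, key=lambda c: similarity(name, c), default=None)
        if best is not None and similarity(name, best) >= threshold:
            links[name] = best
    return links


# Hypothetical site names, invented for this example.
sites = ["Eastside Family Health Center", "Northgate Pediatrics"]
certified = ["Eastside Family Health Centre", "Westside Dental"]
links = link_sites(sites, certified)
```

In this example only the first site is linked, because "Center" and "Centre" differ by two substitutions out of 29 characters. A real linkage would likely also compare addresses or coordinates before accepting a match.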


Approximate substring searching is a common but computationally demanding task in bioinformatics and text analysis. We present a new approach that recasts string search as a multiple convolution problem, then exploits highly efficient fast Fourier convolution techniques. This approach, which we call ffgrep, computes and caches the spectra of a target corpus, drastically reducing the cost of subsequent searches. Like other approaches, this algorithm is embarrassingly parallelizable; unlike other approaches, it is capable of operating not only on raw strings but also on word embeddings. ffgrep is applied to an original corpus of imperfect automatic transcriptions of campaign speeches in the 2012 U.S. presidential election. We contrast our approach with agrep, an industry-standard meta-algorithm that selects the optimal member from a number of highly optimized approximate string matching algorithms. Searching for approximate recurrences of a manually curated set of candidate catchphrases, we show that ffgrep speeds computation by up to a factor of 60 in typical settings, with increasing gains as alignments grow longer or more complex. Moreover, these computational gains come at little cost in performance. Taking agrep search results as ground truth, over a wide range of agrep parameters, we show that ffgrep is capable of recovering highly similar results with accuracies exceeding 0.94 and F1 of 0.84–0.9. Finally, we demonstrate how efficient substring matching enables new substantive research by identifying candidate catchphrases without human supervision. By rapidly computing and organizing 90 billion pairwise string comparisons, our proposed method automatically learns that the phrases “kick children off of Head Start or eliminate health insurance for the poor” and “kick students are [sic] financial aid or get rid of funding for Planned Parenthood or eliminate health care for millions on Medicaid”, along with 32 other campaign appeals, all map onto a single recurring theme, President Barack Obama’s critique of a proposed Medicare reform.
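To make the idea of recasting string search as convolution concrete, here is a minimal sketch of the textbook technique, not ffgrep's actual implementation. For each character, the text and the reversed pattern are turned into 0/1 indicator vectors, and their convolution (computed with an FFT) counts how many positions agree at every alignment. The sketch handles mismatches only (Hamming distance), does not cache spectra, and all function names are assumptions made for illustration.

```python
import cmath


def fft(values, invert=False):
    """Recursive radix-2 FFT; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return list(values)
    even = fft(values[0::2], invert)
    odd = fft(values[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def convolve(a, b):
    """Linear convolution of two real sequences via zero-padded FFTs."""
    size = 1
    while size < len(a) + len(b) - 1:
        size *= 2
    fa = fft([complex(x) for x in a] + [0j] * (size - len(a)))
    fb = fft([complex(x) for x in b] + [0j] * (size - len(b)))
    product = [x * y for x, y in zip(fa, fb)]
    # The inverse transform above is unnormalized, so divide by size here.
    return [v.real / size for v in fft(product, invert=True)]


def match_counts(text, pattern):
    """Number of agreeing characters for each alignment of pattern in text."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    counts = [0.0] * (n - m + 1)
    for ch in set(pattern):
        t = [1.0 if c == ch else 0.0 for c in text]
        p = [1.0 if c == ch else 0.0 for c in reversed(pattern)]
        conv = convolve(t, p)
        for i in range(n - m + 1):
            counts[i] += conv[i + m - 1]  # alignment i sits at index i + m - 1
    return [round(c) for c in counts]


def approximate_find(text, pattern, max_mismatches):
    """Start positions where pattern matches text with few enough mismatches."""
    m = len(pattern)
    return [i for i, c in enumerate(match_counts(text, pattern))
            if m - c <= max_mismatches]


hits = approximate_find("kick children off of head start", "childern", 2)
```

Here the misspelled pattern "childern" is found at position 5 with two mismatches. In this sketch the text-side transforms are recomputed on every call; reusing them across queries is the kind of saving the abstract attributes to caching corpus spectra.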



Articles published on Approximate String Matching Algorithm

Survey on Context Based Identification of Customer Name Variations using ML Techniques

An FPGA Based Energy-Efficient Read Mapper With Parallel Filtering and In-Situ Verification

Hardware-Algorithm Codesign for Fast and Energy Efficient Approximate String Matching on FPGA for Computational Biology

Association of Patient-Centered Medical Home designation and quality indicators within HRSA-funded community health center delivery sites

Keywords Search Correction Using Damerau Levenshtein Distance Algorithm

Ffgrep: Scalable Approximate String Matching

A Novel Algorithm for Online Inexact String Matching and its FPGA Implementation

Exact String Matching Algorithms: Survey, Issues, and Future Research Directions

Mining Query Plans for Finding Candidate Queries and Sub-Queries for Materialized Views in BI Systems Without Cube Generation

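A Web Interface for Major Bioinformatics Algorithms and a New Automaton-Based Approximate Pattern Matching Approach [BAŞLICA BİYOİNFORMATİK ALGORİTMALARI İÇİN WEB ARA YÜZÜ VE YENİ OTOMAT TABANLI YAKLAŞIK DESEN EŞLEŞTİRME YAKLAŞIMI]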

Data mining algorithm for pre-processing biopharmaceutical drug product manufacturing records

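Implementation of the Approximate String Matching Algorithm in an Android-Based Philosophy Application [IMPLEMENTASI ALGORITMA APPROXIMATE STRING MATCHING PADA APLIKASI FILOSOFI BERBASIS ANDROID]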

Correction to: New algorithms for fixed-length approximate string matching and approximate circular string matching under the Hamming distance

New algorithms for fixed-length approximate string matching and approximate circular string matching under the Hamming distance

A parallel approximate string matching under Levenshtein distance on graphics processing units using warp-shuffle operations

Parallelizing Exact and Approximate String Matching via Inclusive Scan on a GPU

A consensus algorithm for approximate string matching and its application to QRS complex detection

New rule-based phishing detection method

An efficient pruning strategy for approximate string matching over suffix tree

Approximate String Matching Algorithms: A Brief Survey and Comparison

