Text Analysis System Research Articles

To develop scalable informatics infrastructure for normalization of both structured and unstructured electronic health record (EHR) data into a unified, concept-based model for high-throughput phenotype extraction. Software tools and applications were developed to extract information from EHRs. Representative and convenience samples of both structured and unstructured data from two EHR systems-Mayo Clinic and Intermountain Healthcare-were used for development and validation. Extracted information was standardized and normalized to meaningful use (MU) conformant terminology and value set standards using Clinical Element Models (CEMs). These resources were used to demonstrate semi-automatic execution of MU clinical-quality measures modeled using the Quality Data Model (QDM) and an open-source rules engine. Using CEMs and open-source natural language processing and terminology services engines-namely, Apache clinical Text Analysis and Knowledge Extraction System (cTAKES) and Common Terminology Services (CTS2)-we developed a data-normalization platform that ensures data security, end-to-end connectivity, and reliable data flow within and across institutions. We demonstrated the applicability of this platform by executing a QDM-based MU quality measure that determines the percentage of patients between 18 and 75 years with diabetes whose most recent low-density lipoprotein cholesterol test result during the measurement year was <100 mg/dL on a randomly selected cohort of 273 Mayo Clinic patients. The platform identified 21 and 18 patients for the denominator and numerator of the quality measure, respectively. Validation results indicate that all identified patients meet the QDM-based criteria. End-to-end automated systems for extracting clinical information from diverse EHR systems require extensive use of standardized vocabularies and terminologies, as well as robust information models for storing, discovering, and processing that information. This study demonstrates the application of modular and open-source resources for enabling secondary use of EHR data through normalization into standards-based, comparable, and consistent format for high-throughput phenotyping to identify patient cohorts.

Read full abstract

Curation of biomedical literature is often supported by the automatic analysis of textual content that generally involves a sequence of individual processing components. Text mining (TM) has been used to enhance the process of manual biocuration, but has been focused on specific databases and tasks rather than an environment integrating TM tools into the curation pipeline, catering for a variety of tasks, types of information and applications. Processing components usually come from different sources and often lack interoperability. The well established Unstructured Information Management Architecture is a framework that addresses interoperability by defining common data structures and interfaces. However, most of the efforts are targeted towards software developers and are not suitable for curators, or are otherwise inconvenient to use on a higher level of abstraction. To overcome these issues we introduce Argo, an interoperable, integrative, interactive and collaborative system for text analysis with a convenient graphic user interface to ease the development of processing workflows and boost productivity in labour-intensive manual curation. Robust, scalable text analytics follow a modular approach, adopting component modules for distinct levels of text analysis. The user interface is available entirely through a web browser that saves the user from going through often complicated and platform-dependent installation procedures. Argo comes with a predefined set of processing components commonly used in text analysis, while giving the users the ability to deposit their own components. The system accommodates various areas and levels of user expertise, from TM and computational linguistics to ontology-based curation. One of the key functionalities of Argo is its ability to seamlessly incorporate user-interactive components, such as manual annotation editors, into otherwise completely automatic pipelines. As a use case, we demonstrate the functionality of an in-built manual annotation editor that is well suited for in-text corpus annotation tasks.Database URL: http://www.nactem.ac.uk/Argo

Read full abstract

Text Analysis System Research Articles

Related Topics

Articles published on Text Analysis System

Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium.

An Interactive System for Visual Analytics of Dynamic Topic Models

Time Trends in Printed News Coverage of Female Subjects, 1880–2008

Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification

Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records.

Navigating an imagined Middle&ndash;earth: Finding and analyzing text&ndash;based and film&ndash;based mental images of Middle&ndash;earth through TheOneRing.net online fan community

Getting More Out of Biomedical Documents with GATE's Full Lifecycle Open Source Text Analytics

A common type system for clinical natural language processing

Methods for dictionary generation

Argo: an integrative, interactive, text mining-based workbench supporting curation

A system for coreference resolution for the clinical narrative

Automated discovery of drug treatment patterns for endocrine therapy of breast cancer within an electronic medical record

The Yale cTAKES extensions for document classification: architecture and application.

An evaluation of text analysis technologies

GENERATING AUTOMATED TEXT COMPLEXITY CLASSIFICATIONS THAT ARE ALIGNED WITH TARGETED TEXT COMPLEXITY STANDARDS

Using Computational Techniques to Fill the Gap between Qualitative Data Analysis and Text Analytics

Trading Strategies to Exploit Blog and News Sentiment

CyberGate: A Design Framework and System for Text Analysis of Computer-Mediated Communication

TOWARDS A SYSTEMATIC EVALUATION OF PROTEIN MUTATION EXTRACTION SYSTEMS

Text analytics for life science using the Unstructured Information Management Architecture

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Text Analysis System Research Articles

Related Topics

Articles published on Text Analysis System

Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium.

An Interactive System for Visual Analytics of Dynamic Topic Models

Time Trends in Printed News Coverage of Female Subjects, 1880–2008

Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification

Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records.

Navigating an imagined Middle&amp;ndash;earth: Finding and analyzing text&amp;ndash;based and film&amp;ndash;based mental images of Middle&amp;ndash;earth through TheOneRing.net online fan community

Getting More Out of Biomedical Documents with GATE's Full Lifecycle Open Source Text Analytics

A common type system for clinical natural language processing

Methods for dictionary generation

Argo: an integrative, interactive, text mining-based workbench supporting curation

A system for coreference resolution for the clinical narrative

Automated discovery of drug treatment patterns for endocrine therapy of breast cancer within an electronic medical record

The Yale cTAKES extensions for document classification: architecture and application.

An evaluation of text analysis technologies

GENERATING AUTOMATED TEXT COMPLEXITY CLASSIFICATIONS THAT ARE ALIGNED WITH TARGETED TEXT COMPLEXITY STANDARDS

Using Computational Techniques to Fill the Gap between Qualitative Data Analysis and Text Analytics

Trading Strategies to Exploit Blog and News Sentiment

CyberGate: A Design Framework and System for Text Analysis of Computer-Mediated Communication

TOWARDS A SYSTEMATIC EVALUATION OF PROTEIN MUTATION EXTRACTION SYSTEMS

Text analytics for life science using the Unstructured Information Management Architecture

Navigating an imagined Middle–earth: Finding and analyzing text–based and film–based mental images of Middle–earth through TheOneRing.net online fan community