Abstract

Electronic Health Records (EHR) are a rich repository of valuable clinical information that exist in primary and secondary care databases. In order to utilize EHRs for medical observational research a range of algorithms for automatically identifying individuals with a specific phenotype have been developed. This review summarizes and offers a critical evaluation of the literature relating to studies conducted into the development of EHR phenotyping systems. This review describes phenotyping systems and techniques based on structured and unstructured EHR data. Articles published on PubMed and Google scholar between 2013 and 2017 have been reviewed, using search terms derived from Medical Subject Headings (MeSH). The popularity of using Natural Language Processing (NLP) techniques in extracting features from narrative text has increased. This increased attention is due to the availability of open source NLP algorithms, combined with accuracy improvement. In this review, Concept extraction is the most popular NLP technique since it has been used by more than 50% of the reviewed papers to extract features from EHR. High-throughput phenotyping systems using unsupervised machine learning techniques have gained more popularity due to their ability to efficiently and automatically extract a phenotype with minimal human effort.

Highlights

  • Electronic health records (EHR) are a rich repository of valuable clinical information that exist in primary and secondary care databases [1]

  • With the improvement and development in information technology, increasingly these records are adopted for secondary uses such as quality and safety measurement, clinical decision support, genome-wide association studies (GWAS), and clinical research [1,2]

  • Electronic Medical Records and Genomics Network [24], Strategic Health IT Advanced Research Project (SHARP) [25], and the National Patient-Centered Clinical Research Network (PCORnet) [26] are examples of collaboration aimed at enhancing research using EHRs across multiple institutions

Read more

Summary

A Review of Automatic Phenotyping Approaches using Electronic Health Records

Hadeel Alzoubi 1, *, Raid Alzubi 2 , Naeem Ramzan 3 , Daune West 3 , Tawfik Al-Hadhrami 4, *. Received: 27 September 2019; Accepted: 22 October 2019; Published: 29 October 2019

Introduction
EHR Warehouse
Pre-Processing
Feature Extraction
Structured Feature Extraction
Unstructured Feature Extraction
Keywords Search
Concept Extraction
Feature Selection
Classification
Rule-Based
Rules Based on Clinical Judgment
Rules Based on Healthcare Guidelines
Machine Learning
Supervised Learning
Decision Tree
Unsupervised Learning
Combined Approaches
Validation and Evaluation
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call