Using whole genome scores to compare three clinical phenotyping methods in complex diseases

Wenyu Song,Adam Wright,Cheng-Zhong Zhang,David W Bates,Hailiang Huang

doi:10.1038/s41598-018-29634-w

Abstract

Genome-wide association studies depend on accurate ascertainment of patient phenotype. However, phenotyping is difficult, and it is often treated as an afterthought in these studies because of the expense involved. Electronic health records (EHRs) may provide higher fidelity phenotypes for genomic research than other sources such as administrative data. We used whole genome association models to evaluate different EHR and administrative data-based phenotyping methods in a cohort of 16,858 Caucasian subjects for type 1 diabetes mellitus, type 2 diabetes mellitus, coronary artery disease and breast cancer. For each disease, we trained and evaluated polygenic models using three different phenotype definitions: phenotypes derived from billing data, the clinical problem list, or a curated phenotyping algorithm. We observed that for these diseases, the curated phenotype outperformed the problem list, and the problem list outperformed administrative billing data. This suggests that using advanced EHR-derived phenotypes can further increase the power of genome-wide association studies.

Highlights

A fundamental goal of precision medicine is to use genomic data to explain and predict health status
We applied a series of steps including linkage disequilibrium (LD) pruning to remove the low-quality SNPs before obtained 472,811 autosomal SNPs
Four complex diseases with different genetic heritabilities were chosen for this study: type 1 diabetes mellitus (T1DM), type 2 diabetes mellitus (T2DM), coronary artery disease (CAD) and breast cancer (BC)

Summary

Introduction

A fundamental goal of precision medicine is to use genomic data to explain and predict health status. Studies have shown that many human complex disorders are driven by genomic factors[1,2,3]. In light of these findings, researchers are trying to further address the causal relationship between genetic variations and specific diseases phenotypes. Many genome-wide association studies (GWAS) use self-reported binary phenotypic descriptions or administrative data to establish phenotypes[10,11]. Prior research has shown that self-reported disease status and administrative data, such as billing data, are often inaccurate[12,13]. Compared with traditional self-reported phenotypes, EHR data can efficiently create standardized phenotypes with refinable definitions in large cohort studies.

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jul 27, 2018
Citations: 10	License type: open-access

R Discovery Prime

R Discovery Prime

Using whole genome scores to compare three clinical phenotyping methods in complex diseases

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

The rocky exhilarating journey from data to wisdom
Paul Kurlansky
The Journal of Thoracic and Cardiovascular Surgery | VOL. 162
Paul KurlanskyPaul Kurlansky
24 Jun 2020
The Journal of Thoracic and Cardiovascular Surgery | VOL. 162

Interdisciplinary Models for Research and Clinical Endeavors in Genomic Medicine: A Scientific Statement From the American Heart Association.
Kiran Musunuru ... Joseph Loscalzo
Circulation. Genomic and precision medicine | VOL. 11
Kiran Musunuru, et. al.Kiran Musunuru ... Joseph Loscalzo
01 Jun 2018
Circulation. Genomic and precision medicine | VOL. 11

Abstract 173: A Comparison of Administrative Claims Discharge Diagnosis with Electronic Medical Record Problem List in the Diagnosis of Acute Myocardial Infarction
Michael D Mcculloch ... Umesh N Khot
Circulation: Cardiovascular Quality and Outcomes | VOL. 7
Michael D Mcculloch, et. al.Michael D Mcculloch ... Umesh N Khot
01 Jul 2014
Circulation: Cardiovascular Quality and Outcomes | VOL. 7

Genetics of Coronary Artery Disease
Wolfgang Lieb ... Ramachandran S Vasan
Circulation | VOL. 128
Wolfgang Lieb, et. al.Wolfgang Lieb ... Ramachandran S Vasan
03 Sep 2013
Circulation | VOL. 128

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using whole genome scores to compare three clinical phenotyping methods in complex diseases

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports