DRAMS: A tool to detect and re-align mixed-up samples for integrative studies of multi-omics data.

Yi Jiang, Bingshan Li, Gina Giase, Sihan Liu, Chunyu Liu, Annie W Shieh, Rui Chen, Kay Grennan, Quan Wang, Qiang Wei, Kevin P White, Yan Xia, Lide Han, Chao Chen

doi:10.1371/journal.pcbi.1007522

Abstract

Studies of complex disorders benefit from integrative analyses of multiple omics data. Yet sample mix-ups occur frequently in multi-omics studies, weakening statistical power and risking false findings. Accurately aligning sample information, genotype, and corresponding omics data is critical for integrative analyses. To address the sample mix-up problem, we developed DRAMS (https://github.com/Yi-Jiang/DRAMS) to Detect and Re-Align Mixed-up Samples. It uses a logistic regression model followed by a modified topological sorting algorithm to identify the potential true IDs based on the relationships among multi-omics data. In tests using simulated data, DRAMS performed better as more types of omics data were used or as the proportion of mix-ups decreased. Applying DRAMS to real data from the PsychENCODE BrainGVEX project, we detected and corrected 201 mix-ups (12.5% of the total data generated). Of the 21 mix-ups involving errors of racial identity, DRAMS re-assigned all data to the correct racial group in the 1000 Genomes project. After correction, the number of quantitative trait loci (QTL) (FDR<0.01) increased by an average of 1.62-fold. Using DRAMS in multi-omics studies will strengthen statistical power and improve the quality of results. Although few studies currently have multi-omics data in place, we expect such data to increase quickly, and with them the need for DRAMS.

Highlights

Investigation of complex traits and disorders can use multiple omics data to systematically explore regulatory networks and causal relationships.
Sample mix-ups inevitably happen during sample collection, processing, and data management, leading to reduced statistical power and sometimes false findings.
The goal of DRAMS is to detect and re-align mix-ups on the grounds that all omics data originating from the same individual should have matching genotypes.

Summary

Introduction

Investigation of complex traits and disorders can use multiple omics data to systematically explore regulatory networks and causal relationships. The detection and realignment of errors in data identifications (IDs) is critical to ensuring accurate findings in integrative studies; such corrections can increase statistical power and the number of positive findings [1]. For multi-omics data, the sample re-alignment procedure can generally be divided into two steps: first, to estimate genetic relatedness among the data of different omics and group together all the data of the same individual; second, to assign potential IDs to each data group. It is well known that genetic information from the same individual should be identical regardless of the omics from which it originated. Using genotype data as a mediator, data originating from the same individual can be grouped together.
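The two-step procedure described above can be illustrated with a small Python sketch. It is not the DRAMS implementation: the function names, the toy genotype vectors, and the 0.9 concordance threshold are all assumptions made for illustration, and the second step uses a simple majority vote as a stand-in for the logistic regression model and modified topological sorting that DRAMS applies.

```python
from collections import Counter, defaultdict
from itertools import combinations


def concordance(calls_a, calls_b):
    """Fraction of shared, non-missing sites where two genotype vectors agree."""
    shared = [(a, b) for a, b in zip(calls_a, calls_b)
              if a is not None and b is not None]
    if not shared:
        return 0.0
    return sum(a == b for a, b in shared) / len(shared)


def group_by_individual(genotypes, threshold=0.9):
    """Step 1: group data sets whose genotype calls are highly concordant.

    `genotypes` maps a data key such as ("RNA-seq", "S1") to a list of
    genotype calls (0/1/2, or None when missing). The threshold is an
    assumed value, not one taken from the paper.
    """
    parent = {key: key for key in genotypes}

    def find(key):
        # Union-find lookup with path halving.
        while parent[key] != key:
            parent[key] = parent[parent[key]]
            key = parent[key]
        return key

    for a, b in combinations(genotypes, 2):
        if concordance(genotypes[a], genotypes[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(list)
    for key in genotypes:
        groups[find(key)].append(key)
    return list(groups.values())


def assign_ids(groups):
    """Step 2 (simplified stand-in): label each group with its most common ID."""
    assigned = {}
    for group in groups:
        labels = Counter(sample_id for _omics, sample_id in group)
        best_id, _count = labels.most_common(1)[0]
        for key in group:
            assigned[key] = best_id
    return assigned


# Toy data: two individuals, with the ATAC-seq labels swapped between them.
person_1 = [0, 1, 2, 1, 0, 2, 1, 0]
person_2 = [2, 0, 0, 1, 2, 1, 0, 2]
toy = {
    ("WGS", "S1"): person_1,
    ("RNA-seq", "S1"): person_1,
    ("ATAC-seq", "S2"): [0, 1, 2, 1, 0, 2, None, 0],  # really person 1
    ("WGS", "S2"): person_2,
    ("RNA-seq", "S2"): person_2,
    ("ATAC-seq", "S1"): person_2,                      # really person 2
}

groups = group_by_individual(toy)
assigned = assign_ids(groups)
mixups = {key: new_id for key, new_id in assigned.items() if key[1] != new_id}
```

In this toy case `mixups` holds the two swapped ATAC-seq data sets with their re-assigned IDs. A majority vote only works when most data in a group carry the correct label, which is one reason a real tool needs a more careful ID-assignment step.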



Journal: PLOS Computational Biology	Publication Date: Apr 13, 2020
Citations: 8	License type: CC BY 4.0


