Kernel weighted least square approach for imputing missing values of metabolomics data

Nishith Kumar,Masahiro Sugimoto,Md Aminul Hoque

doi:10.1038/s41598-021-90654-0

Abstract

Mass spectrometry is a modern and sophisticated high-throughput analytical technique that enables large-scale metabolomic analyses. It yields a high-dimensional large-scale matrix (samples × metabolites) of quantified data that often contain missing cells in the data matrix as well as outliers that originate for several reasons, including technical and biological sources. Although several missing data imputation techniques are described in the literature, all conventional existing techniques only solve the missing value problems. They do not relieve the problems of outliers. Therefore, outliers in the dataset decrease the accuracy of the imputation. We developed a new kernel weight function-based proposed missing data imputation technique that resolves the problems of missing values and outliers. We evaluated the performance of the proposed method and other conventional and recently developed missing imputation techniques using both artificially generated data and experimentally measured data analysis in both the absence and presence of different rates of outliers. Performances based on both artificial data and real metabolomics data indicate the superiority of our proposed kernel weight-based missing data imputation technique to the existing alternatives. For user convenience, an R package of the proposed kernel weight-based missing value imputation technique was developed, which is available at https://github.com/NishithPaul/tWLSA.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: May 27, 2021
Citations: 8	License type: open-access

R Discovery Prime

R Discovery Prime

Kernel weighted least square approach for imputing missing values of metabolomics data

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
Nishith Kumar ... Md Aminul Hoque
Current Bioinformatics | VOL. 14
Nishith Kumar, et. al.Nishith Kumar ... Md Aminul Hoque
06 Dec 2018
Current Bioinformatics | VOL. 14

Effect of Missing Data Imputation on Deep Learning Prediction Performance for Vesicoureteral Reflux and Recurrent Urinary Tract Infection Clinical Study.
Timur Köse ... Ahmet Keskinoğlu
BioMed Research International | VOL. 2020
Timur Köse, et. al.Timur Köse ... Ahmet Keskinoğlu
15 Jul 2020
BioMed Research International | VOL. 2020

Investigating the impact of missing data imputation techniques on battery energy management system
Mehdi Pazhoohesh ... Sara Walker
IET Smart Grid | VOL. 4
Mehdi Pazhoohesh, et. al.Mehdi Pazhoohesh ... Sara Walker
15 Feb 2021
IET Smart Grid | VOL. 4

Systematic Review on Missing Data Imputation Techniques with Machine Learning Algorithms for Healthcare
Amelia Ritahani Ismail ... Nadzurah Zainal Abidin
Journal of Robotics and Control (JRC) | VOL. 3
Amelia Ritahani Ismail, et. al.Amelia Ritahani Ismail ... Nadzurah Zainal Abidin
05 Feb 2022
Journal of Robotics and Control (JRC) | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kernel weighted least square approach for imputing missing values of metabolomics data

Abstract

Talk to us

Similar Papers

More From: Scientific Reports