Dealing with missing values in proteomics data.

Weijia Kong, Harvard Wai Hann Hui, Hui Peng, Wilson Wen Bin Goh

doi:10.1002/pmic.202200092

Abstract

Proteomics data are often plagued by missing values (MVs), which threaten the integrity of subsequent statistical analyses by reducing statistical power, introducing bias, and failing to represent the true sample. Over the years, several categories of missing value imputation (MVI) methods have been developed and adapted for proteomics data. These MVI methods rest on different prior assumptions (e.g., that the data are normally or independently distributed) and operating principles (e.g., the algorithm is built to address random missingness only), so their performance varies even on the same dataset. Achieving a satisfactory outcome therefore requires selecting a suitable MVI method. To guide this choice, we provide a decision chart that facilitates strategic consideration of datasets with different characteristics. We also draw attention to other issues that can influence MVI performance, such as the presence of confounders (e.g., batch effects); these, too, should be considered during or before MVI.
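
The point that different MVI methods can give different results on the same dataset can be seen in a minimal sketch. The example below is an illustration only, not one of the methods or the decision chart from the paper: the toy matrix, its values, and the function names `impute_row_mean` and `impute_row_min` are all hypothetical. It contrasts per-protein mean imputation with per-protein minimum imputation, the latter being a simple strategy commonly used when values are assumed to be missing because they fall below the detection limit.

```python
import numpy as np


def _fill_from_row_stat(x, row_stat):
    """Return a copy of x with each NaN replaced by its row's statistic."""
    out = x.copy()
    mask = np.isnan(out)
    # Broadcast the per-row statistic to the full shape, then pick masked cells.
    out[mask] = np.broadcast_to(row_stat, out.shape)[mask]
    return out


def impute_row_mean(x):
    """Hypothetical helper: fill NaNs with the mean of the observed values
    of the same protein (row). Rows that are entirely NaN stay NaN."""
    return _fill_from_row_stat(x, np.nanmean(x, axis=1, keepdims=True))


def impute_row_min(x):
    """Hypothetical helper: fill NaNs with the smallest observed value of
    the same protein (row). Rows that are entirely NaN stay NaN."""
    return _fill_from_row_stat(x, np.nanmin(x, axis=1, keepdims=True))


# Invented toy matrix of log-intensities: rows are proteins, columns are samples.
toy = np.array([
    [20.0, 22.0, np.nan, 24.0],
    [15.0, np.nan, 17.0, 16.0],
])

mean_filled = impute_row_mean(toy)
min_filled = impute_row_min(toy)

# The two methods disagree on every imputed cell of the same dataset.
print(mean_filled)
print(min_filled)
```

Even on this tiny matrix the two strategies fill the same gaps with different values, so any downstream statistic computed on the imputed data depends on which assumption about the missingness was made.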

Journal: PROTEOMICS	Publication Date: Nov 17, 2022
Citations: 30

