Bayesian variable selection logistic regression with paired proteomic measurements.

Alexia Kakourou,Bart Mertens

doi:10.1002/bimj.201700182

Abstract

We explore the problem of variable selection in a case‐control setting with mass spectrometry proteomic data consisting of paired measurements. Each pair corresponds to a distinct isotope cluster and each component within pair represents a summary of isotopic expression based on either the intensity or the shape of the cluster. Our objective is to identify a collection of isotope clusters associated with the disease outcome and at the same time assess the predictive added‐value of shape beyond intensity while maintaining predictive performance. We propose a Bayesian model that exploits the paired structure of our data and utilizes prior information on the relative predictive power of each source by introducing multiple layers of selection. This allows us to make simultaneous inference on which are the most informative pairs and for which—and to what extent—shape has a complementary value in separating the two groups. We evaluate the Bayesian model on pancreatic cancer data. Results from the fitted model show that most predictive potential is achieved with a subset of just six (out of 1289) pairs while the contribution of the intensity components is much higher than the shape components. To demonstrate how the method behaves under a controlled setting we consider a simulation study. Results from this study indicate that the proposed approach can successfully select the truly predictive pairs and accurately estimate the effects of both components although, in some cases, the model tends to overestimate the inclusion probability of the second component.

Highlights

Proteomics is the large-scale study of proteins that aim to provide a better understanding of the function of cellular and disease processes at the protein level
We set the hyperparamaters of the Gamma distribution to α = β = 1 for the analysis presented in the paper, which results in a prior mean and variance of 1 for both sa and sb
We addressed the problem of isotope cluster selection through a Bayesian model formulation

Summary

Introduction

Proteomics is the large-scale study of proteins that aim to provide a better understanding of the function of cellular and disease processes at the protein level. Ultrahigh-resolution mass spectrometers (MS) such as Fourier-transform MS have become the most powerful and efficient tools for the quantitative analysis of complex protein mixtures in biological systems. In ultrahigh-resolution mass spectrometry, each species (such as peptide) is detected and expressed as a “density” of isotope peaks (as shown in Figure 1B)—rather than a single peak—in the mass spectrum, resulting from the distribution of naturally occurring elements.

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biometrical Journal	Publication Date: Jun 25, 2018
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Bayesian variable selection logistic regression with paired proteomic measurements.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biometrical Journal

Lead the way for us

Similar Papers

Variable Selection for Confounding Adjustment in High-dimensional Covariate Spaces When Analyzing Healthcare Databases.
Sebastian Schneeweiss ... Elisabetta Patorno
Epidemiology | VOL. 28
Sebastian Schneeweiss, et. al.Sebastian Schneeweiss ... Elisabetta Patorno
01 Mar 2017
Epidemiology | VOL. 28

Chapter 7 - Scalable Bayesian variable selection regression models for count data
Yinsen Miao ... Marina Vannucci
Flexible Bayesian Regression Modelling | VOL. -
Yinsen Miao, et. al.Yinsen Miao ... Marina Vannucci
01 Jan 2020
Flexible Bayesian Regression Modelling | VOL. -

Bayesian multilevel logistic regression models: a case study applied to the results of two questionnaires administered to university students
Cristian David Correa-Álvarez ... Luis Raúl Pericchi-Guerra
Computational Statistics | VOL. 38
Cristian David Correa-Álvarez, et. al.Cristian David Correa-Álvarez ... Luis Raúl Pericchi-Guerra
25 Oct 2022
Computational Statistics | VOL. 38

Evaluation of Bayesian Hui-Walter and logistic regression latent class models to estimate diagnostic test characteristics with simulated data.
Haifang Ni ... Irene Klugkist
Preventive Veterinary Medicine | VOL. 217
Haifang Ni, et. al.Haifang Ni ... Irene Klugkist
01 Aug 2023
Preventive Veterinary Medicine | VOL. 217

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian variable selection logistic regression with paired proteomic measurements.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biometrical Journal