Knowledge-Guided Bayesian Support Vector Machine for High-Dimensional Data with Application to Analysis of Genomics Data.

Wenli Sun,Changgee Chang,Qi Long,Yize Zhao

doi:10.1109/bigdata.2018.8622484

Abstract

Support vector machine (SVM) is a popular classification method for the analysis of wide range of data including big data. Many SVM methods with feature selection have been developed under frequentist regularization or Bayesian shrinkage frameworks. On the other hand, the importance of incorporating a priori known biological knowledge, such as gene pathway information which stems from the gene regulatory network, into the statistical analysis of genomic data has been recognized in recent years. In this article, we propose a new Bayesian SVM approach that enables the feature selection to be guided by the knowledge on the graphical structure among predictors. The proposed method uses the spike-and-slab prior for feature selection, combined with the Ising prior that encourages group-wise selection of the predictors adjacent to each other on the known graph. Gibbs sampling algorithm is used for Bayesian inference. The performance of our method is evaluated and compared with existing SVM methods in terms of prediction and feature selection in extensive simulation settings. In addition, our method is illustrated in the analysis of genomic data from a cancer study, demonstrating its advantage in generating biologically meaningful results and identifying potentially important features.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Knowledge-Guided Bayesian Support Vector Machine for High-Dimensional Data with Application to Analysis of Genomics Data.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data

Lead the way for us

Journal: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data	Publication Date: Dec 1, 2018
Citations: 36

Similar Papers

Bayesian Non-linear Support Vector Machine for High-Dimensional Data with Incorporation of Graph Information on Features.
Wenli Sun ... Changgee Chang
Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data | VOL. 2019
Wenli Sun, et. al.Wenli Sun ... Changgee Chang
01 Dec 2019
Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data | VOL. 2019

Graph-guided Bayesian SVM with Adaptive Structured Shrinkage Prior for High-dimensional Data.
Wenli Sun ... Changgee Chang
Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data | VOL. 2021
Wenli Sun, et. al.Wenli Sun ... Changgee Chang
15 Dec 2021
Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data | VOL. 2021

Joint Bayesian Variable Selection and Graph Estimation for Non-linear SVM with Application to Genomics Data
Wenli Sun ... Qi Long
-
Wenli Sun, et. al.Wenli Sun ... Qi Long
01 Oct 2020
01 Oct 2020

Proactive visual and statistical analysis of genomic data in Epiviz.
Zhe Cui ... John Hancock
Computer applications in the biosciences : CABIOS | VOL. 36
Zhe Cui, et. al.Zhe Cui ... John Hancock
29 Nov 2019
Computer applications in the biosciences : CABIOS | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Knowledge-Guided Bayesian Support Vector Machine for High-Dimensional Data with Application to Analysis of Genomics Data.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data