Classifier design given an uncertainty class of feature distributions via regularized maximum likelihood and the incorporation of biological pathway knowledge in steady-state phenotype classification

Mohammad Shahrokh Esfahani,Jason Knight,Amin Zollanvari,Byung-Jun Yoon,Edward R Dougherty

doi:10.1016/j.patcog.2013.02.017

Abstract

Contemporary high-throughput technologies provide measurements of very large numbers of variables but often with very small sample sizes. This paper proposes an optimization-based paradigm for utilizing prior knowledge to design better performing classifiers when sample sizes are limited. We derive approximate expressions for the first and second moments of the true error rate of the proposed classifier under the assumption of two widely used models for the uncertainty classes: ε-contamination and p-point classes. The applicability of the approximate expressions is discussed by defining the problem of finding optimal regularization parameters through minimizing the expected true error. Simulation results using the Zipf model show that the proposed paradigm yields improved classifiers that outperform traditional classifiers which use only training data. Our application of interest involves discrete gene regulatory networks possessing labeled steady-state distributions. Given prior operational knowledge of the process, our goal is to build a classifier that can accurately label future observations obtained in the steady state by utilizing both the available prior knowledge and the training data. We examine the proposed paradigm on networks containing NF-κB pathways, where it shows significant improvement in classifier performance over the classical data-only approach to classifier design. Companion website: http://gsp.tamu.edu/Publications/supplementary/shahrokh12a.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classifier design given an uncertainty class of feature distributions via regularized maximum likelihood and the incorporation of biological pathway knowledge in steady-state phenotype classification

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Mar 7, 2013
Citations: 49

Similar Papers

Designing enhanced classifiers using prior process knowledge: Regularized maximum-likelihood
Mohammad Shahrokh Esfahani ... Amin Zollanvari
-
Mohammad Shahrokh Esfahani, et. al.Mohammad Shahrokh Esfahani ... Amin Zollanvari
01 Dec 2011
01 Dec 2011

An Optimization-Based Framework for the Transformation of Incomplete Biological Knowledge into a Probabilistic Structure and Its Application to the Utilization of Gene/Protein Signaling Pathways in Discrete Phenotype Classification.
Mohammad Shahrokh Esfahani ... Edward R Dougherty
IEEE/ACM transactions on computational biology and bioinformatics | VOL. 12
Mohammad Shahrokh Esfahani, et. al.Mohammad Shahrokh Esfahani ... Edward R Dougherty
01 Nov 2015
IEEE/ACM transactions on computational biology and bioinformatics | VOL. 12

Characterization of the Effectiveness of Reporting Lists of Small Feature Sets Relative to the Accuracy of the Prior Biological Knowledge
Chen Zhao ... Robert S Chapkin
Cancer Informatics | VOL. 9
Chen Zhao, et. al.Chen Zhao ... Robert S Chapkin
01 Jan 2009
Cancer Informatics | VOL. 9

Is cross-validation valid for small-sample microarray classification?
Ulisses M Braga-Neto ... Edward R Dougherty
Bioinformatics | VOL. 20
Ulisses M Braga-Neto, et. al.Ulisses M Braga-Neto ... Edward R Dougherty
12 Feb 2004
Bioinformatics | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classifier design given an uncertainty class of feature distributions via regularized maximum likelihood and the incorporation of biological pathway knowledge in steady-state phenotype classification

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition