Abstract

This work is devoted to the problem of building a sample classifier based on data from microarray gene expression experiments. Two specific issues related to this are tackled in this paper: (a) selection of parameters of a classification model to ensure best generalization power, and (b) variability of expected prediction error (EPE) for new data as a function of the model parameters. A method is presented for selection of model parameters minimizing the EPE in studies where the number of samples (n) is much smaller then the number of attributes (d). Due to very unstable behaviour of the EPE in the space of model parameters, it seems essential that microarray studies involve systematic search for the right model parameters, as shown in this work.KeywordsClass predictiongene expression microarraysexpected prediction errorcross-validation

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call