Exact Performance of CoD Estimators in Discrete Prediction

Ting Chen,Ulisses Braga-Neto

doi:10.1155/2010/487893

Abstract

The coefficient of determination (CoD) has significant applications in genomics, for example, in the inference of gene regulatory networks. We study several CoD estimators, based upon the resubstitution, leave-one-out, cross-validation, and bootstrap error estimators. We present an exact formulation of performance metrics for the resubstitution and leave-one-out CoD estimators, assuming the discrete histogram rule. Numerical experiments are carried out using a parametric Zipf model, where we compute exact performance metrics of resubstitution and leave-one-out CoD estimators using the previously derived equations, for varying actual CoD, sample size, and bin size. These results are compared to approximate performance metrics of 10-repeated 2-fold cross-validation and 0.632 bootstrap CoD estimators, computed via Monte Carlo sampling. The numerical results lead to a perhaps surprising conclusion: under the Zipf model under consideration, and for moderate and large values of the actual CoD, the resubstitution CoD estimator is the least biased and least variable among all CoD estimators, especially at small number of predictors. We also observed that the leave-one-out and cross-validation CoD estimators tend to perform the worst, whereas the performance of the bootstrap CoD estimator is intermediary, despite its high computational complexity.

Highlights

The coefficient of determination (CoD) has significant applications in genomics, for example, in the inference of gene regulatory networks
Numerical experiments are carried out using a parametric Zipf model, where we compute the exact performance of resubstitution and leave-one-out CoD estimators using the previously derived equations, for varying actual CoD, sample size, and bin size
The leave-one-out and cross-validation CoD estimator tend to perform the worst whereas the performance of the bootstrap CoD estimator is intermediary, despite its high computational complexity. This indicates that provided one has evidence of moderate to tight regulation between the genes, and the number of predictors is not too large, one should use the CoD estimator based on resubstitution

Summary

Introduction

The coefficient of determination (CoD) has significant applications in genomics, for example, in the inference of gene regulatory networks. Numerical experiments are carried out using aparametric Zipf model, where we compute exact performance metrics of resubstitution and leave-oneout CoD estimators using the previously derived equations, for varying actual CoD, sample size, and bin size. These results are compared to approximate performance metrics of10-repeated 2-fold cross-validation and 0.632 bootstrap CoD estimators, computed via Monte Carlo sampling. Numerical experiments are carried out using a parametric Zipf model, where we compute the exact performance of resubstitution and leave-one-out CoD estimators using the previously derived equations, for varying actual CoD, sample size, and bin size We compare these results to approximate performance metrics of randomized CoD estimators (bootstrap and cross-validation), computed via Monte Carlo sampling.

Discrete Prediction

CoD Estimation

Performance Metrics of CoD Estimators

Exact Moments of Nonrandomized CoD Estimators

Numerical Experiments

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Advances in Signal Processing	Publication Date: Jul 27, 2010
Citations: 15	License type: cc-by

R Discovery Prime

R Discovery Prime

Exact Performance of CoD Estimators in Discrete Prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing

Lead the way for us

Similar Papers

Optimal Bayesian MMSE estimation of the coefficient of determination for discrete prediction
Ting Chen ... Ulisses Braga-Neto
-
Ting Chen, et. al.Ting Chen ... Ulisses Braga-Neto
01 Nov 2013
01 Nov 2013

Inference of Gene Regulatory Networks Using Coefficient of Determination, Tsallis Entropy and Biological Prior Knowledge
Camila Y Koike ... Carlos H A Higa
-
Camila Y Koike, et. al.Camila Y Koike ... Carlos H A Higa
01 Oct 2016
01 Oct 2016

Incorporating and Generating Prior Knowledge to Improve Gene Regulatory Network Inference

-

17 Sep 2017
17 Sep 2017

Data Integration of Hybrid Microarray and Single Cell Expression Data to Enhance Gene Network Inference
Wei Zhang ... Jianming Zhang
Current Bioinformatics | VOL. 14
Wei Zhang, et. al.Wei Zhang ... Jianming Zhang
07 Mar 2019
Current Bioinformatics | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exact Performance of CoD Estimators in Discrete Prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing