The false discovery rate for statistical pattern recognition

Clayton Scott,Rebecca Willett,Gowtham Bellala

doi:10.1214/09-ejs363

Abstract

The false discovery rate (FDR) and false nondiscovery rate (FNDR) have received considerable attention in the literature on multiple testing. These performance measures are also appropriate for classification, and in this work we develop generalization error analyses for FDR and FNDR when learning a classifier from labeled training data. Unlike more conventional classification performance measures, the empirical FDR and FNDR are not binomial random variables but rather a ratio of binomials, which introduces challenges not present in conventional formulations of the classification problem. We develop distribution-free uniform deviation bounds and apply these to obtain finite sample bounds and strong universal consistency. We also present a simulation study demonstrating the merits of variance-based bounds, which we also develop. In the context of multiple testing with FDR/FNDR, our framework may be viewed as a way to leverage training data to achieve distribution free, asymptotically optimal inference under the random effects model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Journal of Statistics	Publication Date: Jan 1, 2009
Citations: 43	License type: cc-by

R Discovery Prime

R Discovery Prime

The false discovery rate for statistical pattern recognition

Abstract

Talk to us

Similar Papers

More From: Electronic Journal of Statistics

Lead the way for us

Similar Papers

Generalization Error Analysis for FDR Controlled Classification
Clayton Scott ... Rebecca Willett
-
Clayton Scott, et. al.Clayton Scott ... Rebecca Willett
01 Aug 2007
01 Aug 2007

False discovery and false nondiscovery rates in single-step multiple testing procedures
Sanat K Sarkar
The Annals of Statistics | VOL. 34
Sanat K SarkarSanat K Sarkar
01 Feb 2006
The Annals of Statistics | VOL. 34

Optimal Rates and Tradeoffs in Multiple Testing
Maxim Rabinovich ... Aaditya Ramdas
Statistica Sinica | VOL. -
Maxim Rabinovich, et. al.Maxim Rabinovich ... Aaditya Ramdas
30 Jul 2019
Statistica Sinica | VOL. -

Large-scale multiple testing via multivariate hidden Markov models
Zhiqiang Hou ... Pengfei Wang
Communications in Statistics - Simulation and Computation | VOL. 53
Zhiqiang Hou, et. al.Zhiqiang Hou ... Pengfei Wang
02 Apr 2022
Communications in Statistics - Simulation and Computation | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The false discovery rate for statistical pattern recognition

Abstract

Talk to us

Similar Papers

More From: Electronic Journal of Statistics