Abstract

We introduce a very general method for high-dimensional classification, based on careful combination of the results of applying an arbitrary base classifier to random projections of the feature vectors into a lower-dimensional space. In one special case that we study in detail, the random projections are divided into disjoint groups, and within each group we select the projection yielding the smallest estimate of the test error. Our random-projection ensemble classifier then aggregates the results of applying the base classifier to the selected projections, with a data-driven voting threshold to determine the final assignment. Our theoretical results elucidate the effect on performance of increasing the number of projections. Moreover, under a boundary condition that is implied by the sufficient dimension reduction assumption, we show that the test excess risk of the random-projection ensemble classifier can be controlled by terms that do not depend on the original data dimension, together with a term that becomes negligible as the number of projections increases. The classifier is also compared empirically with several other popular high-dimensional classifiers via an extensive simulation study, which reveals its excellent finite-sample performance.
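To make the recipe above concrete, here is a minimal Python sketch under simplifying assumptions: binary labels in {0, 1}, scaled Gaussian projection matrices (the paper itself draws projections from Haar measure), an LDA base classifier, and a single holdout split to estimate each projection's test error. The function name rp_ensemble and all parameter defaults are ours, for illustration only.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def rp_ensemble(X, y, X_test, B1=50, B2=20, d=5, alpha=0.5, seed=0):
    """Random-projection ensemble with an LDA base classifier (toy sketch).

    B1 groups of B2 random projections from R^p to R^d are drawn; within
    each group the projection with the smallest holdout error estimate is
    retained, and the B1 retained classifiers vote on each test point.
    Assumes binary labels in {0, 1}.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    # One holdout split, reused to estimate every projection's test error.
    idx = rng.permutation(n)
    tr, va = idx[: n // 2], idx[n // 2:]
    votes = np.zeros(len(X_test))
    for _ in range(B1):
        best_err, best_clf, best_A = np.inf, None, None
        for _ in range(B2):
            # Scaled Gaussian matrix: a simple stand-in for a Haar projection.
            A = rng.normal(size=(p, d)) / np.sqrt(d)
            clf = LinearDiscriminantAnalysis().fit(X[tr] @ A, y[tr])
            err = 1.0 - clf.score(X[va] @ A, y[va])
            if err < best_err:
                best_err, best_clf, best_A = err, clf, A
        votes += best_clf.predict(X_test @ best_A)
    # Assign class 1 when the vote fraction exceeds the threshold alpha.
    return (votes / B1 > alpha).astype(int)
```

Only the best projection in each group of B2 contributes a vote, so poor projections are filtered out before aggregation; increasing B1 then stabilizes the ensemble vote.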

Highlights

  • Supervised classification concerns the task of assigning an object to one of two or more groups, on the basis of a sample of labelled training data.

  • Another key feature of our proposal is the realization that a simple majority vote of the classifications based on the retained projections can be highly suboptimal; instead, we argue that the voting threshold should be chosen in a data-driven fashion, in an attempt to minimize the test error of the infinite-simulation version of our random-projection ensemble classifier (a toy sketch of such a threshold choice follows this list).

  • For comparison, we present the corresponding results of applying, where possible, the three base classifiers (LDA, quadratic discriminant analysis (QDA) and knn) in the original p-dimensional space, alongside 11 other classification methods chosen to represent the state of the art.
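The second highlight argues for a data-driven voting threshold rather than a fixed majority vote at 1/2. Below is a hedged sketch of one simple way to choose it: scan a grid of thresholds and keep the empirical-error minimizer. The authors' actual rule balances class-conditional error estimates rather than minimizing the raw misclassification rate, so this plain grid search is only a stand-in; choose_threshold and vote_frac are hypothetical names.

```python
import numpy as np

def choose_threshold(vote_frac, y, n_grid=101):
    """Grid-search the voting threshold instead of fixing a majority vote.

    vote_frac[i] is the fraction of the retained projections voting class 1
    for training point i (ideally computed out-of-sample); we return the
    threshold minimizing the empirical misclassification rate over a grid.
    """
    grid = np.linspace(0.0, 1.0, n_grid)
    errors = [np.mean((vote_frac > a).astype(int) != y) for a in grid]
    return grid[int(np.argmin(errors))]
```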



Introduction

Supervised classification concerns the task of assigning an object (or a number of objects) to one of two or more groups, on the basis of a sample of labelled training data. The problem was first studied in generality in the famous work of Fisher (1936), who introduced some of the ideas of linear discriminant analysis (LDA) and applied them to his iris data set. Classification problems arise in a plethora of applications, including spam filtering, fraud detection, medical diagnosis, market research, natural language processing and many others. Alternatives to LDA include support vector machines (SVMs) (Cortes and Vapnik, 1995), tree classifiers and random forests (RFs) (Breiman et al., 1984; Breiman, 2001), kernel methods (Hall and Kang, 2005) and nearest neighbour classifiers (Fix and Hodges, 1951). More substantial overviews and detailed discussions of these techniques, and others, can be found in Devroye et al. (1996) and Hastie et al. (2009).
