The high-dimension, low-sample-size geometric representation holds under mild conditions

J Ahn,Y.-Y Chi,K M Muller,J S Marron

doi:10.1093/biomet/asm050

Abstract

SUMMARY High-dimension, low-small-sample size datasets have different geometrical properties from those of traditional low-dimensional data. In their asymptotic study regarding increasing dimensionality with a fixed sample size, Hall et al. (2005) showed that each data vector is approximately located on the vertices of a regular simplex in a high-dimensional space. A perhaps unappealing aspect of their result is the underlying assumption which requires the variables, viewed as a time series, to be almost independent. We establish an equivalent geometric representation under much milder conditions using asymptotic properties of sample covariance matrices. We discuss implications of the results, such as the use of principal component analysis in a high-dimensional space, extension to the case of nonindependent samples and also the binary classification problem.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The high-dimension, low-sample-size geometric representation holds under mild conditions

Abstract

Talk to us

Similar Papers

More From: Biometrika

Lead the way for us

Journal: Biometrika	Publication Date: Aug 5, 2007
Citations: 145

Similar Papers

Improved time series clustering based on new geometric frameworks
Clément Péalat ... Vincent Cheutet
Pattern Recognition | VOL. 124
Clément Péalat, et. al.Clément Péalat ... Vincent Cheutet
09 Nov 2021
Pattern Recognition | VOL. 124

A Large-scale Dynamic Vector and Raster Data Visualization Geographic Information System Based on Parallel Map Tiling
Huan Wang
-
Huan WangHuan Wang
12 Apr 2012
12 Apr 2012

Improvement of PCA-Based Approximate Nearest Neighbor Search Using Distance Statistics
Toshiro Ogita ... Akira Notsu
Journal of Advanced Computational Intelligence and Intelligent Informatics | VOL. 18
Toshiro Ogita, et. al.Toshiro Ogita ... Akira Notsu
20 Jul 2014
Journal of Advanced Computational Intelligence and Intelligent Informatics | VOL. 18

Hilbert-Schmidt and Sobol sensitivity indices for static and time series Wnt signaling measurements in colorectal cancer - part A
Shriprakash Sinha
BMC Systems Biology | VOL. 11
Shriprakash SinhaShriprakash Sinha
01 Dec 2017
BMC Systems Biology | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The high-dimension, low-sample-size geometric representation holds under mild conditions

Abstract

Talk to us

Similar Papers

More From: Biometrika