Feature definition in pattern recognition with small sample size

Anil K. Jain, Richard Dubes

doi:10.1016/0031-3203(78)90016-x

Abstract

Feature definition for pattern recognition systems in which few training samples are available but the number of potential features is very large has not received adequate attention. Most existing feature extraction and feature selection procedures are computationally infeasible when the number of features exceeds, say, 100, and are not even applicable when the number of features exceeds the number of patterns. The feature definition procedure proposed here partitions a large set of highly correlated features into subsets, or clusters, through hierarchical clustering. Almost any feature selection or extraction procedure, including the constrained maximum variance approach introduced here, can then be applied to each subset to obtain a single representative feature. The original set of correlated features is thus reduced to a small set of nearly uncorrelated features. The utility of this procedure is demonstrated on a speaker-identification database consisting of 20 subjects, 156 features, and 180 samples.
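The two-stage idea in the abstract (cluster correlated features, then keep one representative per cluster) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the distance 1 - |r| between features, complete linkage, the stopping threshold of 0.5, the toy data, and the function names. In particular, the representative is chosen as the cluster medoid, a simple stand-in for the constrained maximum variance approach, which the abstract does not specify.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant feature is treated as uncorrelated
    return sxy / math.sqrt(sxx * syy)


def cluster_features(columns, threshold=0.5):
    """Complete-linkage agglomerative clustering of feature columns.

    The distance between two features is 1 - |correlation| (an assumption).
    Merging stops once the closest pair of clusters is farther apart
    than `threshold`.
    """
    p = len(columns)
    dist = [[1.0 - abs(pearson(columns[i], columns[j])) for j in range(p)]
            for i in range(p)]
    clusters = [[i] for i in range(p)]
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Complete linkage: distance of the least similar pair.
                d = max(dist[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        if best[0] > threshold:
            break
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters, dist


def representative(cluster, dist):
    """Medoid: the feature with the smallest total distance to its cluster."""
    return min(cluster, key=lambda i: sum(dist[i][j] for j in cluster))


def reduce_features(columns, threshold=0.5):
    """Return the sorted indices of one representative feature per cluster."""
    clusters, dist = cluster_features(columns, threshold)
    return sorted(representative(c, dist) for c in clusters)


# Toy data: six features over eight samples, built from two underlying signals.
trend = [1, 2, 3, 4, 5, 6, 7, 8]
alternating = [1, -1, 1, -1, 1, -1, 1, -1]
columns = [
    trend,
    [2 * t + 1 for t in trend],
    [1.1, 1.9, 3.2, 3.8, 5.1, 5.9, 7.2, 7.8],
    alternating,
    [-3 * s for s in alternating],
    [1.2, -0.9, 1.1, -1.2, 0.9, -1.1, 1.0, -0.8],
]

clusters, _ = cluster_features(columns)
selected = reduce_features(columns)
print("clusters:", clusters)
print("representative feature indices:", selected)
```

Here the six correlated features fall into two clusters and one feature is retained from each. The pairwise search is cubic or worse in the number of features, so this sketch is meant only for small examples; for a feature set of the size mentioned in the abstract, a library implementation of hierarchical clustering would be the more practical choice.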


Journal: Pattern Recognition
Publication Date: Jan 1, 1978
Citations: 31

