Distribution In Feature Space Research Articles

The small number of samples available for training and testing is often the limiting factor in finding the most effective features and designing an optimal computer-aided diagnosis (CAD) system. Training on a limited set of samples introduces bias and variance in the performance of a CAD system relative to that trained with an infinite sample size. In this work, the authors conducted a simulation study to evaluate the performances of various combinations of classifiers and feature selection techniques and their dependence on the class distribution, dimensionality, and the training sample size. The understanding of these relationships will facilitate development of effective CAD systems under the constraint of limited available samples. Three feature selection techniques, the stepwise feature selection (SFS), sequential floating forward search (SFFS), and principal component analysis (PCA), and two commonly used classifiers, Fisher's linear discriminant analysis (LDA) and support vector machine (SVM), were investigated. Samples were drawn from multidimensional feature spaces of multivariate Gaussian distributions with equal or unequal covariance matrices and unequal means, and with equal covariance matrices and unequal means estimated from a clinical data set. Classifier performance was quantified by the area under the receiver operating characteristic curve Az. The mean Az values obtained by resubstitution and hold-out methods were evaluated for training sample sizes ranging from 15 to 100 per class. The number of simulated features available for selection was chosen to be 50, 100, and 200. It was found that the relative performance of the different combinations of classifier and feature selection method depends on the feature space distributions, the dimensionality, and the available training sample sizes. The LDA and SVM with radial kernel performed similarly for most of the conditions evaluated in this study, although the SVM classifier showed a slightly higher hold-out performance than LDA for some conditions and vice versa for other conditions. PCA was comparable to or better than SFS and SFFS for LDA at small samples sizes, but inferior for SVM with polynomial kernel. For the class distributions simulated from clinical data, PCA did not show advantages over the other two feature selection methods. Under this condition, the SVM with radial kernel performed better than the LDA when few training samples were available, while LDA performed better when a large number of training samples were available. None of the investigated feature selection-classifier combinations provided consistently superior performance under the studied conditions for different sample sizes and feature space distributions. In general, the SFFS method was comparable to the SFS method while PCA may have an advantage for Gaussian feature spaces with unequal covariance matrices. The performance of the SVM with radial kernel was better than, or comparable to, that of the SVM with polynomial kernel under most conditions studied.

Read full abstract

Conventional approaches to training a supervised image classification aim to fully describe all of the classes spectrally. To achieve a complete description of each class in feature space, a large training set is typically required. It is not, however, always necessary to have training statistics that provide a complete and representative description of the classes, especially if using nonparametric classifiers. For classification by a support vector machine, only the training samples that are support vectors, which lie on part of the edge of the class distribution in feature space, are required; all other training samples provide no contribution to the classification analysis. If regions likely to furnish support vectors can be identified in advance of the classification, it may be possible to intelligently select useful training samples. The ability to target useful training samples may allow accurate classification from small training sets. This potential for intelligent training sample collection was explored for the classification of agricultural crops from multispectral satellite sensor data. With a conventional approach to training, only a quarter of the training samples acquired actually made a positive contribution to the analysis and allowed the crops to be classified to a high accuracy (92.5%). The majority of the training set, therefore, was unnecessary as it made no contribution to the analysis. Using ancillary information on soil type, however, it would be possible to constrain the training sample acquisition process. By limiting training sample acquisition only to regions with a specific soil type, it was possible to use a small training set to classify the data without loss of accuracy. Thus, a small number of intelligently selected training samples may be used to classify a data set as accurately as a larger training set derived in a conventional manner. The results illustrate the potential to direct training data acquisition strategies to target the most useful training samples to allow efficient and accurate image classification.

Read full abstract

Distribution In Feature Space Research Articles

Articles published on Distribution In Feature Space

Semi-supervised behavioral learning and its application

Domain Invariant Transfer Kernel Learning

Semi-Supervised Learning by Local Behavioral Searching Strategy

Large margin aggregation of local estimates for medical image classification.

Human cognitive paradigm and its application in semi-supervised learning

Feature evaluation of radar signal based on aggregation, discreteness and divisibility

A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback

Remote Sensing Image Features Estimating Model Based onRobust Statistical Theory and Its Application

Methodology Multiclass microarray data classification based on confidence evaluation

A hierarchical naive Bayesian network classifier embedded GMM for textural image

Bridging Domains Using World Wide Knowledge for Transfer Learning

Effect of finite sample size on feature selection and classification: A simulation study

An elliptical basis function network for classification of remote sensing images

Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification

KNOWLEDGE-GUIDED CLASSIFICATION OF COASTAL ZONE COLOR IMAGES OFF THE WEST FLORIDA SHELF

Generalization properties of finite-size polynomial support vector machines

On the Distribution and Convergence of Feature Space in Self-Organizing Maps

Knowledge-based classification and tissue labeling of MR images of human brain

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Distribution In Feature Space Research Articles

Articles published on Distribution In Feature Space

Semi-supervised behavioral learning and its application

Domain Invariant Transfer Kernel Learning

Semi-Supervised Learning by Local Behavioral Searching Strategy

Large margin aggregation of local estimates for medical image classification.

Human cognitive paradigm and its application in semi-supervised learning

Feature evaluation of radar signal based on aggregation, discreteness and divisibility

A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback

Remote Sensing Image Features Estimating Model Based onRobust Statistical Theory and Its Application

Methodology Multiclass microarray data classification based on confidence evaluation

A hierarchical naive Bayesian network classifier embedded GMM for textural image

Bridging Domains Using World Wide Knowledge for Transfer Learning

Effect of finite sample size on feature selection and classification: A simulation study

An elliptical basis function network for classification of remote sensing images

Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification

KNOWLEDGE-GUIDED CLASSIFICATION OF COASTAL ZONE COLOR IMAGES OFF THE WEST FLORIDA SHELF

Generalization properties of finite-size polynomial support vector machines

On the Distribution and Convergence of Feature Space in Self-Organizing Maps

Knowledge-based classification and tissue labeling of MR images of human brain