Kernel-based distance metric learning for microarray data classification.

Huilin Xiong,Xue-Wen Chen

doi:10.1186/1471-2105-7-299

Abstract

BackgroundThe most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionality and small sample size, which make the task quite challenging.ResultsIn this paper, we present a modified K-nearest-neighbor (KNN) scheme, which is based on learning an adaptive distance metric in the data space, for cancer classification using microarray data. The distance metric, derived from the procedure of a data-dependent kernel optimization, can substantially increase the class separability of the data and, consequently, lead to a significant improvement in the performance of the KNN classifier. Intensive experiments show that the performance of the proposed kernel-based KNN scheme is competitive to those of some sophisticated classifiers such as support vector machines (SVMs) and the uncorrelated linear discriminant analysis (ULDA) in classifying the gene expression data.ConclusionA novel distance metric is developed and incorporated into the KNN scheme for cancer classification. This metric can substantially increase the class separability of the data in the feature space and, hence, lead to a significant improvement in the performance of the KNN classifier.

Highlights

As an important application of this novel technology, the gene expression data are used to determine and predict the state of tissue samples, which has shown to be very helpful in clinical oncology
The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels
Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionality and small sample size, which make the task quite challenging

Summary

Introduction

As an important application of this novel technology, the gene expression data are used to determine and predict the state of tissue samples, which has shown to be very helpful in clinical oncology. The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. In combination with pattern classification techniques, gene expression data can provide more reliable means to diagnose and predict various types of cancers than the traditional clinical methods. A number of methods have been applied or developed to classify microarray data [1,2,3,4,5,6]. These methods include Knearest-neighbor (KNN), boosting, linear discriminant (page number not for citation purposes)

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jun 14, 2006
Citations: 67	License type: cc-by

R Discovery Prime

R Discovery Prime

Kernel-based distance metric learning for microarray data classification.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Data-Dependent Kernel Machines for Microarray Data Classification
Huilin Xiong ... Ya Zhang
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 4
Huilin Xiong, et. al.Huilin Xiong ... Ya Zhang
01 Oct 2007
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 4

A Novel K－Nearest Neighbor Classifier Based on Adaptive Metric Formed by Features Extracted by Nonparametric Feature Extraction Model
...
-
, et. al. ...
01 Dec 2010
01 Dec 2010

Efficient model selection for regularized linear discriminant analysis
Jieping Ye ... Ravi Janardan
-
Jieping Ye, et. al.Jieping Ye ... Ravi Janardan
01 Jan 2006
01 Jan 2006

On the classification techniques in data mining for microarray data classification
Husna Aydadenta ... Adiwijaya
Journal of Physics: Conference Series | VOL. 971
Husna Aydadenta, et. al.Husna Aydadenta ... Adiwijaya
01 Mar 2018
Journal of Physics: Conference Series | VOL. 971

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kernel-based distance metric learning for microarray data classification.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics