Evaluation of normalization methods for cDNA microarray data by k-NN classification

Wei Wu,Eric P Xing,I Saira Mian,Mina J Bissell,Connie Myers

doi:10.1186/1471-2105-6-191

Abstract

BackgroundNon-biological factors give rise to unwanted variations in cDNA microarray data. There are many normalization methods designed to remove such variations. However, to date there have been few published systematic evaluations of these techniques for removing variations arising from dye biases in the context of downstream, higher-order analytical tasks such as classification.ResultsTen location normalization methods that adjust spatial- and/or intensity-dependent dye biases, and three scale methods that adjust scale differences were applied, individually and in combination, to five distinct, published, cancer biology-related cDNA microarray data sets. Leave-one-out cross-validation (LOOCV) classification error was employed as the quantitative end-point for assessing the effectiveness of a normalization method. In particular, a known classifier, k-nearest neighbor (k-NN), was estimated from data normalized using a given technique, and the LOOCV error rate of the ensuing model was computed. We found that k-NN classifiers are sensitive to dye biases in the data. Using NONRM and GMEDIAN as baseline methods, our results show that single-bias-removal techniques which remove either spatial-dependent dye bias (referred later as spatial effect) or intensity-dependent dye bias (referred later as intensity effect) moderately reduce LOOCV classification errors; whereas double-bias-removal techniques which remove both spatial- and intensity effect reduce LOOCV classification errors even further. Of the 41 different strategies examined, three two-step processes, IGLOESS-SLFILTERW7, ISTSPLINE-SLLOESS and IGLOESS-SLLOESS, all of which removed intensity effect globally and spatial effect locally, appear to reduce LOOCV classification errors most consistently and effectively across all data sets. We also found that the investigated scale normalization methods do not reduce LOOCV classification error.ConclusionUsing LOOCV error of k-NNs as the evaluation criterion, three double-bias-removal normalization strategies, IGLOESS-SLFILTERW7, ISTSPLINE-SLLOESS and IGLOESS-SLLOESS, outperform other strategies for removing spatial effect, intensity effect and scale differences from cDNA microarray data. The apparent sensitivity of k-NN LOOCV classification error to dye biases suggests that this criterion provides an informative measure for evaluating normalization methods. All the computational tools used in this study were implemented using the R language for statistical computing and graphics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jan 1, 2005
Citations: 55	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Evaluation of normalization methods for cDNA microarray data by k-NN classification

Abstract

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

A composite three-stage normalization method for cDNA microarray data
Mingjie Ma ... Jinsoo Hwang
-
Mingjie Ma, et. al.Mingjie Ma ... Jinsoo Hwang
01 Dec 2010
01 Dec 2010

Characterizing dye bias in microarray experiments
K K Dobbin ... E S Kawasaki
Bioinformatics | VOL. 21
K K Dobbin, et. al.K K Dobbin ... E S Kawasaki
17 Mar 2005
Bioinformatics | VOL. 21

New normalization methods using support vector machine quantile regression approach in microarray analysis
Insuk Sohn ... Jae Won Lee
Computational Statistics & Data Analysis | VOL. 52
Insuk Sohn, et. al.Insuk Sohn ... Jae Won Lee
15 Feb 2008
Computational Statistics & Data Analysis | VOL. 52

A new approach to intensity-dependent normalization of two-channel microarrays
A R Dabney ... J D Storey
Biostatistics | VOL. 8
A R Dabney, et. al.A R Dabney ... J D Storey
24 Apr 2006
Biostatistics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of normalization methods for cDNA microarray data by k-NN classification

Abstract

Talk to us

Similar Papers

More From: BMC Bioinformatics