Application of boosting to classification problems in chemometrics

M.H Zhang,Q.S Xu,F Daeyaert,P.J Lewi,D.L Massart

doi:10.1016/j.aca.2005.01.075

Abstract

Application of boosting to both two-class and multi-class classification problems are studied. Five real chemical data sets are used. Each data is randomly divided into two subsets, one for training and the other for prediction. For two-class classification, each data is separated into a high response level class and a low response level class according to a threshold value. As a result, three data sets, wheat data, cream data and HIV data, show that boosting using classification and regression trees (CART) as a base learner may decrease the misclassification rate in prediction with respect to using a single CART. However, boosting for green tea data indicates that overfitting may occur when boosting is applied. For the chromatographic retention data, boosting performs worse than a single CART. The cream data and the HIV data are also used for multi-class classification. Both data sets demonstrate that boosting performs better than CART in multi-classification. Variable importance analysis suggests that the improvement made by boosting may be due to the use of more variables, which give more information on special types of samples in the training data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Application of boosting to classification problems in chemometrics

Abstract

Talk to us

Similar Papers

More From: Analytica Chimica Acta

Lead the way for us

Journal: Analytica Chimica Acta	Publication Date: Mar 16, 2005
Citations: 51

Similar Papers

Multi-stage decision tree based on inter-class and inner-class margin of SVM
Mingzhu Lu ... Xizhao Wang
-
Mingzhu Lu, et. al.Mingzhu Lu ... Xizhao Wang
01 Oct 2009
01 Oct 2009

The application of nonparametric data augmentation and imputation using classification and regression trees within a large-scale panel study

-

01 Jan 2017
01 Jan 2017

Building binary-tree-based multiclass classifiers using separability measures
Ana Carolina Lorena ... André C.P.L.F De Carvalho
Neurocomputing | VOL. 73
Ana Carolina Lorena, et. al.Ana Carolina Lorena ... André C.P.L.F De Carvalho
25 Jun 2010
Neurocomputing | VOL. 73

Decision tree for modeling survival data with competing risks
Kazeem Adesina Dauda ... Sushmita Mitra
Biocybernetics and Biomedical Engineering | VOL. 39
Kazeem Adesina Dauda, et. al.Kazeem Adesina Dauda ... Sushmita Mitra
04 Jun 2019
Biocybernetics and Biomedical Engineering | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Application of boosting to classification problems in chemometrics

Abstract

Talk to us

Similar Papers

More From: Analytica Chimica Acta