Abstract

It is non-trivial to select the appropriate prediction technique from the variety of existing techniques for a given dataset, since the competitive evaluation of techniques (bagging, boosting, stacking and meta-learning) can be time-consuming. This paper compares five predictive data mining techniques on four unique datasets that have a combination of the following characteristics: few predictor variables, many predictor variables, highly collinear variables, very redundant variables and the presence of outliers. The data mining techniques applied to each dataset are multiple linear regression (MLR), principal component regression (PCR), ridge regression, partial least squares (PLS) and non-linear partial least squares (NLPLS). The comparisons are based on several criteria: R-square, adjusted R-square, mean square error (MSE), mean absolute error (MAE), coefficient of efficiency, condition number (CN) and the number of variables or features included in the model. The advantages and disadvantages of the techniques are discussed and summarised.
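
As a concrete illustration (not taken from the paper), the sketch below shows how such a comparison could be set up with scikit-learn. The dataset, train/test split, ridge penalty and number of PCR/PLS components are illustrative assumptions, and NLPLS is omitted because scikit-learn has no built-in non-linear PLS implementation.

```python
# Minimal sketch: fit four of the five techniques and compute the comparison
# criteria named in the abstract. All hyperparameters are illustrative.
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.cross_decomposition import PLSRegression
from sklearn.metrics import mean_squared_error, mean_absolute_error, r2_score

X, y = load_diabetes(return_X_y=True)  # stand-in dataset
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

models = {
    "MLR":   LinearRegression(),
    "PCR":   make_pipeline(StandardScaler(), PCA(n_components=5), LinearRegression()),
    "Ridge": make_pipeline(StandardScaler(), Ridge(alpha=1.0)),
    "PLS":   PLSRegression(n_components=5),
}

# Condition number of the standardised design matrix, a gauge of collinearity.
cn = np.linalg.cond(StandardScaler().fit_transform(X_train))
print(f"condition number of X: {cn:.1f}")

for name, model in models.items():
    model.fit(X_train, y_train)
    y_pred = np.ravel(model.predict(X_test))
    n, p = X_test.shape
    r2 = r2_score(y_test, y_pred)                  # equals the coefficient of efficiency
    r2_adj = 1 - (1 - r2) * (n - 1) / (n - p - 1)  # penalises the number of predictors
    mse = mean_squared_error(y_test, y_pred)
    mae = mean_absolute_error(y_test, y_pred)
    print(f"{name:6s} R2={r2:.3f} R2_adj={r2_adj:.3f} MSE={mse:.1f} MAE={mae:.1f}")
```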
