Abstract

Background/Objectives: Since more than a decade, Ensemble methods like Bagging and Boosting have drawn great attention by the researchers aiming to improve the prediction accuracy over single classifiers Despite Some recent studies have noticed that Bagging and Boosting does not always improve the accuracy, it enhances the accuracy only if the classifier is unstable classifier. To overcome this problem, a Hybrid Ensemble Model with two phases of preprocessing is proposed in this paper and evaluated using 9 classifiers on 3 benchmark data sets of UCI Repository. Methods: In the first phase of preprocessing feature selection is performed using CFS to select the attributes highly correlated to the class and in the second phase K-means clustering algorithm is applied to remove the noisy instances. Finally, the resultant instances from the previous stages are trained with Bagging and Boosting ensembles to build the final Hybrid Ensemble classifier Model (HECM) using 10 fold cross validation. The result was evaluated using the confusion matrix and the performance measures like accuracy, kappa, mean absolute error and time to build the model. Findings: Results proved that proposed model is more efficient than the existing models and showed improved accuracy for both stable and unstable classifier ranging from 2% to 30.14% over traditional ensemble model depending upon the complexity of the algorithm.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.