Abstract
Most of the classification methods assume that the numbers of class observations are balanced. In such cases, models are predicted by giving biased weight to the the class with more observations. Therefore, the classifiers ignore the class with smaller number of observations and the majority class makes biased predictions. There are some advised performance measures to be used in datasets, as well as recommended approaches to solve class imbalance problem. One of the most widely used methods is resampling method. In this study, the difficulties relevant to random oversampling (ROS) and synthetic minority oversampling technique (SMOTE), which are some of the oversampling methods, are discussed. This study aims to propose a combination of a new noise detection method and SMOTE to overcome those difficulties. Using the boosting procedure in ensemble algorithms, noise detection is possible with the proposed SMOTE with boosting (SMOTEWB) method, which makes use of this information to determine the appropriate number of neighbors for each observation within SMOTE algorithm.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.