Abstract

Imbalanced data is an inevitable problem in many real world problems, including bleeding detection from endoscopic videos with a fewer clinically significant examples outnumbered by normal examples. In this paper, we have presented a comprehensive analysis of six different classifier performance for different class distribution of training dataset. We have addressed two questions: 1. Is there any advantage of using a certain classifier over others? 2. For bleeding detection problem, what is the optimal range of class distribution in training data set? We have built seven different training sets with different class distributions to answer the above questions. Besides the standard performance metrics, we have defined a metric to measure the robustness of the classifiers to get the optimal range of class distribution for a certain classifier. From our experiments, we found that balanced training set yields the best performance for all classifiers. Ensemble classifiers are more robust to the variation in training dataset compared to single classifier.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.