Abstract

Background and objectiveThe aim of computer-aided-detection (CAD) systems for mammograms is to assist radiologists by marking region of interest (ROIs) depicting abnormalities. However, the confusing appearance of some normal tissues that visually look like masses results in a large proportion of marked ROIs with normal tissues. This paper copes with this problem and proposes a framework to reduce false positive masses detected by CAD. MethodsTo avoid the error induced by the segmentation step, we proposed a segmentation-free framework with particular attention to improve feature extraction and classification steps. We investigated for the first time in mammogram analysis, Hilbert’s image representation, Kolmogorov–Smirnov distance and maximum subregion descriptors. Then, a feature selection step is performed to select the most discriminative features. Moreover, we considered several classifiers such as Random Forest, Support Vector Machine and Decision Tree to distinguish between normal tissues and masses. Our experiments were carried out on a large dataset of 10168 ROIs (8254 normal tissues and 1914 masses) constructed from the Digital Database for Screening Mammography (DDSM). To simulate practical scenario, our normal regions are false positives asserted by a CAD system from healthy cases. ResultsThe combination of all the descriptors yields better results than each feature set used alone, and the difference is statistically significant. Besides, the feature selection steps yields a statistically significant increase in the accuracy values for the three classifiers. Finally, the random forest achieves the highest accuracy (81.09%), outperforming the SVM classifier (80.01%)) and decision tree (79.12%), but the difference is not statistically significant. ConclusionsThe accuracy of discrimination between normal and abnormal ROIs in mammograms obtained with the proposed gray level texture features sets are encouraging and comparable to these obtained with multiresolution features. Combination of several features as well as feature selection steps improve the results. To improve false positives reduction in CAD systems for breast cancer diagnosis, these features could be combined with multiresolution features.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call