Abstract

AbstractApproximately 15% of all cancer deaths among women worldwide is due to breast cancer. Mammography is one of the most useful methods for the early detection of this disease. Over the last decade, several papers were published reporting the usage of different computer‐aided diagnosis systems using pattern recognition techniques as a second opinion to obtain a more accurate diagnosis. However, the theory of formal languages has not been explored in this field. In this context, the main contribution of this study is to present the usage of a new syntactic approach that is able to classify breast masses found in mammograms as benign or malignant. The experimental tests were performed using a dataset that contains 111 images from different sources. The grammar‐based classifiers achieved accuracy values ranging from 89% to 100% depending on the features and the model employed. Furthermore, to achieve a feature dimension reduction, a feature selection technique based on the Gini importance of each feature was employed. Additionally, we compared the obtained results with the grammar‐based classifiers to the more traditional classifiers used in this research area, such as artificial neural networks, support vector machines, k‐nearest neighbors, and random forest. The best result achieved by the grammar‐based classifiers was approximately 10% higher, in terms of accuracy, than the best results produced by the traditional classifiers, showing the strength of this grammatical approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.