Abstract

In the literature, several methods explored to analyze breast cancer dataset have failed to sufficiently handle quantitative attribute sharp boundary problem to resolve inter and intra uncertainties in breast cancer dataset analysis. In this study an Interval Type-2 fuzzy association rule mining approach is proposed for pattern discovery in breast cancer dataset. In the first part of this analysis, the interval Type-2 fuzzification of the breast cancer dataset is carried out using Hao and Mendel approach. In the second part, FP-growth algorithm is adopted for associative pattern discovery from the fuzzified dataset from the first part. To define the intuitive words for breast cancer determinant factors and expert data interval, thirty (30) medical experts from specialized hospitals were consulted through questionnaire poling method. To establish the adequacy of the linguistic word defined by the expert, Jaccard similarity measure is used. This analysis is able to discover associative rules with minimum number of symptoms at confidence values as high as 91%. It also identifies High Bare Nuclei and High Uniformity of Cell Shape as strong determinant factors for diagnosing breast cancer. The proposed approach performed better in terms of rules generated when compared with traditional quantitative association rule mining. It is able to eliminate redundant rules which reduce the number of generated rules by 39.5% and memory usage by 22.6%. The discovered rules are viable in building a comprehensive and compact expert driven knowledge�base for breast cancer decision support or expert system

Highlights

  • In today’s information age, medical databases are increasing rapidly with a large number of quantitative attributes

  • The interval type-2 fuzzy set: HM approach was used to model the inevitability of uncertainties of the medical experts that gave their respective ranges on each of the breast cancer determinant factors

  • Hao and Mendel approach was adopted for interval Type-2 data fuzzification while FP-growth algorithm was explored for pattern discovery in breast cancer

Read more

Summary

Introduction

In today’s information age, medical databases are increasing rapidly with a large number of quantitative attributes. Analyzing this dataset is crucial for enhancing medical decision making and management (Delgado et al, 2001). Detection of breast cancer incidence has been confirmed to increase the survival rate and reduce death rate (Ed-daoudy and Maalmi, 2020). According to (Yeh et al, 2009) around 97% of women can survive for 5 years or more due to earlier diagnosis and improved treatment This fact was further buttressed by American Cancer Society’s 2018 report. The report highlighted the decline death rate of about 39% from 1989 to 2015 by breast cancer due to early detection. The statistics above shows that early detection of breast cancer is vital to decreased life crisis

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call