Abstract

The increase in mortality rate due to life-threatening diseases has become an issue of concern in today’s world. Early detection and diagnosis of diseases thus becomes necessary to reduce the severity of their side effects. Computational intelligence techniques like rare association rule mining can be extensively used for the analysis of diseases. This paper introduces an efficient technique to identify the symptoms and risk factors for three adverse diseases: cardiovascular disease, hepatitis and breast cancer, in terms of rare association rules. Existing research on rare association rule mining is based on the notion that the entire data to be operated on is available at the onset of the mining process. The medical databases in practice may get modified over time due to the addition of new records or deletion of previous records. Moreover, the user may switch to a new threshold for generating the desired set of rare association rules when the database gets updated. A straightforward yet incompetent solution for generating the current set of rare association rules would be to re-execute the entire mining algorithm from scratch, for each modified bunch of data and updated threshold. The algorithm proposed in this study is capable of generating the new set of rare association rules from updated medical databases in a single database scan without re-executing the entire mining process. It can efficiently handle the cases of transaction insertion and deletion and also provides flexibility to the user to generate the new set of rare association rules when threshold is updated. Experimental analysis illustrates the significance of proposed approach over traditional approach of repeatedly mining the entire updated database.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call