Abstract
Starting from medical big data, this article uses data mining technology to analyze and study the pathogenic factors of lung cancer based on the lung cancer electronic medical record data from the oncology department of the authoritative third grade A hospital for many years. With respect to the processing of huge data from electronic medical records for lung cancer, traditional serial Apriori algorithm has the disadvantages of scanning database frequently, running slowly and consuming large amount of memory resources. Therefore, an improved Apriori algorithm based on MapReduce distributed computing model of Hadoop platform is proposed. The experimental cluster and lung cancer data mining experiments show that the improved Apriori algorithm has higher execution efficiency and good system scalability in dealing with lung cancer big data, and can well mine the relationship between lung cancer and pathogenic factors, which has important guiding significance for assisting the clinical diagnosis and risk prediction of lung cancer.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.