Abstract

While the 5-year survival rate of endometrial cancer is relatively high (82%), the incidence and death rates are rising. Currently screening methods (e.g. endometrial biopsy, pelvic ultrasound) are primarily used for women with a genetic predisposition (e.g. Lynch syndrome). So far, there is no effective way to conduct population- based screening. It is therefore crucial to have a risk stratification tool, based on comprehensive personal health data, which can identify high risk individuals who would benefit from screening. Such a stratification scheme could also help direct preventative interventions for individuals. In this study we developed a machine learning model capable of accurately predicting endometrial cancer risk from personal health data. We used the following inputs: age, BMI and weight chance, race, smoking habits, diabetes, emphysema, stroke, hypertension, heart disease, arthritis, another cancer, family history of cancer, gynecological surgeries, menarche age, parity, use of birth control, and age at menopause extracted from the Prostate, Lung, Colorectal, Ovarian Cancer Screening Trail data. Participants in the trial were followed for 13 years or until cancer diagnosis with 78,215 women participants, 952 of whom developed endometrial cancer within 5 years of enrolling. Splitting the data into training (70%) and testing (30%) sets we tested logistic regression, decision tree, random forest, linear discriminant analysis, support vector machine, naive Bayes, and neural networks (NN). The top model was then used to stratify endometrial cancer risk into low, medium, and high risk categories. The NN outperformed the other methods with a test AUC of 0.88. Using the NN, we classified 57.2% of those who developed cancer within 5 years as high risk, 41.8% as medium risk, and 1.1% as low risk. For those who did not develop cancer within 5 years we classified 0.9%, 71.0%, and 28.2% as high, medium, and low risk, respectively.Abstract 2304; Table 1# Low Risk% Low Risk# Medium Risk% Medium Risk# High Risk% High RiskCancer31.1%11941.8%16357.2%Non-Cancer6,52728.2%16,45471.0%1970.9%Risk stratification results on the test set by our NN. Open table in a new tab Risk stratification results on the test set by our NN. Our results indicate that the use of a NN based on personal health information can accurately discriminate between those at high risk of developing endometrial cancer and those who are not, offering a cost-effective and non-invasive way to stratify endometrial cancer risk for targeted screening and prevention.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.