Abstract

<p><strong>Aim <br /></strong>To assess machine-learning models, their methodological quality, compare their performance, and highlight their limitations.<br /><strong>Methods</strong> <br />The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) recommendations were applied. Electronic databases Science Direct, MEDLINE through (PubMed, Google Scholar), EBSCO, ERIC, and CINAHL were<br />searched for the period of January 2016 to September 2023. Using a pre-designed data extraction sheet, the review data were extracted. Big data, risk assessment, colorectal cancer, and artificial intelligence were the main terms.<br /><strong>Results</strong> <br />Fifteen studies were included. A total of 3,057,329 colorectal cancer (CRC) health records, including those of adult patients older than 18, were used to generate the results. The curve's area under the curve ranged from 0.704 to 0.976. Logistic regression, random forests, and colon flag were often employed techniques. Overall, these trials provide a considerable and accurate CRC risk prediction.<br /><strong>Conclusion</strong> <br />An up-to-date summary of recent research on the use of big data in CRC prediction was given. Future research can be<br />facilitated by the review's identification of gaps in the literature. Missing data, a lack of external validation, and the diversity of<br />machine learning algorithms are the current obstacles. Despite having a sound mathematical definition, area under the curve application depends on the modelling context. </p>

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.