Abstract
Accumulating evidence indicates that alterations of gut microbiota are associated with colorectal cancer (CRC). Therefore, the use of gut microbiota for the diagnosis of CRC has received attention. Recently, several studies have been conducted to detect the differences in the gut microbiota between healthy individuals and CRC patients using machine learning‐based gut bacterial DNA meta‐sequencing analysis, and to use this information for the development of CRC diagnostic model. However, to date, most studies had small sample sizes and/or only cross‐validated using the training dataset that was used to create the diagnostic model, rather than validated using an independent test dataset. Since machine learning‐based diagnostic models cause overfitting if the sample size is small and/or an independent test dataset is not used for validation, the reliability of these diagnostic models needs to be interpreted with caution. To circumvent these problems, here we have established a new machine learning‐based CRC diagnostic model using the gut microbiota as an indicator. Validation using independent test datasets showed that the true positive rate of our CRC diagnostic model increased substantially as CRC progressed from Stage I to more than 60% for CRC patients more advanced than Stage II when the false positive rate was set around 8%. Moreover, there was no statistically significant difference in the true positive rate between samples collected in different cities or in any part of the colorectum. These results reveal the possibility of the practical application of gut microbiota‐based CRC screening tests.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.