Abstract

The ability to predict the individual outcomes of clinical trials could support the development of tools for precision medicine and improve the efficiency of clinical-stage drug development. However, there are no published attempts to predict individual outcomes of clinical trials for cancer. We used machine learning (ML) to predict individual responses to a two-year course of bicalutamide, a standard treatment for prostate cancer, based on data from three Phase III clinical trials (n = 3653). We developed models that used a merged dataset from all three studies. The best performing models using merged data from all three studies had an accuracy of 76%. The performance of these models was confirmed by further modeling using a merged dataset from two of the three studies, and a separate study for testing. Together, our results indicate the feasibility of ML-based tools for predicting cancer treatment outcomes, with implications for precision oncology and improving the efficiency of clinical-stage drug development.

Highlights

  • This study reported that bicalutamide treatment significantly improved progression-free survival in patients with advanced prostate cancer, but not in patients with only localized disease

  • We developed a series of machine learning (ML) models to predict two-year outcomes of a standard treatment for prostate cancer, based on baseline data from three clinical trials

  • By creating a large dataset, merged from three clinical trials, we achieved an overall accuracy of 76%

Read more

Summary

Sources of Data

The sources of data for this study were the Astra-Zeneca Early Prostate Cancer clinical program. This program consisted of three Phase III clinical trials on bicalutamide or placebo in subjects with prostate cancer. These three studies had slightly different procedures but were designed to yield data that could be pooled. The datasets publicly released contain data for the drug arm but not the placebo arm. Prostate Cancer program, the sources of data for this study

Objectives of the Original Studies
Subjects and Datasets
Comparability of the Studies
Derivation of the Target Variable
Other Data Preprocessing Steps
Final Datasets
2.10. Planned Modeling
2.11.3. CatBoost
2.11.5. Voting Classifier
2.12. Modeling Procedures
Three-Study Merged Dataset Models
Separate Study Validation Models
Individual Study Models
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call