ML-based Systems Research Articles

Abstract Introduction Machine learning (ML) models offer the potential to provide rich, quantitative characterizations of the tumor and tumor micro-environment (TME). Here we deployed a machine learning-based approach to the analysis of H&E images from HUDSON (NCT03334617), an AstraZeneca Phase II Platform clinical trial, to identify and quantify cellular composition and tissue architecture features in the TME that are associated with genomic alterations and time to progression on anti-PD(L)1 therapies. Methods PathAI previously trained ML models on non-small cell lung carcinoma (NSCLC) samples from commercial and clinical datasets to identify cell types and tissue regions within the TME. With no additional training, the models were deployed on 169 digitized whole slide images (WSIs) of H&E-stained biopsies from an international, multi-site AstraZeneca-sponsored Phase II clinical trial of novel anti-cancer agents in subjects with metastatic NSCLC. Biopsies were across multiple body sites, and taken both pre- and post-checkpoint progression. ML models generated human interpretable features (HIFs) that characterize the cell composition and tissue architecture from each biopsied sample. HIFs from baseline samples that met minimum image quality thresholds (n=89) were clustered to reduce redundancy and were tested for association with weeks to progression on anti-PD(L)1 therapy using Cox regression analysis. Results The PathAI ML models were successfully deployed on WSIs from the HUDSON clinical trial. Following correction for biopsy timing and location, a total of 59 HIFs were found to be significantly associated (p &lt;0.05) with weeks to progression on anti-PD(L)1 therapy, including features related to plasma cell infiltration, proportion of cancer cells, presence of macrophages and fibroblasts, and blood vessel compression. Features characterizing both plasma cells and blood vessels were also found to be significantly associated with any class I HLA locus loss of heterozygosity. Conclusions PathAI models were able to identify TME-associated features from WSIs from a Phase II clinical trial which were associated with therapy failure and genomic alterations. These results suggest the power of deploying pre-trained ML-based systems in a clinical trial setting to identify pathobiological features associated with tumor characteristics and time to progression from only H&E images. Citation Format: Laura Dillon, Marylens Hernandez, Ben Glass, Guillaume Chhor, Sara Hoffman, Varsha Chinnaobireddy, Sai Chowdary Gullapally, Kris Sachsenmeier, Andy Beck, Jason Hipp. Deep learning identifies pathobiological features within H&E images associated with genomic alterations and progression on anti-PD(L)1 in HUDSON, an AstraZeneca-sponsored Phase II clinical trial [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr LB016.

Read full abstract

Abstract Introduction Machine learning models offer the potential to provide rich, quantitative characterizations of the tumor and tumor micro-environment (TME); however, historically it has been difficult to generalize trained models to new sets of clinical trial samples from trials not used in training. Here we evaluate the ability to deploy a machine learning based model (ML Model) for the identification of non-small cell lung tissue regions and lymphocytes within the tumor and TME on H&E stained images from clinical trial samples with no additional model training. Methods The ML model was previously trained on both squamous cell carcinoma and lung adenocarcinoma non-small cell lung carcinoma (NSCLC) samples from commercial and clinical datasets. The ML model was deployed on an AstraZeneca-sponsored phase II clinical trial of novel anti-cancer agents in patients with metastatic NSCLC. In order to validate the predictions of lymphocytes from the H&E stained images, we established a reference dataset for manual vs digital concordance consisting of 300, 150 × 150-micron–sized “frames” sampled from the trial dataset, removing frames of inadequate tissue quality or with presence of artifacts. For each frame, we collected exhaustive annotations from 5 pathologists to produce quantitative estimates of lymphocytes. Altogether, 43,932 annotations were collected and used to compute pathologist consensus scores for each frame. These scores were then correlated with each individual pathologist (inter-reader agreement) and with the PathAI-derived automated scores for evaluation of manual vs digital agreement. Results The PathAI system was successfully deployed on 169 H&E stained images from the phase II clinical trial to exhaustively identify all tumor associated lymphocytes from each whole slide image. In total, PathAI classified 2,859,796 lymphocytes, with an average number of 16,922 lymphocytes per image. We used frames-based validation to determine the correlation between the automated scoring and consensus scoring from pathologists hand labeling individual lymphocytes within image frames. The PathAI platform showed strong correlation between reference-based consensus scores (r2 = 0.84, CI [0.80 – 0.87]) and the ML model, which was similar to the level of agreement achieved between individual pathologists (r2 = 0.80, CI [0.76 – 0.85]). Conclusions The PathAI system showed strong generalizability for the identification of lymphocytes within the tumor and TME from H&E stained images from NSCLC clinical trial samples. These results suggest the power of deploying ML-based systems broadly for the automated, single cell resolution characterization of disease pathology from clinical trial material. Citation Format: Ben Glass, Laura Dillon, Guillaume Chhor, Sara Hoffman, Varsha Chinnaobireddy, Sai Chowdary Gullapally, Andy Beck, Jason Hipp. Robust deployment of ML models quantifying the H&E tumor microenvironment in NSCLC subjects from an AstraZeneca-sponsored phase II clinical trial [abstract]. In: Proceedings of the AACR Virtual Special Conference on Artificial Intelligence, Diagnosis, and Imaging; 2021 Jan 13-14. Philadelphia (PA): AACR; Clin Cancer Res 2021;27(5_Suppl):Abstract nr PO-072.

Read full abstract

ML-based Systems Research Articles

Related Topics

Articles published on ML-based Systems

Evaluating the Cybersecurity Risk of Real-world, Machine Learning Production Systems

Bias and Discrimination in Ml-Based Systems of Administrative Decision-Making and Support

Learning Assurance Analysis for Further Certification Process of Machine Learning Techniques: Case-Study Air Traffic Conflict Detection Predictor.

The Role of Explainability in Assuring Safety of Machine Learning in Healthcare

Amortized Generation of Sequential Algorithmic Recourses for Black-Box Models

Interpretable Model for Collaborative Filtering Using an Extended Latent Dirichlet Allocation Approach

Federated Learning for Healthcare: Systematic Review and Architecture Proposal

Accuracy and Interpretability: Struggling with the Epistemic Foundations of Machine Learning-Generated Medical Information and Their Practical Implications for the Doctor-Patient Relationship

Abstract LB016: Deep learning identifies pathobiological features within H&E images associated with genomic alterations and progression on anti-PD(L)1 in HUDSON, an AstraZeneca-sponsored Phase II clinical trial

Abstract PO-072: Robust deployment of ML models quantifying the H&E tumor microenvironment in NSCLC subjects from an AstraZeneca-sponsored phase II clinical trial

Bladder cancer in the time of machine learning: Intelligent tools for diagnosis and management.

Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

Bridging the Gap between ISO 26262 and Machine Learning: A Survey of Techniques for Developing Confidence in Machine Learning Systems

On testing machine learning programs

Machine Learning for Quantitative Finance Applications: A Survey

Viability Assessment of a Cross-Tokamak AUG-JET Disruption Predictor

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

ML-based Systems Research Articles

Related Topics

Articles published on ML-based Systems

Evaluating the Cybersecurity Risk of Real-world, Machine Learning Production Systems

Bias and Discrimination in Ml-Based Systems of Administrative Decision-Making and Support

Learning Assurance Analysis for Further Certification Process of Machine Learning Techniques: Case-Study Air Traffic Conflict Detection Predictor.

The Role of Explainability in Assuring Safety of Machine Learning in Healthcare

Amortized Generation of Sequential Algorithmic Recourses for Black-Box Models

Interpretable Model for Collaborative Filtering Using an Extended Latent Dirichlet Allocation Approach

Federated Learning for Healthcare: Systematic Review and Architecture Proposal

Accuracy and Interpretability: Struggling with the Epistemic Foundations of Machine Learning-Generated Medical Information and Their Practical Implications for the Doctor-Patient Relationship

Abstract LB016: Deep learning identifies pathobiological features within H&amp;E images associated with genomic alterations and progression on anti-PD(L)1 in HUDSON, an AstraZeneca-sponsored Phase II clinical trial

Abstract PO-072: Robust deployment of ML models quantifying the H&amp;E tumor microenvironment in NSCLC subjects from an AstraZeneca-sponsored phase II clinical trial

Bladder cancer in the time of machine learning: Intelligent tools for diagnosis and management.

Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

Bridging the Gap between ISO 26262 and Machine Learning: A Survey of Techniques for Developing Confidence in Machine Learning Systems

On testing machine learning programs

Machine Learning for Quantitative Finance Applications: A Survey

Viability Assessment of a Cross-Tokamak AUG-JET Disruption Predictor

Abstract LB016: Deep learning identifies pathobiological features within H&E images associated with genomic alterations and progression on anti-PD(L)1 in HUDSON, an AstraZeneca-sponsored Phase II clinical trial

Abstract PO-072: Robust deployment of ML models quantifying the H&E tumor microenvironment in NSCLC subjects from an AstraZeneca-sponsored phase II clinical trial