Static Machine Learning Research Articles

Abstract

Aim: We recently published the first machine learning framework that integrates multi-omic data derived from the pre-therapy breast tumor ecosystem to accurately predict response to neoadjuvant systemic therapy (Sammut et al., Nature 2021). We aimed to extend this framework to incorporate serially acquired multi-omic data to further improve response predictions and model tumor biology as it is perturbed by treatment.

Methods: Breast tumor core biopsies were acquired at diagnosis from 168 women who went on to receive pre-operative chemotherapy (or chemotherapy plus anti-HER2 targeted therapy). Serial tumor core biopsies were obtained midway through (n=78) and on completion of (n=69) neoadjuvant systemic therapy. Response was assessed at surgery using the Residual Cancer Burden score. Core biopsies were molecularly profiled by shallow whole genome, exome and RNA sequencing, and their histological architecture was characterized using digital pathology. Baseline clinical, molecular, and digital pathology imaging features associated with response were identified and their dynamics modelled throughout therapy.

Results: At baseline, a total of 34 features derived from multi-omic data were associated with response to neoadjuvant therapy. These included tumor mutation and neoantigen burden, subclonal diversity, homologous recombination deficiency (HRD) and chromosomal instability. A suppressed immune response, typified by the presence of T-cell dysfunction and immune exclusion, was associated with extensive chemoresistance. The changes in abundance of these features across the serially sampled on-therapy tumors were then mapped to response outcomes. An early increase in adaptive and innate immune infiltration and activation was associated with a linear decrease in expressed neoantigens and tumor proliferation and with loss of subclonal mutational and copy number diversity, indicating early response to therapy. Conversely, a stable tumor and microenvironment transcriptional landscape throughout treatment, corresponding to minimal change in tumor clonal architecture and microenvironment composition, was associated with a poor response to therapy. Notably, tumors with loss of heterozygosity (LOH) at HLA failed to engage a productive immune response during treatment, and this was associated with resistance. A dynamic framework that models the change of these on-therapy features and extends the functionality of the published static machine learning model is being developed.

Conclusion: Response to neoadjuvant therapy is determined by the baseline characteristics of the tumor ecosystem. During therapy, both the tumor and its microenvironment follow distinct evolutionary trajectories that can be mapped to outcome. The change in enrichment of features derived from the tumor and its microenvironment can be integrated within machine learning frameworks that leverage dynamic data from the entire therapy time course for more accurate response prediction.

Citation Format: Stephen-John Sammut, Mireia Crispin-Ortuzar, Suet-Feung Chin, Elena Provenzano, Wei Cope, Ali Dariush, Sarah-Jane Dawson, Paul D. Pharoah, Florian Markowetz, Oscar M. Rueda, Helena M. Earl, Carlos Caldas. Predicting response to treatment in early breast cancer using dynamic integrative multi-omic profiling [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 476.

Lenders, such as banks and credit card companies, use credit scoring models to evaluate the potential risk posed by lending money to customers, and therefore to mitigate losses due to bad credit. The profitability of banks thus depends heavily on the models used to decide on customers' loans. State-of-the-art credit scoring models are based on machine learning and statistical methods. One of the major problems in this field is that lenders often deal with imbalanced datasets that usually contain many paid loans but very few unpaid ones (called defaults). Recently, dynamic selection methods combined with ensemble methods and preprocessing techniques have been evaluated to improve classification models on imbalanced datasets, showing advantages over static machine learning methods. In a dynamic selection technique, samples in the neighborhood of each query sample are used to compute the local competence of each base classifier. Then, the technique selects only competent classifiers to predict the query sample. In this paper, we evaluate the suitability of dynamic selection techniques for the credit scoring problem, and we present Reduced Minority k-Nearest Neighbors (RMkNN), an approach that improves on the state of the art in defining the local region of dynamic selection techniques for imbalanced credit scoring datasets. The proposed technique has superior prediction performance on imbalanced credit scoring datasets compared to the state of the art. Furthermore, RMkNN does not need any preprocessing or sampling method to generate the dynamic selection dataset (called DSEL). Additionally, we observe an equivalence between dynamic selection and static selection classification. We conduct a comprehensive evaluation of the proposed technique against state-of-the-art competitors on six real-world public datasets and one private one. Experiments show that RMkNN improves classification performance on the evaluated datasets in terms of AUC, balanced accuracy, H-measure, G-mean, F-measure, and Recall.
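
The generic dynamic selection step described in this abstract (find the query's neighborhood in DSEL, score each base classifier's local competence, keep only the competent ones) might be sketched as follows. This is a minimal illustration under stated assumptions, not the RMkNN method itself: it uses plain Euclidean k-nearest neighbors and local accuracy as the competence measure, and the toy data, the three rule-based classifiers, and all function names are hypothetical.

```python
from collections import Counter
from math import dist


def k_nearest(query, dsel_X, k):
    """Return indices of the k DSEL samples closest to the query (Euclidean)."""
    order = sorted(range(len(dsel_X)), key=lambda i: dist(query, dsel_X[i]))
    return order[:k]


def local_competence(clf, dsel_X, dsel_y, neighbors):
    """Fraction of neighborhood samples that the classifier labels correctly."""
    hits = sum(clf(dsel_X[i]) == dsel_y[i] for i in neighbors)
    return hits / len(neighbors)


def dynamic_predict(query, pool, dsel_X, dsel_y, k=3):
    """Select the locally most competent classifiers and majority-vote them."""
    neighbors = k_nearest(query, dsel_X, k)
    scores = [local_competence(clf, dsel_X, dsel_y, neighbors) for clf in pool]
    best = max(scores)
    selected = [clf for clf, s in zip(pool, scores) if s == best]
    votes = Counter(clf(query) for clf in selected)
    return votes.most_common(1)[0][0]


# Hypothetical toy DSEL: two features per loan, label 1 = default, 0 = paid.
dsel_X = [(0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (0.8, 0.9), (0.9, 0.8), (0.7, 0.7)]
dsel_y = [0, 0, 0, 1, 1, 0]


# Hypothetical pool of simple base classifiers (assumptions for illustration).
def always_paid(x):
    return 0


def rule_a(x):
    return int(x[0] > 0.5)


def rule_b(x):
    return int(x[1] > 0.75)


pool = [always_paid, rule_a, rule_b]

high_risk = dynamic_predict((0.85, 0.85), pool, dsel_X, dsel_y, k=3)
low_risk = dynamic_predict((0.15, 0.15), pool, dsel_X, dsel_y, k=3)
print(high_risk, low_risk)
```

For the first query, only `rule_b` classifies all three neighbors correctly, so it alone makes the prediction; for the second, all three classifiers are equally competent and vote together. How the neighborhood is defined is the part that RMkNN is reported to change for imbalanced data, and that part is not reproduced here.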

Articles published on Static Machine Learning

Integrating Feature Selection with Machine Learning for Accurate Reservoir Landslide Displacement Prediction

LungEcho - Resource Constrained Lung Ultrasound Video Analysis Tool for Faster Triaging and Active Learning

Financial Time Series Forecasting: A Data Stream Mining-Based System

Enhanced Intrusion Detection with Data Stream Classification and Concept Drift Guided by the Incremental Learning Genetic Programming Combiner

A Static Machine Learning Based Evaluation Method for Usability and Security Analysis in E-Commerce Website

Personalized model to predict seizures based on dynamic and static continuous EEG monitoring data

KF-Loc: A Kalman Filter and Machine Learning Integrated Localization System Using Consumer-Grade Millimeter-Wave Hardware

Abstract 476: Predicting response to treatment in early breast cancer using dynamic integrative multi-omic profiling

Auditing static machine learning anti-Malware tools against metamorphic attacks

A novel approach to define the local region of dynamic selection techniques in imbalanced credit scoring problems

Classification Of Association Item Sets From Large Data Sets Based On User Awareness Using Hybrid

Incremental supervised learning: algorithms and applications in pattern recognition

Activity Recognition for Incomplete Spinal Cord Injury Subjects Using Hidden Markov Models

An adaptive strategy for the classification of g-protein coupled receptors
