Fisher Score Method Research Articles

Introduction: Identifying gene mutation status before treatment has greatly improved our ability to risk-stratify patients with acute myeloid leukemia (AML), but identifying these mutations is labor-intensive. In this study, we hypothesize that immunophenotyping can predict gene mutations, and we aim to develop machine learning algorithms that predict AML gene mutation status from the immunophenotype measured by clinical flow cytometry.

Method: Retrospective clinical data of patients with AML at National Taiwan University Hospital were collected, including demographic (age and gender), molecular genetic, cytogenetic, and flow cytometry (FC) data. A total of 529 patients with newly diagnosed de novo AML from 2009 to 2019 were enrolled in this study. The median age at diagnosis was 58 years (Table 1). In total, 428 NPM1, 415 FLT3, 331 CEBPA and 338 RUNX1 gene testing results and 529 initial diagnostic FC data sets from these patients were used to develop the gene mutation prediction models. Each FC data sample contained 100,000 cells acquired on a FACSCantoII machine using 6 fluorescent channels with multiple fluorescent markers. The markers measured are listed in Table 2. There were 19 combinations of markers and fluorescent channels, which served as the feature inputs of the machine learning framework.

Our proposed machine learning framework can be divided into a phenotype representation learning stage and a classification model. To derive the phenotype representation, we trained a multivariate Gaussian Mixture Model (GMM) on the 19-dimensional FC data to capture the distribution and characteristics of the training data in a probabilistic, unsupervised manner. A Fisher-scoring method was then used to vectorize each sample by differentiating with respect to the learned GMM parameters. This Fisher vectorization maps each sample into a high-dimensional feature space as a phenotype vector.
We performed analysis of variance (ANOVA)-based feature selection on these representations, which were then fed into a support vector machine (SVM) classifier. To alleviate the negative effects of class imbalance in the gene mutation identification tasks, we applied the synthetic minority oversampling technique (SMOTE), which augmented the minority class by interpolating samples near the support vectors. We trained independent SVM models to detect each of the four gene mutations. The algorithm was evaluated by 5-fold cross-validation with random splits, in which each fold uses 80% of the data for training and 20% for testing.

Results: The gene mutation rates in this cohort for NPM1, FLT3, CEBPA and RUNX1 were 22.2% (95/428), 25.1% (104/415), 20.2% (67/331), and 13.6% (46/338), respectively. The average accuracies (ACC) of the prediction models for NPM1, FLT3, CEBPA and RUNX1 were 82.6%, 76.3%, 84.2% and 84.1%, respectively, whereas the areas under the ROC curve (AUC) were 77.9%, 63.4%, 80.7% and 67.7%, respectively (Table 3).

Conclusions: Our preliminary gene mutation prediction model demonstrates a potential correlation between recurrent AML gene mutation status and AML immunophenotype. Further studies with larger cohorts, followed by external validation, are needed to evaluate the feasibility of using machine learning-based algorithms as a triage tool to support physicians in clinical decisions for aggressive AML before molecular genetic reports are available.

Disclosures: Ko: Roche: Honoraria.
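The Fisher vectorization step described in this abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so the following are assumptions: a diagonal-covariance GMM whose parameters are already fitted, gradients taken with respect to component means and standard deviations, and the normalization commonly used for Fisher vectors. The function names (`posteriors`, `fisher_vector`) and the toy values are hypothetical, and the example uses 2 dimensions rather than the 19 in the study.

```python
import math

def log_gauss_diag(x, mean, std):
    """Log density of a diagonal-covariance Gaussian at point x."""
    return sum(
        -0.5 * math.log(2 * math.pi) - math.log(s) - 0.5 * ((xi - m) / s) ** 2
        for xi, m, s in zip(x, mean, std)
    )

def posteriors(x, weights, means, stds):
    """Posterior probability of each GMM component given one cell x."""
    logs = [math.log(w) + log_gauss_diag(x, m, s)
            for w, m, s in zip(weights, means, stds)]
    top = max(logs)  # subtract the max for numerical stability
    exps = [math.exp(v - top) for v in logs]
    total = sum(exps)
    return [e / total for e in exps]

def fisher_vector(cells, weights, means, stds):
    """Assumed formulation: gradients w.r.t. means, then w.r.t. std devs.

    Returns a flat list of length 2 * n_components * n_dims.
    """
    n_cells, n_comp, dim = len(cells), len(weights), len(means[0])
    grad_mu = [[0.0] * dim for _ in range(n_comp)]
    grad_sd = [[0.0] * dim for _ in range(n_comp)]
    for x in cells:
        gamma = posteriors(x, weights, means, stds)
        for k in range(n_comp):
            for d in range(dim):
                z = (x[d] - means[k][d]) / stds[k][d]
                grad_mu[k][d] += gamma[k] * z
                grad_sd[k][d] += gamma[k] * (z * z - 1.0)
    vec = []
    for k in range(n_comp):
        vec += [g / (n_cells * math.sqrt(weights[k])) for g in grad_mu[k]]
    for k in range(n_comp):
        vec += [g / (n_cells * math.sqrt(2 * weights[k])) for g in grad_sd[k]]
    return vec

# Toy sample: 3 cells, 2 markers, 2 mixture components (hypothetical values).
toy_cells = [[0.1, 0.2], [2.9, 3.1], [3.2, 2.8]]
toy_vec = fisher_vector(
    toy_cells,
    weights=[0.5, 0.5],
    means=[[0.0, 0.0], [3.0, 3.0]],
    stds=[[1.0, 1.0], [1.0, 1.0]],
)
print(len(toy_vec))
```

Whatever the number of cells in a sample, the resulting vector has a fixed length, which is what lets samples of 100,000 cells each be passed to a downstream feature selector and classifier.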


Among the risks associated with pigmented skin lesions, the possibility of malignant melanoma turns any anomalous occurrence of these lesions into a worrying sign. Differentiating malignant melanoma from melanocytic nevi is an error-prone problem that physicians commonly face in diagnosis. To address the difficulty of diagnosing pigmented skin lesions, several clinical diagnosis algorithms have been proposed, such as pattern analysis, the ABCD rule of dermoscopy, the Menzies method, and the 7-point checklist. Computerized monitoring based on these algorithms improves the diagnosis of melanoma compared with naked-eye examination by a physician. Several computerized studies and procedures have been proposed for the early detection of melanoma, with the aim of reducing the melanoma mortality rate. In this research, different approaches with a large number of features are discussed to identify the best approach or methodology for accurately diagnosing pigmented skin lesions.

This paper proposes an automated system for the diagnosis of melanoma that provides a quantitative and objective evaluation of the skin lesion, as opposed to visual assessment, which is subjective in nature. Two different data sets were used to reduce the effect of qualitative interpretation on diagnostic accuracy. One set consists of clinical images acquired with a standard camera; the other consists of dermoscopic images acquired with a special dermoscopic camera. The contribution of the system lies in the new, complete and distinct approaches presented for pigmented skin lesion diagnosis. These approaches result from feeding a large, comprehensive set of features to different classifiers. The three main types of features extracted from the region of interest are geometric, chromatic, and texture features.
Three statistical methods were used to select the most significant features for diagnosis: the Fisher score method, the t-test, and the F-test. The features ranked highest by these statistical methods were used to classify the two lesion groups with three different classifiers: Artificial Neural Network (ANN), K-Nearest Neighbor (KNN) and Support Vector Machine (SVM). The overall system performance was then measured in terms of specificity, sensitivity and accuracy. Among the approaches considered, the best result was obtained by the ANN using features selected by the Fisher score method, which achieved a diagnostic accuracy of 96.25% for dermoscopic images and 97% for clinical images.
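As an illustration of Fisher score feature ranking, the sketch below uses the common textbook definition: for each feature, the between-class scatter of the class means divided by the pooled within-class variance. The abstract does not state which variant the authors used, so this definition, the function names (`fisher_scores`, `rank_features`) and the toy data are assumptions for illustration only.

```python
def fisher_scores(rows, labels):
    """Fisher score of each feature column.

    score_j = sum_c n_c * (mean_cj - mean_j)^2 / sum_c n_c * var_cj
    where c runs over the classes and var_cj is the population variance
    of feature j within class c.
    """
    n_features = len(rows[0])
    classes = sorted(set(labels))
    scores = []
    for j in range(n_features):
        column = [row[j] for row in rows]
        overall_mean = sum(column) / len(column)
        between = 0.0
        within = 0.0
        for c in classes:
            values = [v for v, y in zip(column, labels) if y == c]
            n_c = len(values)
            mean_c = sum(values) / n_c
            var_c = sum((v - mean_c) ** 2 for v in values) / n_c
            between += n_c * (mean_c - overall_mean) ** 2
            within += n_c * var_c
        if within > 0:
            scores.append(between / within)
        else:
            # Zero within-class variance: perfectly separating or constant.
            scores.append(float("inf") if between > 0 else 0.0)
    return scores

def rank_features(scores):
    """Feature indices ordered from highest to lowest Fisher score."""
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)

# Toy data (hypothetical): feature 0 separates the two classes, feature 1 does not.
toy_rows = [[1.0, 5.0], [2.0, 1.0], [8.0, 4.0], [9.0, 2.0]]
toy_labels = [0, 0, 1, 1]
toy_scores = fisher_scores(toy_rows, toy_labels)
print(toy_scores, rank_features(toy_scores))
```

In a pipeline like the one described, the top-ranked feature indices would be kept and passed to the classifier; how many to keep is a tuning choice that the abstract does not specify.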



Articles published on Fisher Score Method

Construction of machine learning models for recognizing comorbid anxiety in epilepsy patients based on their clinical and quantitative EEG features

An Integrated Approach for Diabetes Detection Using Fisher Score Feature Selection and Capsule Network

Attack detection analysis in software-defined networks using various machine learning method

Feature selection based on self-information and entropy measures for incomplete neighborhood decision systems

Non-Bayesian Parametric Missing-Mass Estimation

A self-learned decomposition and classification model for schizophrenia diagnosis

Prediction of chemoresistance trait of cancer cell lines using machine learning algorithms and systems biology analysis

Statistical Foundations of Actuarial Learning and its Applications

Accurate Prediction of Gene Mutations with Flow Cytometry Immune-Phenotyping By Machine Learning Algorithm

Stochastic Functional Estimates in Longitudinal Models with Interval-Censored Anchoring Events.

Feature selection using Lebesgue and entropy measures for incomplete neighborhood decision systems

Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification

A Neighborhood Rough Sets-Based Attribute Reduction Method Using Lebesgue and Entropy Measures.

Joint neighborhood entropy-based gene selection method with fisher score for tumor classification

A novel synergistic fibroblast optimization based Kalman estimation model for forecasting time-series data

Automated Imaging System for Pigmented Skin Lesion Diagnosis

Fast estimation of diffusion tensors under Rician noise by the EM algorithm

Pigmented Skin Lesion Diagnosis by Automated Imaging System

Diagnostics for a Linear Model with First-Order Autoregressive Symmetrical Errors

Association Test for X-Linked QTL in Family-Based Designs

