Monte-Carlo Search Research Articles

Pediatric acute myeloid leukemia (pAML) encompasses over 20 molecular subtypes driven by unique genetic alterations, including hallmark chromosomal rearrangements and less frequently, point mutations or tandem duplications. Collectively, many of these pAML molecular categories are enriched in pediatric populations and are not represented by current classification systems, including the recently updated WHO or ICC. While fusion detection from RNA-Seq-based approaches is robust, many fusion negative subtypes would need to be defined by expression-based approaches as mutation calling from RNA-Seq data is less developed. Nevertheless, this could be challenging for subtypes with similar transcriptional profiles, such as those with shared HOX expression patterns, including NPM1, NUP98r, UBTF, DEK::NUP214, and KMT2A-PTD. To aid in the appropriate molecular classification of pAML, which is crucial for prognosis, we developed and compared three gene expression-based classifiers. A total of 1707 pAML gene expression profiles were mapped and analyzed from three distinct sources (St. Jude = 659; TARGET = 168; AAML1031 = 880). Raw read count data was normalized and scaled to obtain a relative expression value in transcripts per million (TPM), which served as input for feature selection, model training, and testing. Ground truth labels for all 1707 samples were obtained through multi-omics analysis, including whole genome sequencing, to identify fusions and mutations. For validation purposes, the data was stratified by subtypes and split 70/30 into training (n=1187) and testing (n=520). Three machine learning models were selected: random forest, XGboost, and linear support vector machine (SVM). Each sample had gene expression TPM data for 60,754 transcripts, out of which 20,004 transcripts related to protein-coding genes were incorporated for feature selection. Feature selection was performed using a median absolute deviation (MAD) algorithm to select the 3000 transcripts with the highest variability. Top variable 3000 genes were selected to allow for adequate tuning of the number of predictors. Each model was independently trained using stratified cross-validation and Monte-Carlo search for hyperparameter tuning. The best model was selected based on the Matthews Correlation Coefficient (MCC). Each model was tested on the hold-out TPM set with z-score normalization. On the hold-out testing set (n=520), the linear SVM model outperformed the random forest and XGboost models on five performance metrics across all subtypes (sensitivity=0.9577; precision=0.9577; specificity=0.9978; F1=0.96; accuracy=0.9958). The random forest (sensitivity=0.9154; precision=0.9154; specificity=0.9955; F1=0.92; accuracy=0.9915) and XGboost models (sensitivity=0.9231; precision=0.9231; specificity=0.9960; F1=0.92; accuracy=0.9923) also performed well across all subtypes. Although feature selection was shared across all three models, performance within each subtype varied between models. The linear SVM model demonstrated strong performance overall, driven by high specificity in classifying the KMT2Ar subgroup (n=127) and equal sensitivity across the GLIS-rearranged (n=14), GATA1 (n=10), BCL11B (n=7), CBFB::MYH11 (n=60), CEBPA (n=33), and RUNX1::RUNX1T1 (n=78) subtypes. The primary difference in performance between models is the high false positive rate for KMT2Ar and NPM1 (n=55) in the random forest and XGboost models. A preliminary hypothesis for this might be due to the large representation of KMT2Ar and NPM1 in the training data (24.85% and 10.78%, respectively). Synthetic upsampling (SMOTE) for the training dataset (n=1369) counteracts bias towards the majority classes and increases performance for the random forest (sensitivity=0.9327; precision=0.9327; specificity=0.9965; F1=0.93; accuracy=0.9933), XGboost (sensitivity=0.9404; precision=0.9404; specificity=0.9969; F1=0.94; accuracy=0.9940) and linear SVM (sensitivity=0.9615; precision=0.9615; specificity=0.9980; F1=0.96; accuracy=0.9962) models. Conjointly, these models demonstrate the utility and effectiveness of a machine learning approach for classifying pAML samples from transcriptome sequencing data, which may have broad clinical and research utility, especially for fusion negative subtypes.

Read full abstract

With the aim of improving the image quality of the crucial components of transmission lines taken by unmanned aerial vehicles (UAV), a priori work on the defective fault location of high-voltage transmission lines has attracted great attention from researchers in the UAV field. In recent years, generative adversarial nets (GAN) have achieved good results in image generation tasks. However, the generation of high-resolution images with rich semantic details from complex backgrounds is still challenging. Therefore, we propose a novel GANs-based image generation model to be used for the critical components of power lines. However, to solve the problems related to image backgrounds in public data sets, considering that the image background of the common data set CPLID (Chinese Power Line Insulator Dataset) is simple. However, it cannot fully reflect the complex environments of transmission line images; therefore, we established an image data set named “KCIGD” (The Key Component Image Generation Dataset), which can be used for model training. CFM-GAN (GAN networks based on coarse–fine-grained generators and multiscale discriminators) can generate the images of the critical components of transmission lines with rich semantic details and high resolutions. CFM-GAN can provide high-quality image inputs for transmission line fault detection and line inspection models to guarantee the safe operation of power systems. Additionally, we can use these high-quality images to expand the data set. In addition, CFM-GAN consists of two generators and multiple discriminators, which can be flexibly applied to image generation tasks in other scenarios. We introduce a penalty mechanism-related Monte Carlo search (MCS) approach in the CFM-GAN model to introduce more semantic details in the generated images. Moreover, we presented a multiscale discriminator structure according to the multitask learning mechanisms to effectively enhance the quality of the generated images. Eventually, the experiments using the CFM-GAN model on the KCIGD dataset and the publicly available CPLID indicated that the model used in this work outperformed existing mainstream models in improving image resolution and quality.

Read full abstract

Monte-Carlo Search Research Articles

Related Topics

Articles published on Monte-Carlo Search

DrugSynthMC: An Atom-Based Generation of Drug-like Molecules with Monte Carlo Search.

Bounds on galaxy stochasticity from halo occupation distribution modeling

Design of parallel 𝛽-sheet nanofibrils using Monte Carlo search, coarse-grained simulations, and experimental testing.

Comparing search algorithms on the retrosynthesis problem.

Multi-step carbon emissions forecasting model for industrial process based on a new strategy and machine learning methods

TIR predictor and optimizer: Web-tools for accurate prediction of translation initiation rate and precision gene design in Saccharomyces cerevisiae.

Downhole Track Detection via Multi-dimensional Conditional Generative Adversarial Nets

Data Augmentation for Bayesian Deep Learning

Gene Expression Machine Learning Models Classify Pediatric AML Subtypes with High Performance

Data-Driven Discovery of Lithium-Ion Battery State of Charge Dynamics

Finding the best opening in chess with multi-armed bandit algorithm

The LISA Data Challenge Radler analysis and time-dependent ultra-compact binary catalogues

Design and analysis of a 2-DOF compliant serial micropositioner based on "S-shaped" flexure hinge

Review on the application of deep learning algorithms in video game AI agent

UAV Aerial Image Generation of Crucial Components of High-Voltage Transmission Lines Based on Multi-Level Generative Adversarial Network

Generative design of texture for sliding surface based on machine learning

Optimization design for slip/no-slip configuration of hydrophobic sliding bearings using Monte Carlo search

The taxicab sampler: MCMC for discrete spaces with application to tree models

Air combat manoeuvre strategy algorithm based on two-layer game decision-making and the distributed MCTS method with double game trees

Air Combat Maneuver Strategy Algorithm Based on Two-Layer Game Decision-Making and Distributed Double Game Trees MCTS under Uncertain Information

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Monte-Carlo Search Research Articles

Related Topics

Articles published on Monte-Carlo Search

DrugSynthMC: An Atom-Based Generation of Drug-like Molecules with Monte Carlo Search.

Bounds on galaxy stochasticity from halo occupation distribution modeling

Design of parallel 𝛽-sheet nanofibrils using Monte Carlo search, coarse-grained simulations, and experimental testing.

Comparing search algorithms on the retrosynthesis problem.

Multi-step carbon emissions forecasting model for industrial process based on a new strategy and machine learning methods

TIR predictor and optimizer: Web-tools for accurate prediction of translation initiation rate and precision gene design in Saccharomyces cerevisiae.

Downhole Track Detection via Multi-dimensional Conditional Generative Adversarial Nets

Data Augmentation for Bayesian Deep Learning

Gene Expression Machine Learning Models Classify Pediatric AML Subtypes with High Performance

Data-Driven Discovery of Lithium-Ion Battery State of Charge Dynamics

Finding the best opening in chess with multi-armed bandit algorithm

The LISA Data Challenge Radler analysis and time-dependent ultra-compact binary catalogues

Design and analysis of a 2-DOF compliant serial micropositioner based on "S-shaped" flexure hinge

Review on the application of deep learning algorithms in video game AI agent

UAV Aerial Image Generation of Crucial Components of High-Voltage Transmission Lines Based on Multi-Level Generative Adversarial Network

Generative design of texture for sliding surface based on machine learning

Optimization design for slip/no-slip configuration of hydrophobic sliding bearings using Monte Carlo search

The taxicab sampler: MCMC for discrete spaces with application to tree models

Air combat manoeuvre strategy algorithm based on two-layer game decision-making and the distributed MCTS method with double game trees

Air Combat Maneuver Strategy Algorithm Based on Two-Layer Game Decision-Making and Distributed Double Game Trees MCTS under Uncertain Information