Molecular Biology Laboratory Research Articles

Since 2017, we have used IonTorrent NGS platform in our hospital to diagnose and treat cancer. Analyzing variants at each run requires considerable time, and we are still struggling with some variants that appear correct on the metrics at first, but are found to be negative upon further investigation. Can any machine learning algorithm (ML) help us classify NGS variants? This has led us to investigate which ML can fit our NGS data and to develop a tool that can be routinely implemented to help biologists. Currently, one of the greatest challenges in medicine is processing a significant quantity of data. This is particularly true in molecular biology with the advantage of next-generation sequencing (NGS) for profiling and identifying molecular tumors and their treatment. In addition to bioinformatics pipelines, artificial intelligence (AI) can be valuable in helping to analyze mutation variants. Generating sequencing data from patient DNA samples has become easy to perform in clinical trials. However, analyzing the massive quantities of genomic or transcriptomic data and extracting the key biomarkers associated with a clinical response to a specific therapy requires a formidable combination of scientific expertise, biomolecular skills and a panel of bioinformatic and biostatistic tools, in which artificial intelligence is now successful in developing future routine diagnostics. However, cancer genome complexity and technical artifacts make identifying real variants challenging. We present a machine learning method for classifying pathogenic single nucleotide variants (SNVs), single nucleotide polymorphisms (SNPs), multiple nucleotide variants (MNVs), insertions, and deletions detected by NGS from different types of tumor specimens, such as: colorectal, melanoma, lung and glioma cancer. We compared our NGS data to different machine learning algorithms using the k-fold cross-validation method and to neural networks (deep learning) to measure the performance of the different ML algorithms and determine which one is a valid model for confirming NGS variant calls in cancer diagnosis. We trained our machine learning with 70% of our data samples, extracted from our local database (our data structure had 7 parameters: chromosome, position, exon, variant allele frequency, minor allele frequency, coverage and protein description) and validated it with the 30% remaining data. The model offering the best accuracy was chosen and implemented in the NGS analysis routine. Artificial intelligence was developed with the R script language version 3.6.0. We trained our model on 70% of 102,011 variants. Our best error rate (0.22%) was found with random forest machine learning (ntree = 500 and mtry = 4), with an AUC of 0.99. Neural networks achieved some good scores. The final trained model with the neural network achieved an accuracy of 98% and an ROC-AUC of 0.99 with validation data. We tested our RF model to interpret more than 2000 variants from our NGS database: 20 variants were misclassified (error rate < 1%). The errors were nomenclature problems and false positives. After adding false positives to our training database and implementing our RF model routinely, our error rate was always < 0.5%. The RF model shows excellent results for oncosomatic NGS interpretation and can easily be implemented in other molecular biology laboratories. AI is becoming increasingly important in molecular biomedical analysis and can be very helpful in processing medical data. Neural networks show a good capacity in variant classification, and in the future, they may be useful in predicting more complex variants.

Read full abstract

▪AimsLife expectancy of CML pts optimally responding to tyrosine kinase inhibitors (TKI) is close to that of the general population and recently, TFR has been acknowledged as a new goal of CML management. TKI discontinuation in the view of TFR requires the achievement of deep and long-lasting molecular responses (MR). The gold standard BCR-ABL mRNA quantification technology and MR definitions rely on internationally standardized (IS) RT-qPCR but atypical transcripts located outside the Major-BCR region, harbored by 1-2% of pts, cannot be expressed on the IS scale. Thus, most trials and clinical practice recommendations prevent such pts from attempting TFR. The Fi-LMC group retrospectively collected real-life observations to assess TFR likelihood in this rare population.MethodsData from CML pts with precise characterization of atypical transcripts in whom any line TKI was stopped for any reason but after at least 2 years of undetectable molecular residual disease (UMRD) by individualized non-standardized RT-qPCR were collected. RT-qPCR sensitivity varied depending on transcript type and local molecular biology laboratory. TFS was estimated by the Kaplan-Meier method. Relapse was analyzed using the cumulative incidence function, relapse being as UMRD loss at any time and any level during follow-up (FU).ResultsOur series comprised 16 adult CP CML pts with atypical BCR-ABL fusions including 12 males (75%). Median age at CML diagnosis was 56 years (range: 21-75) and that at TKI discontinuation was 67 years [range: 29-82]. Sokal score was low, intermediate and high in 7, 8 and 1 pts, respectively. ELTS score was low and intermediate in 10 and 4 pts, respectively and unknown in 2. Most pts expressed e19a2 (n=6) followed by e6a2 (n=4), b3a3 (n=3), b2a3 (n=2) and e8a2 (n=1). Seven pts discontinued imatinib, 4 stopped dasatinib, 4 nilotinib and 1 bosutinib. Number of lines of therapy was 2 in 8 pts, 1 in 5 pts and 3 in 3 pts. Median TKI treatment duration before discontinuation was 64 months (range: 31-218) and median duration of UMRD was 41 months (range: 21-168). The median FU after TKI discontinuation was 68 months (range: 3-149). Five pts experienced relapse leading to TKI resumption. Four relapses occurred within 3-6 months and included 2 loss of hematologic response in CP, 1 loss of hematologic response in accelerated phase CML and 1 molecular recurrence with BCR-ABL transcripts up to 1.5%. One relapse occurred at 49 months and consisted in loss of a complete cytogenetic response. These 5 pts resumed TKI and regained UMRD within 6 months, including 1 pt who died in UMRD from non-CML-related cause at the age of 82 years and 1 pt who rapidly failed a 2 nd TKI discontinuation attempt.In 1 additional pt, BCR-ABL transcripts became detectable intermittently with maximum transcript level of 0.15% and TKI was not resumed. The median FU of pts who remained treatment-free was 68 months (range: 8-149).Overall, the 5-year cumulative incidence of relapse regardless of whether TKI was resumed was 41.6% (95% confidence interval: 21.9%-78.7%) (Figure 1). The 5-year TFS rate was 65.2% (95% confidence interval: 40.3%-90.2%) (Figure 2).ConclusionsOur observational study of TKI discontinuation in CML pts with atypical BCR-ABL transcripts is the largest reported so far. While effort must be made for proper assessment of deep MR, preliminary results suggest that TFS pattern might favorably compare with that obtained in pts with Major-type BCR-ABL transcripts. However, relapses may be more aggressive and caution is required in order to avoid loss of hematologic responses and progression. Whether the type of atypical fusion gene influences TKI discontinuation outcome, as well as other potential prognostic factors, need to be determined in a larger series. [Display omitted] DisclosuresCharbonnier: Novartis: Speakers Bureau; Incyte: Speakers Bureau. Rea: Novartis: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees; Incyte: Honoraria, Membership on an entity's Board of Directors or advisory committees; Pfizer: Honoraria, Membership on an entity's Board of Directors or advisory committees. Etienne: Novartis: Consultancy, Speakers Bureau; Incyte: Consultancy, Speakers Bureau. Rousselot: Incyte, Pfizer: Consultancy, Research Funding. Nicolini: Novartis: Honoraria, Membership on an entity's Board of Directors or advisory committees, Other: travel, accommodations, expenses, Research Funding; Kartos Therapeutics: Consultancy, Membership on an entity's Board of Directors or advisory committees; Sun Pharma Ltd.: Consultancy, Membership on an entity's Board of Directors or advisory committees; Incyte Biosciences: Honoraria, Other: travel, accommodations, expenses, Research Funding, Speakers Bureau; BMS: Honoraria.

Read full abstract

Molecular Biology Laboratory Research Articles

Related Topics

Articles published on Molecular Biology Laboratory

Machine learning random forest for predicting oncosomatic variant NGS analysis

Treatment Free Survival (TFS) in Patients (pts) with Chronic Myeloid Leukemia (CML) Carrying Atypical BCR-ABL1 Fusion Transcripts: The French CML Group (Fi-LMC) Experience

High sensitivity-low cost detection of SARS-CoV-2 by two steps end point RT-PCR with agarose gel electrophoresis visualization

Fabrication of graphene nanoplatelets embedded “partition cartridge” for efficient separation of target-bound ssDNA during SELEX

Beeporter: Tools for high-throughput analyses of pollinator-virus infections.

Polymorphism of p53 Codon 72 Gene on Cervical Cancer Incidence in Malay Population

Construction of DNA ladder for determination of small size DNA fragments

Locating disease spread: cholera to coronavirus and the art of the image.

Integrating active learning activities and metacognition into STEM writing courses.

Development of Transgenic Sugarcane Resistant to Stem Borer by Transforming cry1Ab-cry1Ac Fusion Gene through Agrobacterium tumefaciens Transformation Method

UJI AKTIVITAS ANTIBAKTERI DAN ANTI-UV DARI EKSTRAK ETIL ASETAT ISOLAT JAMUR AFBK 5c YANG BERSIMBION DENGAN ASCIDIA Sigilina sp. DARI PERAIRAN PULAU BANGKA

42P Method optimization for the detection of chimerism by real-time PCR and droplet digital PCR

Assay Optimization Can Equalize the Sensitivity of Real-Time PCR with ddPCR for Detection of Helicoverpa armigera (Lepidoptera: Noctuidae) in Bulk Samples.

Connecting research and teaching introductory cell and molecular biology using an Arabidopsis mutant screen.

Natural Enemies Associated with Brevipalpus sp. (Acari: Tenuipalpidae), Vector of Citrus Leprosis

From Bench to Monitor – Turning Advanced Life Science Courses into Virtual

Interview with 2021 Hooke medal winner Stephen Royle

The First Wave of COVID-19 Pandemic: Sociodemographic Characteristics of Patients in A Tertiary Level Hospital of Bangladesh

The 2021 FASEB Virtual Science Research Conference on Protein Aggregation: Function, Dysfunction, and Disease, June 23-25, 2021.

Simultaneous ribosome profiling of hundreds of microbes from the human microbiome.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Molecular Biology Laboratory Research Articles

Related Topics

Articles published on Molecular Biology Laboratory

Machine learning random forest for predicting oncosomatic variant NGS analysis

Treatment Free Survival (TFS) in Patients (pts) with Chronic Myeloid Leukemia (CML) Carrying Atypical BCR-ABL1 Fusion Transcripts: The French CML Group (Fi-LMC) Experience

High sensitivity-low cost detection of SARS-CoV-2 by two steps end point RT-PCR with agarose gel electrophoresis visualization

Fabrication of graphene nanoplatelets embedded “partition cartridge” for efficient separation of target-bound ssDNA during SELEX

Beeporter: Tools for high-throughput analyses of pollinator-virus infections.

Polymorphism of p53 Codon 72 Gene on Cervical Cancer Incidence in Malay Population

Construction of DNA ladder for determination of small size DNA fragments

Locating disease spread: cholera to coronavirus and the art of the image.

Integrating active learning activities and metacognition into STEM writing courses.

Development of Transgenic Sugarcane Resistant to Stem Borer by Transforming cry1Ab-cry1Ac Fusion Gene through Agrobacterium tumefaciens Transformation Method

UJI AKTIVITAS ANTIBAKTERI DAN ANTI-UV DARI EKSTRAK ETIL ASETAT ISOLAT JAMUR AFBK 5c YANG BERSIMBION DENGAN ASCIDIA Sigilina sp. DARI PERAIRAN PULAU BANGKA

42P Method optimization for the detection of chimerism by real-time PCR and droplet digital PCR

Assay Optimization Can Equalize the Sensitivity of Real-Time PCR with ddPCR for Detection of Helicoverpa armigera (Lepidoptera: Noctuidae) in Bulk Samples.

Connecting research and teaching introductory cell and molecular biology using an Arabidopsis mutant screen.

Natural Enemies Associated with Brevipalpus sp. (Acari: Tenuipalpidae), Vector of Citrus Leprosis

From Bench to Monitor – Turning Advanced Life Science Courses into Virtual

Interview with 2021 Hooke medal winner Stephen Royle

The First Wave of COVID-19 Pandemic: Sociodemographic Characteristics of Patients in A Tertiary Level Hospital of Bangladesh

The 2021 FASEB Virtual Science Research Conference on Protein Aggregation: Function, Dysfunction, and Disease, June 23-25, 2021.

Simultaneous ribosome profiling of hundreds of microbes from the human microbiome.