Flexible Workflow Research Articles

Introduction: The acquisition of somatic mutations in hematopoietic stem and progenitor stem cells with resultant clonal expansion is known as clonal hematopoiesis (CH). CH is linked to a higher risk of hematologic malignancies and other adverse health outcomes. The prevalence of CH is dependent on sequencing technique. With ultra-high-depth sequencing, CH can be detected in most adults. However, CH variant calling at very low variant allele fractions (VAF) is challenging due to difficulty in distinguishing low frequency CH mutations from sequencing artifacts. Here we develop and validate a novel CH variant calling approach that combines ensemble based variant calling with advanced artifact filtering and outperforms commonly used somatic variant callers. Methods: We sequenced a tumor-normal dilution series from six AML patients and 27 technical controls with a targeted panel utilizing unique molecular indexes that included nine common CH genes (DNMT3A, TET2, ASXL1, TP53, PPM1D, JAK2, SF3B1, SRSF2) with a mean unique coverage of 18,268x. In total we identified 32 mutations in these six samples validated by an independent sequencing panel performed for the purpose of clinical testing. The variant allele fraction (VAF) in the resulting tumor-dilution samples ranged from 0.1%-50% VAF. Results: For higher VAF mutations (>5%), Mutect2, Lofreq, and Vardict showed reasonable sensitivity (range: 0.70-1.00) and positive predictive values (PPV) (range: 0.80-1.00). However, at lower VAF, PPV was poor; for mutations between 0.1-0.4% VAF, the PPV ranged between 0.01-0.22. We applied additional filters based on sequencing quality, depth, regional complexity, and an empiric estimate of sequencing error at the position of a given called variant using technical controls. Taking the consensus of three callers and including our additional filters, we not only retained sensitivity and PPV for higher VAF variants above 1% (0.92 sensitivity, 0.95 PPV) but improved the PPV with retained sensitivity for variants between 1-0.1% VAF (0.91 sensitivity, 0.83 PPV). Applying our consensus approach to blood samples drawn from 31 healthy individuals sequenced using the same targeted panel with variants validated using an orthogonal approach, we saw excellent sensitivity and PPV across a wide range of VAFs (Sensitivity 0.99 [>1% VAF], 0.92 [0.2-1% VAF], and 0.82 [0.1-0.2% VAF]; PPV 1.00 [>1% VAF], 0.98 [0.2-1% VAF], 0.97 [0.2-0.1% VAF]). We compared our consensus calling and advanced filtering approach to a machine learning model. A decision-tree-based ensemble machine learning algorithm, XGBoost, was trained on the tumor-dilution samples with validation using 31 blood samples with orthogonal sequencing. The variant caller output (allele counts, flags, etc.), additional false positive filtering metrics, and statistical results from the technical controls were used as features. Sensitivity and PPV was high for mutations above 1% VAF (0.97 sensitivity and 0.82 PPV) with the performance declining for lower VAF mutations (0.72 sensitivity and 0.65 PPV). Thus, a machine learning approach did not outperform consensus calling with advanced artifact filtering. Finally, we developed a custom annotation pipeline for CH variant pathogenicity classification incorporating previously reported CH mutations in large cohorts, cancer somatic mutation frequency databases and curated cancer driver knowledge databases. Conclusion: We developed ArCCH, an advanced artifact filtering and consensus CH calling pipeline that includes custom annotation features to facilitate CH pathogenicity determination. Our results show that our consensus approach with advanced error correction filters substantially improves the performance of CH variant calling beyond commonly used single variant callers. This end-to-end flexible workflow package is publicly available on Terra and will facilitate future CH studies.

Read full abstract

Abstract Analysis of circulating tumor cells (CTCs) by multiparameter immunofluorescence (IF) microscopy allows non-invasive characterization of cancer cell biomarker expression in real time. This information can be helpful in prognosis, treatment selection, and stratification of cancer patients. AccuCyte® is a density-based unbiased isolation method that transfers nucleated cells from whole blood to slides for the characterization of CTCs and other rare cells. RarePlex® panel kits are IF staining reagents used on automated slide staining instruments to label cells to differentiate CTCs from white blood cells (WBC). CyteFinder® is a seven-channel automated fluorescent imaging system that rapidly scans microscope slides and applies machine learning algorithms to identify CTCs. Together, these technologies provide an end-to-end solution for CTC characterization. For analysis, blood is drawn into AccuCyte blood collection tubes (BCTs) containing a preservative which maintains cell properties prior to processing onto slides. Once slides are prepared, they can be stored at -20°C without significant biomarker degradation. This flexible workflow allows investigators to bank samples for batch analysis and to begin sample collection prior to validating the IF assay to be used. This study was designed to evaluate: (1) stability time between collection in the AccuCyte BCT and sample processing; (2) performance of an improved version of the AccuCyte kit with higher nucleated cell isolation capacity; and (3) storage time that AccuCyte prepared slides can be banked frozen prior to staining. The study was performed using model CTCs and cancer patient samples. Metrics to determine performance were CTC recovery and mean fluorescence intensity (MFI) of biomarker expression. Our results demonstrate that the AccuCyte BCT preserves blood components for at least 5 days after collection without significant effect on CTC recovery or biomarker expression. The latest version of the AccuCyte kit demonstrated a higher cell isolation capacity and could collect up to 60% more nucleated blood cells than the previous version, increasing CTC recovery. The increased capacity was demonstrated in patients treated with hematopoietic growth factors, whose WBC count was significantly higher than the normal range. Finally, accelerated-aging study results demonstrated that AccuCyte-prepared slides can be stored at -20°C for at least 4 years without significant effect on most biomarkers tested. In conclusion, enhancements to the AccuCyte-CyteFinder platform reported here increase flexibility and performance for analysis of CTCs in global clinical trials by allowing longer periods of time before collected blood samples need to be processed and by extending the length of time processed slides can be banked before they are stained. Citation Format: Arturo B. Ramirez, Lillian Costandy, Brady S. Gardner, Ryan H. Huston, A Anders Larson Tevis, Casey E. Helmicki, Alisa C. Clein, Daniel E. Sabath, Joshua J. Nordberg, Tad C. George. Validation of enhanced performance of the AccuCyte®-CyteFinder® platform for circulating tumor cell characterization [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 1952.

Read full abstract

Flexible Workflow Research Articles

Related Topics

Articles published on Flexible Workflow

Derivation of rock's geomechanical parameters while drilling by combining surface drilling data, gamma ray data, and machine learning techniques in carbonate formations

FCLWebVis: A flexible cross-language web-based data visualization framework

Structural Properties of Metal-Organic Frameworks at Elevated Thermal Conditions via a Combined Density Functional Tight Binding Molecular Dynamics (DFTB MD) Approach.

Arcch: Improving the Performance of Clonal Hematopoiesis Variant Calling and Interpretation Using a Consensus Based Approach

Ferrobotic swarms enable accessible and adaptable automated viral testing.

HT-smFISH: a cost-effective and flexible workflow for high-throughput single-molecule RNA imaging.

Comparative evaluation of the heterozygous variant standard deviation as a quality measure for next-generation sequencing

Advancing caching and automation with FDO

Polar biodiversity data: From a national marine platform to a global data portal

GenES-MDA: A generic open-source software package to solve inverse problems via the Ensemble Smoother with Multiple Data Assimilation

The Landscape Fire Scars Database: mapping historical burned area and fire severity in Chile

Combining Computational and Archival Methods to Study International Organizations: Refugees and the International Labour Organization, 1919–2015

Modeling a Flexible Flow Shop Scheduling Problem without Unemployment by Considering Sequence-Dependent Preparation Times and Solving it with a Meta-Heuristic Algorithm

Abstract 1952: Validation of enhanced performance of the AccuCyte®-CyteFinder® platform for circulating tumor cell characterization

Nurses' experience of using video consultation in a digital care setting and its impact on their workflow and communication.

Lessons Learned from the Impact of COVID-19 on NCI-sponsored Cancer Prevention Clinical Trials: Moving Toward Participant-centric Study Designs.

Open data and model integration through generic model agent toolkit in CyberWater framework

MiND (miRNA NGS Discovery pipeline): a small RNA-seq analysis pipeline and report generator for microRNA biomarker discovery studies

Optimization of enzymatic fragmentation is crucial to maximize genome coverage: a comparison of library preparation methods for Illumina sequencing

Utilizing multi-level concepts for multi-phase modeling

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Flexible Workflow Research Articles

Related Topics

Articles published on Flexible Workflow

Derivation of rock's geomechanical parameters while drilling by combining surface drilling data, gamma ray data, and machine learning techniques in carbonate formations

FCLWebVis: A flexible cross-language web-based data visualization framework

Structural Properties of Metal-Organic Frameworks at Elevated Thermal Conditions via a Combined Density Functional Tight Binding Molecular Dynamics (DFTB MD) Approach.

Arcch: Improving the Performance of Clonal Hematopoiesis Variant Calling and Interpretation Using a Consensus Based Approach

Ferrobotic swarms enable accessible and adaptable automated viral testing.

HT-smFISH: a cost-effective and flexible workflow for high-throughput single-molecule RNA imaging.

Comparative evaluation of the heterozygous variant standard deviation as a quality measure for next-generation sequencing

Advancing caching and automation with FDO

Polar biodiversity data: From a national marine platform to a global data portal

GenES-MDA: A generic open-source software package to solve inverse problems via the Ensemble Smoother with Multiple Data Assimilation

The Landscape Fire Scars Database: mapping historical burned area and fire severity in Chile

Combining Computational and Archival Methods to Study International Organizations: Refugees and the International Labour Organization, 1919–2015

Modeling a Flexible Flow Shop Scheduling Problem without Unemployment by Considering Sequence-Dependent Preparation Times and Solving it with a Meta-Heuristic Algorithm

Abstract 1952: Validation of enhanced performance of the AccuCyte®-CyteFinder® platform for circulating tumor cell characterization

Nurses' experience of using video consultation in a digital care setting and its impact on their workflow and communication.

Lessons Learned from the Impact of COVID-19 on NCI-sponsored Cancer Prevention Clinical Trials: Moving Toward Participant-centric Study Designs.

Open data and model integration through generic model agent toolkit in CyberWater framework

MiND (miRNA NGS Discovery pipeline): a small RNA-seq analysis pipeline and report generator for microRNA biomarker discovery studies

Optimization of enzymatic fragmentation is crucial to maximize genome coverage: a comparison of library preparation methods for Illumina sequencing

Utilizing multi-level concepts for multi-phase modeling