End Motif Research Articles

Abstract Introduction: Genomic scale copy number, methylation, and fragmentation aberrations are proven genetic and epigenetic biomarkers of circulating tumor DNA (ctDNA). We developed a novel cell-free DNA (cfDNA) methylome sequencing assay that allows for integrative analysis of these genomic features for sensitive detection of multiple types of cancer. Methods: Whole methylome sequencing (WMS) libraries were generated from enzymatically converted cfDNA. Low-pass (~2X) paired-end NGS sequencing was performed on WMS libraries and paired whole genome sequencing (WGS) libraries of unconverted cfDNA for technical comparison and analytical validation. For development of cancer detection models, we profiled the genome-wide methylation density (MD), fragment size index (FSI), fragment end motif (motif) and chromosome instability (CIN) based on WMS data from a discovery cohort of 352 healthy controls and 559 newly diagnosed cancer patients (45 breast, 105 colorectal, 44 esophageal, 79 gastric, 79 liver, 110 lung, 83 pancreatic, and 14 others), 34.5% of which were at stage I or II. Machine learning models, including KNN, SVM, LR, GBDT, and random forest were trained and tested for individual biomarker types, with a final ensemble classifier to integrate all biomarkers. Performance of the predictive model was confirmed on an independent validation cohort consisting of 145 healthy controls and 236 cancer patients (21 breast, 45 colorectal, 18 esophageal, 35 gastric, 34 liver, 47 lung, and 36 pancreatic), among which 31.8% were at early stages (I or II). Results: WMS and WGS data from 512 cfDNA samples showed high concordance in CIN (R=0.988, 95% CI: 0.986-0.990) and FSI (R=0.961, 0.954-0.967) profiles. On the independent validation cohort, the optimal model selected for each of individual genomic features achieved following area under the ROC curve (AUC) values for cancer detection: MD-KNN, 0.830 (0.789-0.870); FSI-SVM, 0.904 (0.874-0.933); motif-SVM, 0.943 (0.920-0.966); and CIN-PAscore, 0.812 (0.770-0.854). The ensemble classifier based on linear SVM outperformed individual biomarkers, with an AUC value of 0.952 (0.934-0.971), which translated to, at 95% specificity, detection sensitivity of 66.7% for breast, 77.8% for colorectal, 83.3% for esophageal, 62.9% for gastric, 82.4% for liver, 66.0% for lung, and 77.8% for pancreatic cancers. Noteworthily, the overall sensitivity on early-stage cancer was 74.7%. Conclusions: These results demonstrate the first proof of principle on the feasibility of integrating multiple genomic cancer markers on the same WMS technical platform. Low-pass WMS on plasma cfDNA from 10ml of blood with integrative multimodal analysis of methylation, fragmentation, and CNV profiles yields in satisfactory sensitivity and specificity for detection of multiple types of cancer, warranting a forthcoming prospective study to further assess its clinical performance in a larger cohort. Citation Format: Yulong Li, Fenglong Bie, Fengwei Tan, Tiancheng Han, Shunli Yang, Fang Lv, Peiyao Nie, Qi Zhang, Yuanyuan Hong, Zhijie Wang, Ji He, Weizhi Chen, Liang Zhao, Shugeng Gao. Multimodal analysis of plasma cell-free DNA methylome for sensitive multi-cancer detection [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 5150.

Read full abstract

Abstract▪2173▪This icon denotes a clinically relevant abstractThe adaptive arm of the immune system - the T-cell compartment – may become compromised by inherited or acquired defects resulting in cancer, autoimmunity, or increased susceptibility to microbial infectious agents. A normal polyclonal T cell compartment comprises an estimated number of 2.5 × 10E7 individual T cell clones each expressing a unique antigen recognizing T cell receptor (TCR). The functionality of the T cell compartment is thus – at least partly - reflected by TCR sequence diversity. There is a medical need for rapid and robust diagnostic approaches that accurately monitor TCR diversity in patient samples, e.g. after bone marrow transplantation (BMT).Previously, complementarity determining region 3 (CDR3) size spectratyping in TCR β-chain subfamilies (Vβ), an immunoscopic technique, was employed for the analysis of T cell diversity. However, spectratyping is limited to the analysis of CDR3 length polymorphisms only. Underlying diversity of TCR Vβ sequences of equal length remain undetected. Furthermore, spectratyping is time consuming and consequently data can only be interpreted with a delay of weeks.To determine TCR diversity fast and accurately we developed next-generation-sequencing spectratyping (NGS-S), which employs high coverage, massive parallel Roche/454-sequencing of TCR Vβ-chain amplicons.Three different sample groups were analyzed in parallel by spectratyping and NGS-S: T cells from (1) healthy children (n=6), (2) children at diagnosis of severe aplastic anemia (n=7), and (3) children who underwent haploidentical BMT (n=7). In brief, RNA was extracted from bone marrow derived CD8+ cells and transcribed to cDNA. Amplicon libraries were generated by PCR employing two degenerated wobble primers (VP1, VP2), designed to cover most of the known TCR Vβ gene segments and a universal reverse primer (CP1) located in the conserved TCR region (Figure 1). The mean overall coverage of the CDR3 region achieved was 23.133 per patient. [Display omitted] For simultaneous characterization of these individual amplicons we generated the TCR Profiler (Figure 2). This new software tool automatically preprocessed raw sequencing data using a threshold quality value (q=30) to trim the 3’ end of the TCR β-chain sequences. Rearranged germline TCR Vβ and Jβ genes were identified by Smith-Waterman local alignment against each human TCR β-chain germline gene of the IMGT/GENE-DB reference directory. Base call reliability was assured by incorporating phred-like quality values provided by the sequencer into the Smith-Waterman local alignment routine. A quality value, q, was computed for each base as the log-transformation of the probability p of a base being incorrectly called, q = − 10 x log10(p). Transformed into a reliability measure, r = 1- (1/10^(q/10)), q-values indicated the probability for each base to be correctly called. CDR3 regions were delimited using specific flanking amino acid sequence motifs. The 5’ end motif varies dependently on the rearranged TCR Vβ gene and was identified using the IMGT/GENE-DB reference directory sequence set, whereas the 3’ end motif [W/F]GXG (IUPAC code) is conserved in all TCR β-chains. CDR3 length was calculated including all amino acids between the two motifs. Screening for in-frame stop codons was done to exclude non-functional TCR β-chain transcripts. [Display omitted] The TCR profiler identified on average 16165 of the input sequences as unique CDR3 sequences. Of these a mean of 9840 were predicted to be functionally rearranged.Whereas spectratyping gave a rough estimate of CDR3 size and allowed T cell deficient samples to be identified, NGS-S determined the exact length and sequence composition of the CDR3, identified the rearranged TCR Vβ and Jβ genes and the specific recombination (Figure 3A). [Display omitted] Utilization of specific genes, the resulting amino acid composition of the CDR3 region as well as its length and overall diversity were integrated into a novel NGS-S score.This score reliably differentiated not only between normal and T cell deficient samples, but also between the different T cell deficient groups (SAA and BMT) (Figure 3B). We conclude that NGS-S allows rapid and precise determination of TCR diversity in clinical samples. Disclosures:No relevant conflicts of interest to declare.

Read full abstract

End Motif Research Articles

Articles published on End Motif

Abstract 6371: Deep learning algorithm for multi-cancer detection and classification using cf-WGS

Abstract 5150: Multimodal analysis of plasma cell-free DNA methylome for sensitive multi-cancer detection

Abstract 1912: A versatile computational pipeline for the preprocessing of cell-free DNA fragmentation data

Cell-Free DNA Fragmentomics in Liquid Biopsy.

Refined characterization of circulating tumor DNA through biological feature integration

Emerging frontiers of cell-free DNA fragmentomics

Single-molecule sequencing reveals a large population of long cell-free DNA molecules in maternal plasma

451P Utility of circulating free DNA 5’-end motif profile in the prediction of pathological response after neoadjuvant chemoradiotherapy in patients with locally advanced rectal cancer

Characterization of fragment sizes, copy number aberrations and 4-mer end motifs in cell-free DNA of hepatocellular carcinoma for enhanced liquid biopsy-based cancer detection.

Plasma DNA Profile Associated with DNASE1L3 Gene Mutations: Clinical Observations, Relationships to Nuclease Substrate Preference, and In Vivo Correction

Abstract IA01: Plasma DNA-based molecular diagnostics: Fragments, circles and beyond

Plasma DNA End-Motif Profiling as a Fragmentomic Marker in Cancer, Pregnancy, and Transplantation.

RNA Circularization Diminishes Immunogenicity and Can Extend Translation Duration InVivo.

Dnase1l3 deletion causes aberrations in length and end-motif frequencies in plasma DNA

Shiga Toxin Subtypes and Virulence Genes in Escherichia coli Isolated from Cattle.

A supramolecular DNA self-assembly based on β-cyclodextrin–adamantane complexation as a bioorthogonal sticky end motif

Next Generation Sequencing Spectratyping (NGS-S) Comprehensively Monitors T Cell Receptor Diversity in Children with T Cell Abnormalities

Synthesis of Biantennary Complex-Type Nonasaccharyl Asn Building Blocks for Solid-Phase Glycopeptide Synthesis

Mint-3 Regulates the Retrieval of the Internalized Membrane-type Matrix Metalloproteinase, MT5-MMP, to the Plasma Membrane by Binding to Its Carboxyl End Motif EWV

Recombination signal sequence variations and the mechanism of patterned T-cell receptor-β locus rearrangement

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

End Motif Research Articles

Articles published on End Motif

Abstract 6371: Deep learning algorithm for multi-cancer detection and classification using cf-WGS

Abstract 5150: Multimodal analysis of plasma cell-free DNA methylome for sensitive multi-cancer detection

Abstract 1912: A versatile computational pipeline for the preprocessing of cell-free DNA fragmentation data

Cell-Free DNA Fragmentomics in Liquid Biopsy.

Refined characterization of circulating tumor DNA through biological feature integration

Emerging frontiers of cell-free DNA fragmentomics

Single-molecule sequencing reveals a large population of long cell-free DNA molecules in maternal plasma

451P Utility of circulating free DNA 5’-end motif profile in the prediction of pathological response after neoadjuvant chemoradiotherapy in patients with locally advanced rectal cancer

Characterization of fragment sizes, copy number aberrations and 4-mer end motifs in cell-free DNA of hepatocellular carcinoma for enhanced liquid biopsy-based cancer detection.

Plasma DNA Profile Associated with DNASE1L3 Gene Mutations: Clinical Observations, Relationships to Nuclease Substrate Preference, and In Vivo Correction

Abstract IA01: Plasma DNA-based molecular diagnostics: Fragments, circles and beyond

Plasma DNA End-Motif Profiling as a Fragmentomic Marker in Cancer, Pregnancy, and Transplantation.

RNA Circularization Diminishes Immunogenicity and Can Extend Translation Duration InVivo.

Dnase1l3 deletion causes aberrations in length and end-motif frequencies in plasma DNA

Shiga Toxin Subtypes and Virulence Genes in Escherichia coli Isolated from Cattle.

A supramolecular DNA self-assembly based on β-cyclodextrin–adamantane complexation as a bioorthogonal sticky end motif

Next Generation Sequencing Spectratyping (NGS-S) Comprehensively Monitors T Cell Receptor Diversity in Children with T Cell Abnormalities

Synthesis of Biantennary Complex-Type Nonasaccharyl Asn Building Blocks for Solid-Phase Glycopeptide Synthesis

Mint-3 Regulates the Retrieval of the Internalized Membrane-type Matrix Metalloproteinase, MT5-MMP, to the Plasma Membrane by Binding to Its Carboxyl End Motif EWV

Recombination signal sequence variations and the mechanism of patterned T-cell receptor-β locus rearrangement