Abstract

Recent advances in throughput and accuracy mean that the Oxford Nanopore Technologies PromethION platform is a now a viable solution for genome sequencing. Much of the validation of bioinformatic tools for this long-read data has focussed on calling germline variants (including structural variants). Somatic variants are outnumbered many-fold by germline variants and their detection is further complicated by the effects of tumour purity/subclonality. Here, we evaluate the extent to which Nanopore sequencing enables detection and analysis of somatic variation. We do this through sequencing tumour and germline genomes for a patient with diffuse B-cell lymphoma and comparing results with 150 bp short-read sequencing of the same samples. Calling germline single nucleotide variants (SNVs) from specific chromosomes of the long-read data achieved good specificity and sensitivity. However, results of somatic SNV calling highlight the need for the development of specialised joint calling algorithms. We find the comparative genome-wide performance of different tools varies significantly between structural variant types, and suggest long reads are especially advantageous for calling large somatic deletions and duplications. Finally, we highlight the utility of long reads for phasing clinically relevant variants, confirming that a somatic 1.6 Mb deletion and a p.(Arg249Met) mutation involving TP53 are oriented in trans.

Highlights

  • Generation sequencing (NGS) technologies have enabled a number of applications in ­genomics[1,2,3,4,5]

  • Short-read, Illumina sequencing is considered the gold standard for the majority of clinical sequencing ­projects[21], such data lead to biases even in Whole Genome Sequencing (WGS), due to uneven coverage of regions with high/low GC content and the difficulty of aligning short reads derived from repetitive DNA s­ equences[20,22,23]

  • The Diffuse Large B-cell Lymphoma (DLBCL) patient was recruited as part of a large-scale clinical sequencing study utilizing the Illumina short read platform, details of which have been previously ­published[10]

Read more

Summary

Introduction

Generation sequencing (NGS) technologies have enabled a number of applications in ­genomics[1,2,3,4,5]. We aimed to evaluate the extent to which high coverage, long-read Nanopore sequencing enables the chromosome-wide or genome-wide analysis of a broad range of somatic variation, by comparison with the current gold-standard, Illumina short-read WGS. We do this through conducting in-depth analysis of germline and tumour sequencing data from a patient with Diffuse Large B-cell Lymphoma (DLBCL), an aggressive form of non-Hodgkin’s Lymphoma. We compare the performance of multiple tools on the long-read data and provide recommendations for future studies

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call