Abstract

BackgroundIdentification of germline variation and somatic mutations is a major issue in human genetics. However, due to the limitations of DNA sequencing technologies and computational algorithms, our understanding of genetic variation and somatic mutations is far from complete.MethodsIn the present study, we performed whole-genome sequencing using long-read sequencing technology (Oxford Nanopore) for 11 Japanese liver cancers and matched normal samples which were previously sequenced for the International Cancer Genome Consortium (ICGC). We constructed an analysis pipeline for the long-read data and identified germline and somatic structural variations (SVs).ResultsIn polymorphic germline SVs, our analysis identified 8004 insertions, 6389 deletions, 27 inversions, and 32 intra-chromosomal translocations. By comparing to the chimpanzee genome, we correctly inferred events that caused insertions and deletions and found that most insertions were caused by transposons and Alu is the most predominant source, while other types of insertions, such as tandem duplications and processed pseudogenes, are rare. We inferred mechanisms of deletion generations and found that most non-allelic homolog recombination (NAHR) events were caused by recombination errors in SINEs. Analysis of somatic mutations in liver cancers showed that long reads could detect larger numbers of SVs than a previous short-read study and that mechanisms of cancer SV generation were different from that of germline deletions.ConclusionsOur analysis provides a comprehensive catalog of polymorphic and somatic SVs, as well as their possible causes. Our software are available at https://github.com/afujimoto/CAMPHOR and https://github.com/afujimoto/CAMPHORsomatic.

Highlights

  • Identification of germline variation and somatic mutations is a major issue in human genetics

  • These results showed that the error rates of Nanopore long reads were not low, but the majority of reads were uniquely mapped to the human reference genome and could be used for the identification of structural variations (SVs)

  • We considered an insertion candidate as “an insertion caused by a deletion event in the human population” if it was present in the chimpanzee genome and “an insertion caused by an insertion event in the human population” if not seen in the chimpanzee genome (Additional file 1: Fig. S7)

Read more

Summary

Introduction

Identification of germline variation and somatic mutations is a major issue in human genetics. Somatic mutations are known to cause cancerogenesis and rare diseases [2,3,4]. Identification of these variants and mutations is a crucial issue in human genetics. Due to the limitations of DNA sequencing technologies and computational algorithms, our understanding of genetic variations and somatic mutations is far from complete [8, 9]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call