NpInv: accurate detection and genotyping of inversions using long read sub-alignment

Haojing Shao,Lachlan J M Coin,Clive J Hoggart,Minh Duc Cao,Tania Duarte,Devika Ganesamoorthy

doi:10.1186/s12859-018-2252-9

Haojing Shao, Lachlan J M Coin + Show 4 more

Open Access

https://doi.org/10.1186/s12859-018-2252-9

Copy DOI

Abstract

BackgroundDetection of genomic inversions remains challenging. Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored.ResultWe present npInv, a novel tool specifically for detecting and genotyping NAHR inversion using long read sub-alignment of long read sequencing data. We benchmark npInv with other tools in both simulation and real data. We use npInv to generate a whole-genome inversion map for NA12878 consisting of 30 NAHR inversions (of which 15 are novel), including all previously known NAHR mediated inversions in NA12878 with flanking IR less than 7kb. Our genotyping accuracy on this dataset was 94%. We used PCR to confirm the presence of two of these novel inversions. We show that there is a near linear relationship between the length of flanking IR and the minimum inversion size, without inverted repeats.ConclusionThe application of npInv shows high accuracy in both simulation and real data. The results give deeper insight into understanding inversion.

Highlights

Inversions can be broadly classified on the basis by which they are formed as nonhomologous end joining (NHEJ [2]), non allelic homologous recombination (NAHR) or fork stalling and template switching (FoSTeS [3]) inversions
Detecting and genotyping inversion We present Nanopore inversion (npInv), a novel tool designed for detecting and genotyping non-allelic homologous recombination (NAHR) mediated inversions from long read sequencing data
NpInv scans the alignment file for reads that contain pairs of subread alignments mapping to the same chromosome but with a different orientation (Fig. 2). npInv records this subread alignment pair as an inversion signal

Summary

Introduction

Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored. Inversions can be broadly classified on the basis by which they are formed as nonhomologous end joining (NHEJ [2]), non allelic homologous recombination (NAHR) or fork stalling and template switching (FoSTeS [3]) inversions. The inversion sequence ligates directly to breakpoint without large homologous sequence [2]. Inversion polymorphisms remain one of the most poorly mapped classes of genetic variation. Inversions can be detected from aberrant linkage disequilibrium (LD) patterns from population single-nucleotide polymorphism (SNP) genotyping data, but this provides limited power to detect inversions smaller than 500 kb or with minor allele frequency less than 25% [7,8,9]. Inversions can be inferred from second generation sequence data by abnormal pair end mapping and split read align-

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jul 13, 2018
Citations: 30	License type: open-access

R Discovery Prime

R Discovery Prime

NpInv: accurate detection and genotyping of inversions using long read sub-alignment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Rad51 and Rad54 promote noncrossover recombination between centromere repeats on the same chromatid to prevent isochromosome formation
Atsushi T Onaka ... Takuro Nakagawa
Nucleic Acids Research | VOL. 44
Atsushi T Onaka, et. al.Atsushi T Onaka ... Takuro Nakagawa
03 Oct 2016
Nucleic Acids Research | VOL. 44

Cruciform-forming inverted repeats appear to have mediated many of the microinversions that distinguish the human and chimpanzee genomes
Jessica Kolb ... David N Cooper
Chromosome Research | VOL. 17
Jessica Kolb, et. al.Jessica Kolb ... David N Cooper
01 May 2009
Chromosome Research | VOL. 17

Insertion Sequence Inversions Mediated by Ectopic Recombination between Terminal Inverted Repeats
Alison Ling ... Richard Cordaux
PLoS ONE | VOL. 5
Alison Ling, et. al.Alison Ling ... Richard Cordaux
20 Dec 2010
PLoS ONE | VOL. 5

Molecular characterization of a new patient with a non‐recurrent inv dup del 2q and review of the mechanisms for this rearrangement
Ascensión Vera-Carbonell ... María Ballesta-Martínez
American Journal of Medical Genetics Part A | VOL. 152A
Ascensión Vera-Carbonell, et. al.Ascensión Vera-Carbonell ... María Ballesta-Martínez
26 Aug 2010
American Journal of Medical Genetics Part A | VOL. 152A

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

NpInv: accurate detection and genotyping of inversions using long read sub-alignment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics