Abstract

Whole-genome sequencing with short-read technologies is well suited for calling single nucleotide polymorphisms, but has major problems with the detection of structural variants larger than the read length. One such type of variation is copy number variation (CNV), which entails deletion or duplication of genomic regions, and the expansion or contraction of repeated elements. Duplicated and deleted regions will typically be collapsed during de novo assembly of sequence data, or ignored when mapping reads toward a reference. However, signatures of the copy number variation can be detected in the resultant read depth at each position in the genome. We here provide instructions on how to analyze this read depth signal with the R package CNOGpro, allowing for estimation of copy numbers with uncertainty for each feature in a genome.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call