Genotyping Polyploids from Messy Sequencing Data.

David Gerard,Luis Felipe Ventorim Ferrão,Antonio Augusto Franco Garcia,Matthew Stephens

doi:10.1534/genetics.118.301468

Abstract

Detecting and quantifying the differences in individual genomes (i.e., genotyping), plays a fundamental role in most modern bioinformatics pipelines. Many scientists now use reduced representation next-generation sequencing (NGS) approaches for genotyping. Genotyping diploid individuals using NGS is a well-studied field, and similar methods for polyploid individuals are just emerging. However, there are many aspects of NGS data, particularly in polyploids, that remain unexplored by most methods. Our contributions in this paper are fourfold: (i) We draw attention to, and then model, common aspects of NGS data: sequencing error, allelic bias, overdispersion, and outlying observations. (ii) Many datasets feature related individuals, and so we use the structure of Mendelian segregation to build an empirical Bayes approach for genotyping polyploid individuals. (iii) We develop novel models to account for preferential pairing of chromosomes, and harness these for genotyping. (iv) We derive oracle genotyping error rates that may be used for read depth suggestions. We assess the accuracy of our method in simulations, and apply it to a dataset of hexaploid sweet potato (Ipomoea batatas). An R package implementing our method is available at https://cran.r-project.org/package=updog.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Genotyping Polyploids from Messy Sequencing Data.

Abstract

Talk to us

Similar Papers

More From: Genetics

Lead the way for us

Journal: Genetics	Publication Date: Sep 5, 2018
Citations: 153

Similar Papers

Benchmarking variant callers in next-generation and third-generation sequencing analysis.
Surui Pei ... Tao Liu
Briefings in Bioinformatics | VOL. 22
Surui Pei, et. al.Surui Pei ... Tao Liu
23 Jul 2020
Briefings in Bioinformatics | VOL. 22

Abstract 1660: Identification of allelic imbalance utilizing heterozygous genotype allele frequencies and intensities
Kyle Chang ... Zuhal Ozcan
Cancer Research | VOL. 79
Kyle Chang, et. al.Kyle Chang ... Zuhal Ozcan
01 Jul 2019
Abstract 1660: Identification of allelic imbalance utilizing heterozygous genotype allele frequencies and intensities
Kyle Chang ... Zuhal Ozcan

A review on advancements in feature selection and feature extraction for high-dimensional NGS data analysis.
Kasmika Borah ... Saurav Mallik
Functional & integrative genomics | VOL. 24
Kasmika Borah, et. al.Kasmika Borah ... Saurav Mallik
19 Aug 2024
Functional & integrative genomics | VOL. 24

Detection of FLT3 Internal Tandem Duplication in Targeted, Short-Read-Length, Next-Generation Sequencing Data
David H Spencer ... Eric J Duncavage
The Journal of Molecular Diagnostics | VOL. 15
David H Spencer, et. al.David H Spencer ... Eric J Duncavage
14 Nov 2012
The Journal of Molecular Diagnostics | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Genotyping Polyploids from Messy Sequencing Data.

Abstract

Talk to us

Similar Papers

More From: Genetics