Abstract

MotivationAncestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more. Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference.ResultsWe introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data. Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure. It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference. Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis. Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS.Availabilityand implementationSPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.com/maizeatlas/spears).Supplementary information Supplementary data are available at Bioinformatics online.

Highlights

  • Reconstructing the descendent haplotype structure of ho- grees

  • We introduce SPEARS, a pipeline for whole simulation-based appraisal of genome-wide ancestral haplotype inference

  • T with real-world missing data structure. It proceeds to mimic analysis in practice, capturing sources of error due to imputation and reconstruction of ancestral haplotypes

Read more

Summary

Introduction

Reconstructing the descendent haplotype structure of ho- grees Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and exthaliana MAGIC design [12]) [10]. Tailored for RABBIT but extensible for other tools, we present a standard performance evaluation of ancestral reconstruction through simulation (SPEARS) pipeline. SPEARS is designed to determine expectations for the accuracy of haplotype reconstruction software and different population designs. As proof-ofconcept and a new demonstration of RABBIT, we develop a isting data for a given population provides accurate haplotype detailed picture on the variation in accuracy for genome-wide information for further inference

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.