Abstract

SummaryGenotype imputation using a reference panel that combines high-density array data and publicly available whole genome sequence consortium variant data is potentially a cost-effective method to increase the density of extant lower-density array datasets. In this study, three datasets (two Border Collie; one Italian Spinone) generated using a legacy array (Illumina CanineHD, 173 662 SNPs) were utilised to assess the feasibility and accuracy of this approach and to gather additional evidence for the efficacy of canine genotype imputation. The cosmopolitan reference panels used to impute genotypes comprised dogs of 158 breeds, mixed breed dogs, wolves and Chinese indigenous dogs, as well as breed-specific individuals genotyped using the Axiom Canine HD array. The two Border Collie reference panels comprised 808 individuals including 79 Border Collies and 426 326 or 426 332 SNPs; and the Italian Spinone reference panel comprised 807 individuals including 38 Italian Spinoni and 476 313 SNPs. A high accuracy for imputation was observed, with the lowest accuracy observed for one of the Border Collie datasets (mean R2 = 0.94) and the highest for the Italian Spinone dataset (mean R2 = 0.97). This study’s findings demonstrate that imputation of a legacy array study set using a reference panel comprising both breed-specific array data and multi-breed variant data derived from whole genomes is effective and accurate. The process of canine genotype imputation, using the valuable growing resource of publicly available canine genome variant datasets alongside breed-specific data, is described in detail to facilitate and encourage use of this technique in canine genetics.

Highlights

  • Genotype imputation is a computational method that predicts missing genotypes in a dataset of genotyped individuals, using a reference panel of individuals genotyped at a higher density (Marchini et al 2007; Howie et al 2009)

  • This can be seen clearly when comparing the relative sizes of the Border Collie and Italian Spinone reference panels and the number of SNPs in the resulting imputed datasets (Table 1)

  • This research has demonstrated and described in detail the successful use of imputation to bring the SNP density of the commonly used Illumina array closer to that of datasets generated using the newer higher-density, and increasingly used, Axiom array. This represents a costeffective method to make the most use of extant data, without the need to re-genotype all individuals or generate large whole genome sequencing (WGS) datasets as would be necessary for imputation up to the density of WGS, which has been the predominant focus of previous literature in the canine field

Read more

Summary

Introduction

Genotype imputation is a computational method that predicts missing genotypes in a dataset of genotyped individuals, using a reference panel of individuals genotyped at a higher density (Marchini et al 2007; Howie et al 2009).

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.