Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions.

Shu-Yi Su,Lachlan Jm Coin,David J Balding,Jonathan White

doi:10.1186/1471-2105-9-513

Shu-Yi Su, Lachlan Jm Coin + Show 2 more

Open Access

PDF Available

https://doi.org/10.1186/1471-2105-9-513

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

BackgroundThe power of haplotype-based methods for association studies, identification of regions under selection, and ancestral inference, is well-established for diploid organisms. For polyploids, however, the difficulty of determining phase has limited such approaches. Polyploidy is common in plants and is also observed in animals. Partial polyploidy is sometimes observed in humans (e.g. trisomy 21; Down's syndrome), and it arises more frequently in some human tissues. Local changes in ploidy, known as copy number variations (CNV), arise throughout the genome. Here we present a method, implemented in the software polyHap, for the inference of haplotype phase and missing observations from polyploid genotypes. PolyHap allows each individual to have a different ploidy, but ploidy cannot vary over the genomic region analysed. It employs a hidden Markov model (HMM) and a sampling algorithm to infer haplotypes jointly in multiple individuals and to obtain a measure of uncertainty in its inferences.ResultsIn the simulation study, we combine real haplotype data to create artificial diploid, triploid, and tetraploid genotypes, and use these to demonstrate that polyHap performs well, in terms of both switch error rate in recovering phase and imputation error rate for missing genotypes. To our knowledge, there is no comparable software for phasing a large, densely genotyped region of chromosome from triploids and tetraploids, while for diploids we found polyHap to be more accurate than fastPhase. We also compare the results of polyHap to SATlotyper on an experimentally haplotyped tetraploid dataset of 12 SNPs, and show that polyHap is more accurate.ConclusionWith the availability of large SNP data in polyploids and CNV regions, we believe that polyHap, our proposed method for inferring haplotypic phase from genotype data, will be useful in enabling researchers analysing such data to exploit the power of haplotype-based analyses.

Highlights

The power of haplotype-based methods for association studies, identification of regions under selection, and ancestral inference, is well-established for diploid organisms
With genetic or physical maps of plant genomes becoming increasingly available and with increasing numbers of copy number variations (CNV) regions identified and improving technology for genotyping copy number polymorphisms (CNP), we believe that polyHap provides a timely addition to the geneticist's toolkit
Due to the limited availability of phased SNP data from polyploid species, we evaluated the performance of polyHap by randomly combining human male X-chromosome haplotypes from the WTCCC to create datasets of artificial diploid, triploid and tetraploid genotypes

Summary

Introduction

The power of haplotype-based methods for association studies, identification of regions under selection, and ancestral inference, is well-established for diploid organisms. Partial polyploidy is sometimes observed in humans (e.g. trisomy 21; Down's syndrome), and it arises more frequently in some human tissues. We present a method, implemented in the software polyHap, for the inference of haplotype phase and missing observations from polyploid genotypes. PolyHap allows each individual to have a different ploidy, but ploidy cannot vary over the genomic region analysed. It employs a hidden Markov model (HMM) and a sampling algorithm to infer haplotypes jointly in multiple individuals and to obtain a measure of uncertainty in its inferences. Haplotypebased methods may be used to infer aspects of population history, such as the effects of positive selection [5] and recombination events [6].

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Dec 1, 2008
Citations: 29	License type: cc-by

R Discovery Prime

Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Genome wide copy number variations using Porcine 60K SNP Beadchip in Landlly pigs
Snehasmita Panda ... Triveni Dutt
Animal Biotechnology | VOL. 34
Snehasmita Panda, et. al.Snehasmita Panda ... Triveni Dutt
29 Mar 2022
Animal Biotechnology | VOL. 34

Haplotype phasing and inheritance of copy number variants in nuclear families.
Priit Palta ... Andres Veidenberg
PloS one | VOL. 10
Priit Palta, et. al.Priit Palta ... Andres Veidenberg
08 Apr 2015
PloS one | VOL. 10

Copy number variation in human genomes from three major ethno-linguistic groups in Africa
Oscar A Nyangiri ... Enock Matovu
BMC Genomics | VOL. 21
Oscar A Nyangiri, et. al.Oscar A Nyangiri ... Enock Matovu
10 Apr 2020
BMC Genomics | VOL. 21

Genome-wide elucidation of CNV regions and their association with production and reproduction traits in composite Vrindavani cattle
Sheikh Firdous Ahmad ... Triveni Dutt
Gene | VOL. 830
Sheikh Firdous Ahmad, et. al.Sheikh Firdous Ahmad ... Triveni Dutt
18 Apr 2022
Gene | VOL. 830

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Bioinformatics