Haplotype frequency inference from pooled genetic data with a latent multinomial model.

Yong See Foo, Jennifer Flegg

doi:10.1109/tcbb.2024.3420430

Abstract

In genetic association studies, haplotype data provide more refined information than data about separate genetic markers. However, large-scale studies that genotype hundreds to thousands of individuals may provide only pooled results. Methods for inferring haplotype frequencies from pooled genetic data that scale well with pool size rely on a normal approximation, which we observe to produce unreliable inference when applied to real data. We illustrate cases where the approximation fails because the covariance matrix of the normal approximation is near-singular. As an alternative to approximate methods, in this paper we propose two exact methods to infer haplotype frequencies from pooled genetic data based on a latent multinomial model, where the pooled results are considered integer combinations of latent, unobserved haplotype counts. One of our methods, latent count sampling via Markov bases, achieves approximately linear runtime with respect to pool size. Our exact methods produce more accurate inference than existing approximate methods for synthetic data and for haplotype data from the 1000 Genomes Project. We also demonstrate how our methods can be applied to time series of pooled genetic data, as a proof of concept of how our methods are relevant to more complex hierarchical settings, such as spatiotemporal models.
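
The idea of pooled results as integer combinations of latent haplotype counts can be illustrated with a small simulation. The sketch below is not the authors' implementation. It assumes biallelic markers, a pool that reports only the alternate-allele count at each marker, and a 0/1 configuration matrix `A`. The function names, the frequencies and the pool size are all hypothetical choices made for illustration.

```python
import itertools
import numpy as np

def haplotype_matrix(num_markers):
    """Rows are markers, columns are the 2**num_markers possible haplotypes.

    Entry (m, h) is 1 if haplotype h carries the alternate allele at marker m.
    This layout is an assumption made for illustration.
    """
    haps = list(itertools.product([0, 1], repeat=num_markers))
    return np.array(haps, dtype=int).T

def simulate_pool(freqs, num_markers, pool_size, rng):
    """Draw latent haplotype counts z ~ Multinomial(pool_size, freqs)
    and return the pooled allele counts y = A z."""
    A = haplotype_matrix(num_markers)
    z = rng.multinomial(pool_size, freqs)
    return A, z, A @ z

rng = np.random.default_rng(0)
# Hypothetical frequencies for haplotypes 00, 01, 10, 11 (two markers).
freqs = np.array([0.4, 0.1, 0.2, 0.3])
A, z, y = simulate_pool(freqs, num_markers=2, pool_size=50, rng=rng)

# A move in the integer kernel of A changes the latent counts without
# changing the pooled data or the pool size. z + move is a valid
# alternative latent state only when all of its entries are non-negative.
move = np.array([1, -1, -1, 1])
z_alt = z + move

print("latent counts z:", z)
print("pooled counts y:", y)
print("A @ move:", A @ move)
```

Because `A @ move` is zero, `z` and `z + move` give the same pooled counts. This is why the latent counts cannot be read off the pooled data and have to be inferred or sampled, for example by stepping between latent states with moves of this kind.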

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Similar Papers

Extending the latent multinomial model with complex error processes and dynamic Markov bases
Simon J Bonner ... Patrik Noren
The Annals of Applied Statistics | VOL. 10
01 Mar 2016

Predicting breeding values and accuracies from group in comparison to individual observations
K M Olson ... D J Garrick
Journal of Animal Science | VOL. 84
01 Jan 2006

The exact and approximate method in mechanical system's analysis
Andrzej Buchacz ... Marek Płaczek
PAMM | VOL. 10
16 Nov 2010

PERFORMANCE ANALYSIS OF OPTIMIZATION METHODS FOR SOLVING TRAVELING SALESMAN PROBLEM
Chandra Agung ... Natalia Christine
Innovative Technologies and Scientific Solutions for Industries | VOL. -
31 Mar 2021
