Kalis: a modern implementation of the Li & Stephens model for local ancestry inference in R

Louis J M Aslett,Ryan R Christ

doi:10.1186/s12859-024-05688-8

Abstract

BackgroundApproximating the recent phylogeny of N phased haplotypes at a set of variants along the genome is a core problem in modern population genomics and central to performing genome-wide screens for association, selection, introgression, and other signals. The Li & Stephens (LS) model provides a simple yet powerful hidden Markov model for inferring the recent ancestry at a given variant, represented as an N×N\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$N \ imes N$$\\end{document} distance matrix based on posterior decodings.ResultsWe provide a high-performance engine to make these posterior decodings readily accessible with minimal pre-processing via an easy to use package kalis, in the statistical programming language R. kalis enables investigators to rapidly resolve the ancestry at loci of interest and developers to build a range of variant-specific ancestral inference pipelines on top. kalis exploits both multi-core parallelism and modern CPU vector instruction sets to enable scaling to hundreds of thousands of genomes.ConclusionsThe resulting distance matrices accessible via kalis enable local ancestry, selection, and association studies in modern large scale genomic datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Kalis: a modern implementation of the Li & Stephens model for local ancestry inference in R

Abstract

Talk to us

Similar Papers

More From: BMC bioinformatics

Lead the way for us

Journal: BMC bioinformatics	Publication Date: Feb 28, 2024
License type: CC BY 4.0

Similar Papers

Author response: A method for low-coverage single-gamete sequence analysis demonstrates adherence to Mendel’s first law across a large sample of human sperm
Kathryn J Weaver ... Avery Davis Bell
-
Kathryn J Weaver, et. al.Kathryn J Weaver ... Avery Davis Bell
05 May 2022
05 May 2022

Decision letter: A method for low-coverage single-gamete sequence analysis demonstrates adherence to Mendel’s first law across a large sample of human sperm
Molly Przeworski
-
Molly PrzeworskiMolly Przeworski
19 Apr 2022
19 Apr 2022

Editor's evaluation: A method for low-coverage single-gamete sequence analysis demonstrates adherence to Mendel’s first law across a large sample of human sperm
Daniel R Matute
-
Daniel R MatuteDaniel R Matute
19 Apr 2022
19 Apr 2022

Analysis of Complex Disease Association and Linkage Studies Using the University of California Santa Cruz Genome Browser
Tianyuan Wang ... Terrence S Furey
Circulation: Cardiovascular Genetics | VOL. 2
Tianyuan Wang, et. al.Tianyuan Wang ... Terrence S Furey
01 Apr 2009
Circulation: Cardiovascular Genetics | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kalis: a modern implementation of the Li & Stephens model for local ancestry inference in R

Abstract

Talk to us

Similar Papers

More From: BMC bioinformatics