Tree sequences as a general-purpose tool for population genetic inference.

Logan S Whitehouse, Dylan D Ray, Daniel R Schrider

doi:10.1101/2024.02.20.581288


Open Access

https://doi.org/10.1101/2024.02.20.581288


Journal: bioRxiv : the preprint server for biology	Publication Date: Oct 5, 2024
Citations: 3	License type: CC BY 4.0

Abstract

As population genetic datasets grow in size, new data structures such as tree sequences have been developed to store genetic information efficiently. Tree sequences are efficient in both computation and storage, but they are not interchangeable with the data structures that many population genetic inference methods expect, such as the alignments used as input to convolutional neural networks (CNNs). To make better use of tree sequences, we propose and implement a graph convolutional network (GCN) that learns directly from tree sequence topology and node data, so that neural networks can be applied without an intermediate step of converting tree sequences to alignment format. We then compare our approach to standard CNN approaches on a set of previously defined benchmarking tasks, including recombination rate estimation, positive selection detection, introgression detection, and demographic model parameter inference. We show that a GCN can learn directly from tree sequences and performs well on these common population genetic inference tasks, with accuracies roughly matching or even exceeding those of a CNN-based method. As tree sequences become more widely used in population genetics research, we foresee further development and optimization of this work providing a foundation for population genetic inference moving forward.
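To make the idea of learning from tree topology and node data concrete, here is a minimal sketch of a single graph convolution step applied to one small tree. It is not the authors' architecture: the propagation rule is the generic normalized-adjacency GCN layer, and the 7-node tree, the use of node time as the only node feature, the random weights, and the mean-pooling step are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical 7-node tree: samples 0-3, internal nodes 4-6 (node 6 is the root).
# parent[i] is the parent of node i; -1 marks the root.
parent = np.array([4, 4, 5, 5, 6, 6, -1])
# Hypothetical node feature: node time (samples sit at time 0).
node_time = np.array([0.0, 0.0, 0.0, 0.0, 0.5, 0.8, 1.5])


def tree_adjacency(parent):
    """Build a symmetric adjacency matrix from a parent array."""
    n = len(parent)
    adj = np.zeros((n, n))
    for child, par in enumerate(parent):
        if par >= 0:
            adj[child, par] = 1.0
            adj[par, child] = 1.0
    return adj


def gcn_layer(adj, features, weights):
    """One generic GCN layer: ReLU(D^-1/2 (A + I) D^-1/2 H W)."""
    a_hat = adj + np.eye(adj.shape[0])          # add self-loops
    deg = a_hat.sum(axis=1)                     # degree including self-loop
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    norm_adj = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(norm_adj @ features @ weights, 0.0)


rng = np.random.default_rng(0)
adj = tree_adjacency(parent)
features = node_time[:, None]                   # shape (7, 1)
weights = rng.normal(size=(1, 4))               # untrained, illustrative weights
hidden = gcn_layer(adj, features, weights)      # shape (7, 4)

# Mean-pool node embeddings into one vector per tree (an assumed readout).
graph_embedding = hidden.mean(axis=0)           # shape (4,)
```

In a real pipeline the parent array and node times would come from the trees of a tree sequence rather than being written by hand, and the weights would be trained on a downstream task such as those listed in the abstract.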


Similar Papers

Tree sequences as a general-purpose tool for population genetic inference.
Logan S Whitehouse ... Daniel R Schrider
Molecular biology and evolution | 26 Oct 2024

Review: The evolution of chemometrics coupled with near infrared spectroscopy for fruit quality evaluation. II. The rise of convolutional neural networks
Jeremy Walsh ... Michael Li
Journal of Near Infrared Spectroscopy | VOL. 31 | 23 May 2023

Hybrid text classification model based on graph convolution network and neural network
Zhaohe Dong ... Zhengli Zhai
01 Jun 2023

Analysis of Control Flow Graphs Using Graph Convolutional Neural Networks
Patrick Philipp ... Rafael X Morales Georgi
01 Nov 2019
