Abstract

Biogeographical ancestry (BGA) inference is based on the understanding of genetic diversity distribution among population groups. BGA inference is used to detect and measure the population structure that presents the natural assignment in genetic terms, identify genetic patterns found in individuals’ genotypes, and estimate an individual’s BGAs. In the context of forensic, BGA inference at an individual level gives the possibilities to achieve more complete identification of missing person or suspect. Current machine learning approach to BGA inference based on Bayesian theory and principle component analysis cannot operate on the data sequence directly and require predefined features extracted from the data sequence based on prior knowledge. In this paper, we conduct a survey of the state of the art of BGA inference and propose a new approach based on deep learning to BGA inference without prior feature extraction to find hidden genetic structure and provide more accurate predictions. Our experiments conducted on the dataset for Human Genome Diversity Project (HGDP) show better results for the proposed approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.