Searching for improvements in predicting human eye colour from DNA

Magdalena Kukla-Bartoszek,Ewelina Pośpiech,Michał Boroń,Tomasz Grzybowski,Magdalena Spólnicka,Rafał Płoski,Paweł Teisseyre,Magdalena Zubańska,Agata Jarosz,Michał Dąbrowski,Piotr Zieliński,Joanna Karłowska-Pik,Jan Mielniczuk,Wojciech Branicki,Anna Woźniak

doi:10.1007/s00414-021-02645-5

Abstract

Increasing understanding of human genome variability allows for better use of the predictive potential of DNA. An obvious direct application is the prediction of the physical phenotypes. Significant success has been achieved, especially in predicting pigmentation characteristics, but the inference of some phenotypes is still challenging. In search of further improvements in predicting human eye colour, we conducted whole-exome (enriched in regulome) sequencing of 150 Polish samples to discover new markers. For this, we adopted quantitative characterization of eye colour phenotypes using high-resolution photographic images of the iris in combination with DIAT software analysis. An independent set of 849 samples was used for subsequent predictive modelling. Newly identified candidates and 114 additional literature-based selected SNPs, previously associated with pigmentation, and advanced machine learning algorithms were used. Whole-exome sequencing analysis found 27 previously unreported candidate SNP markers for eye colour. The highest overall prediction accuracies were achieved with LASSO-regularized and BIC-based selected regression models. A new candidate variant, rs2253104, located in the ARFIP2 gene and identified with the HyperLasso method, revealed predictive potential and was included in the best-performing regression models. Advanced machine learning approaches showed a significant increase in sensitivity of intermediate eye colour prediction (up to 39%) compared to 0% obtained for the original IrisPlex model. We identified a new potential predictor of eye colour and evaluated several widely used advanced machine learning algorithms in predictive analysis of this trait. Our results provide useful hints for developing future predictive models for eye colour in forensic and anthropological studies.

Highlights

Increasing understanding of human genome variability is enabling better use of DNA’s predictive potential [1]
There are many machine learning (ML) methods available for developing predictive models, and their effectiveness may depend on the type and amount of data used; some of them may be more suitable than others for taking into account diverse genetic phenomena, including epistasis
It has been proved that ensemble methods such as random forest (RF) or extreme gradient boosting (XGB) are among the most powerful classification models; they usually achieve significantly higher accuracy when compared to simple models

Summary

Introduction

Increasing understanding of human genome variability is enabling better use of DNA’s predictive potential [1]. It has been proved that ensemble methods such as random forest (RF) or extreme gradient boosting (XGB) are among the most powerful classification models; they usually achieve significantly higher accuracy when compared to simple models The price for this is the higher computational cost and more complicated interpretation. In the case of some classification methods, feature selection is an integral element of learning the model; for example, in tree-based methods, relevant attributes are chosen during the building of the tree Another solution is using regularization techniques [18], such as least absolute shrinkage and selection operator (LASSO) regularization, which ensure sparsity in the parameter vector and allow one to find attributes influencing the class variable

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Legal Medicine	Publication Date: Jul 14, 2021
Citations: 5	License type: open-access

R Discovery Prime

R Discovery Prime

Searching for improvements in predicting human eye colour from DNA

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Legal Medicine

Lead the way for us

Similar Papers

Predicting eye and hair colour in a Norwegian population using Verogen’s ForenSeq™ DNA signature prep kit
Nina Mjølsnes Salvo ... Gunn-Hege Olsen
Forensic Science International: Genetics | VOL. 56
Nina Mjølsnes Salvo, et. al.Nina Mjølsnes Salvo ... Gunn-Hege Olsen
24 Oct 2021
Forensic Science International: Genetics | VOL. 56

The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA
Susan Walsh ... Agnieszka Kosiniak-Kamysz
Forensic Science International: Genetics | VOL. 7
Susan Walsh, et. al.Susan Walsh ... Agnieszka Kosiniak-Kamysz
20 Aug 2012
Forensic Science International: Genetics | VOL. 7

Eye color prediction using single nucleotide polymorphisms in Saudi population.
Jahad Alghamdi ... Mansour Al Mufarrej
Saudi journal of biological sciences | VOL. 26
Jahad Alghamdi, et. al.Jahad Alghamdi ... Mansour Al Mufarrej
28 Sep 2018
Saudi journal of biological sciences | VOL. 26

Phenotypic Classification of Eye Colour and Developmental Validation of the Irisplex System on Population Living in Malakand Division, Pakistan.
Murad Ali Rahat ... Muhammad Israr
Biomedicines | VOL. 11
Murad Ali Rahat, et. al.Murad Ali Rahat ... Muhammad Israr
20 Apr 2023
Biomedicines | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Searching for improvements in predicting human eye colour from DNA

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Legal Medicine