Deep learning classification models based on Convolutional Neural Networks (CNNs) are increasingly used in population genetic inference for detecting signatures of natural selection. Prevailing detection methods treat the design of the classifier as a discrete phase, assuming that high classification accuracy is the sole prerequisite for precise detection. This frequently steers method development toward classification-driven optimizations that can inadvertently impede detection. We present FASTER-NN, a CNN classifier designed specifically for the precise detection of natural selection. It has higher sensitivity than state-of-the-art CNN classifiers while only processing allele frequencies and genomic positions through dilated convolutions to maximize data reuse. As a result, execution time is invariant to the sample size and the chromosome length, creating a highly suitable solution for large-scale, whole-genome scans. Furthermore, FASTER-NN can accurately identify selective sweeps in recombination hotspots, which is a highly challenging detection problem with very limited theoretical treatment to date.
Read full abstract