Advancements in sequencing technologies have facilitated low-coverage whole-genome sequencing (lcWGS) for detecting millions of single nucleotide polymorphisms (SNPs) in large populations at low cost. The effectiveness of this approach relies on genotype imputation following lcWGS. While several imputation methods have been applied in human and other terrestrial animals, their performance in aquaculture organisms, particularly mollusks, remains unassessed. Mollusks contribute 70% of mariculture production in China and are characterized by high genetic polymorphism. In this study, we aimed to optimize reference panel construction and evaluate imputation strategies for SNP detection in two scallops, Chlamys farreri and Patinopecten yessoensis, both important bivalve species. High-coverage whole-genome sequencing (hcWGS) data were used to construct haplotype reference panels, and three imputation methods were applied: Beagle5.4, GLIMPSE2, and QUILT. QUILT outperformed other methods, achieving imputation quality scores (IQS) exceeding 0.930 in C. farreri and 0.946 in P. yessoensis, and yielded the highest number of imputed loci in both species. Furthermore, panel construction was optimized by adjusting for genotype quality (GQ) and minor allele frequency (MAF); we determined that a GQ threshold > 20 and an MAF threshold > 0.01 were optimal. We observed that imputation quality improved with increased sequencing depth, plateauing at 0.5×, and varied with sample size depending on the imputation method. A negative correlation was observed between imputation quality and the ratio of non-synonymous to synonymous substitution rates (Ka/Ks) across chromosomal segments, highlighting the influence of selection pressure on imputation efficacy. Collectively, our findings provide an optimized framework for SNP genotyping using lcWGS in scallops, with implications for aquaculture breeding and genetic research.
Read full abstract