Anopheles stephensi is the most menacing malaria vector to watch for in newly urbanising parts of the world. Its fitness is reported to be a direct consequence of the vector adapting to laying eggs in over-head water tanks with street-side water puddles polluted by oil and sewage. Large frequent inversions in the genome of malaria vectors are implicated in adaptation. We report the genome assembly of a strain of An. stephensi of the type-form, collected from a construction site from Chennai (IndCh) in 2016. The genome reported here with a L50 of 4, completes the trilogy of high-resolution genomes of strains with respect to a 16.5 Mbp 2Rb genotype in An. stephensi known to be associated with adaptation to environmental heterogeneity. Unlike the reported genomes of two other strains, STE2 (2R+b/2Rb) and UCI (2Rb/2Rb), IndCh is found to be homozygous for the standard form (2R+b/2R+b). Comparative genome analysis revealed base-level details of the breakpoints and allowed extraction of 22,650 segregating SNPs for typing this inversion in populations. Whole genome sequencing of 82 individual mosquitoes from diverse geographical locations reveal that one third of both wild and laboratory populations maintain the heterozygous genotype of 2Rb. The large number of SNPs can be tailored to 1740 exonic SNPs enabling genotyping directly from transcriptome sequencing. The genome trilogy approach accelerated the study of fine structure and typing of an important inversion in An. stephensi, putting the genome resources for this understudied species on par with the extensively studied malaria vector, Anopheles gambiae. We argue that the IndCh genome is relevant for field translation work compared to those reported earlier by showing that individuals from diverse geographical locations cluster with IndCh, pointing to significant convergence resulting from travel and commerce between cities, perhaps, contributing to the survival of the fittest strain.
Read full abstract