Caragana sensu lato (s.l.) includes approximately 100 species that are mainly distributed in arid and semi-arid regions. Caragana species are ecologically valuable for their roles in windbreaking and sand fixation. However, the taxonomy and phylogenetic relationships of the genus Caragana are still unclear. In this study, we sequenced and assembled the chloroplast genomes of representative species of Caragana and reconstructed robust phylogenetic relationships at the section level. The Caragana chloroplast genome has lost the inverted repeat region and wascategorized in the inverted repeat loss clade (IRLC). The chloroplast genomes of the eight species ranged from 128,458 bp to 135,401 bp and contained 110 unique genes. All the Caragana chloroplast genomes have a highly conserved structure and gene order. The number of long repeats and simple sequence repeats (SSRs) showed significant variation among the eight species, indicating heterogeneous evolution in Caragana. Selective pressure analysis of the genes revealed that most of the protein-coding genes evolved under purifying selection. The phylogenetic analyses indicated that each section forms a clade, except the section Spinosae, which was divided into two clades. This study elucidated the evolution of the chloroplast genome within the widely distributed genus Caragana. The detailed information obtained from this study can serve as a valuable resource for understanding the molecular dynamics and phylogenetic relationships within Caragana.
Read full abstract