The chloroplast genome is an important tool for studying plant classification, evolution, and the heterologous production of secondary metabolites and protein drugs. With advances in sequencing technology and falling sequencing costs, chloroplast genome data have accumulated rapidly. However, existing chloroplast genome databases suffer from incomplete data, inadequate management, and inconsistent or inaccurate information, which poses significant challenges for the development and use of the chloroplast genome. There is therefore an urgent need for a database that provides comprehensive and reliable chloroplast genome information. This article briefly introduces the Chloroplast Genome Information Resource (CGIR), the most comprehensive chloroplast genome database globally in terms of species coverage. The database consists of five modules: (1) genomes, (2) genes, (3) simple sequence repeats (SSRs), (4) DNA barcodes, and (5) DNA signature sequences (DSSs). It currently includes 34 923 chloroplast genome assemblies from 16 717 species. Based on the functionalities of these modules, the article systematically summarizes progress in applying the database to plant phylogenetic analysis, species identification, and chloroplast genetic engineering. The database will be continuously updated to provide a solid and reliable data foundation for chloroplast genome research, further promoting studies on traditional Chinese medicine (TCM) identification, resource conservation, and germplasm innovation.