Malus baccata, a valuable germplasm resource in the genus Malus, is indigenous to China and widely distributed. However, little is known about the lineage composition and genetic basis of 'ZA', a mutant type of M. baccata. In this study, we compared the differences between 'ZA' and wild type from the perspective of morphology and ultrastructure and analyzed their chloroplast pigment content based on biochemical methods. Further, the complete mitogenome of M. baccata 'ZA' was assembled and obtained by next-generation sequencing. Subsequently, its molecular characteristics were analyzed using Geneious, MISA-web, and CodonW toolkits. Furthermore, by examining 106 Malus germplasms and 42 Rosaceae species, we deduced and elucidated the evolutionary position of M. baccata 'ZA', as well as interspecific variations among different individuals. In comparison, the total length of the 'ZA' mitogenome (GC content: 45.4%) is 374,023 bp, which is approximately 2.33 times larger than the size (160,202 bp) of the plastome (GC: 36.5%). The collinear analysis results revealed abundant repeats and genome rearrangements occurring between different Malus species. Additionally, we identified 14 plastid-driven fragment transfer events. A total of 54 genes have been annotated in the 'ZA' mitogenome, including 35 protein-coding genes, 16 tRNAs, and three rRNAs. By calculating nucleotide polymorphisms and selection pressure for 24 shared core mitochondrial CDSs from 42 Rosaceae species (including 'ZA'), we observed that the nad3 gene exhibited minimal variation, while nad4L appeared to be evolving rapidly. Population genetics analysis detected a total of 1578 high-quality variants (1424 SNPs, 60 insertions, and 94 deletions; variation rate: 1/237) among samples from 106 Malus individuals. Furthermore, by constructing phylogenetic trees based on both Malus and Rosaceae taxa datasets, it was preliminarily demonstrated that 'ZA' is closely related to M. baccata, M. sieversii, and other proximate species in terms of evolution. The sequencing data obtained in this study, along with our findings, contribute to expanding the mitogenomic resources available for Rosaceae research. They also hold reference significance for molecular identification studies as well as conservation and breeding efforts focused on excellent germplasms.
Read full abstract