Abstract
Camellia oleifera is one of the four largest woody edible oil plants in the world with high ecological and medicinal values. Due to frequent interspecific hybridization, it was difficult to study its genetics and evolutionary history. This study used C. oleifera that was collected on Hainan Island to conduct our research. The unique island environment makes the quality of tea oil higher than that of other species grown in the mainland. Moreover, a long-term geographic isolation might affect gene structure. In order to better understand the molecular biology of this species, protect excellent germplasm resources, and promote the population genetics and phylogenetic studies of Camellia plants, high-throughput sequencing technology was used to obtain the chloroplast genome sequence of Hainan C. oleifera. The results showed that the whole chloroplast genome of C. oleifera in Hainan was 156,995 bp in length, with a typical quadripartite structure of a large single copy (LSC) region of 86,648 bp, a small single copy (SSC) region of 18,297 bp, and a pair of inverted repeats (IRs) of 26,025 bp. The whole genome encoded a total of 141 genes (115 different genes), including 88 protein-coding genes, 45 tRNA genes, and eight rRNA genes. Among these genes, nine genes contained one intron, two genes contained two introns, and four overlapping genes were also detected. The total GC content of Hainan C. oleifera’s chloroplast genome was 37.29%. The chloroplast genome structure characteristics of Hainan C. oleifera were compared with mainland C. oleifera and those of the other eight closely related Theaceae species; it was found that the contractions and expansions of the IR/LSC and IR/SSC regions affected the length of chloroplast genome. The chloroplast genome sequences of these Theaceae species were highly similar. A comparative analysis indicated that the Theaceae species were conserved in structure and evolution. A total of 51 simple sequence repeat (SSR) loci were detected in the chloroplast genome of Hainan C. oleifera, and all Camellia plants did not have pentanucleotide repeats, which could be used as a good marker in phylogenetic studies. We also detected seven long repeats, the base composition of all repeats was biased toward A/T, which was consistent with the codon bias. It was found that Hainan C. oleifera had a similar evolutionary relationship with C. crapnelliana, through the use of codons and phylogenetic analysis. This study can provide an effective genomic resource for the evolutionary history of Theaceae family.
Highlights
The chloroplast genome, known as chloroplast DNA, is often abbreviated as cpDNA
Chloroplast genome characteristic of Hainan C. oleifera Like the majority of land plants, the whole chloroplast genome of Hainan C. oleifera showed a typical quadripartite genome organization with a size of 156,995 bp, including a large single copy (LSC) region of 86,648 bp and a small single copy (SSC) region of 18,297 bp, which were separated by two inverted repeats (IRs) (IRa and IRb) regions of 26,025 bp (Fig. 1)
The chloroplast genome of Hainan C. oleifera was found to contain 141 predicted functional genes, including 88 protein-coding genes, 45 tRNA genes and eight rRNA genes, which were classified according to their function
Summary
The chloroplast genome, known as chloroplast DNA, is often abbreviated as cpDNA It shows the typical quadripartite structure generally consisting of four parts with a large single copy (LSC), a small single copy (SSC), and two inverted repeats (IRs) (Jansen et al, 2005; Palmer, 1991). The IR regions of gymnosperms, such as Japanese black pine, which was only 495 bp in length (Tsudzuki et al, 1992), whereas the IR regions of legumes, such as Medicago, disappeared completely (Saski et al, 2005). This polymorphism of chloroplast genome has important research significance in phylogenetic and population genetics
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have