Gene annotation is essential for genome-based studies. However, algorithm-based genome annotation is difficult to fully and correctly reveal genomic information, especially for species with complex genomes. Artemisia annua L. is the only commercial resource of artemisinin production though the content of artemisinin is still to be improved. Genome-based genetic modification and breeding are useful strategies to boost artemisinin content and therefore, ensure the supply of artemisinin and reduce costs, but better gene annotation is urgently needed. In this study, we manually corrected the newly released genome annotation of A. annua using second- and third-generation transcriptome data. We found that incorrect gene information may lead to differences in structural, functional, and expression levels compared to the original expectations. We also identified alternative splicing events and found that genome annotation information impacted identifying alternative splicing genes. We further demonstrated that genome annotation information and alternative splicing could affect gene expression estimation and gene function prediction. Finally, we provided a valuable version of A. annua genome annotation and demonstrated the importance of gene annotation in future research.
Read full abstract