Elaeagnus mollis, which has seeds with high lipid and vitamin E contents, is a valuable woody oil plant with potential for utilization. Currently, the biosynthesis and regulation mechanism of glycerolipids and vitamin E are still unknown in E. mollis. Here, we present the chromosome-level reference genome of E. mollis (scaffold N50: ~40.66Mbp, genome size: ~591.48Mbp) by integrating short-read, long-read, and Hi-C sequencing platforms. A total of 36,796 protein-coding sequences, mainly located on 14 proto-chromosomes, were predicted. Additionally, two whole genome duplication (WGD) events were suggested to have occurred ~54.07 and ~35.06 million years ago (MYA), with Elaeagnaceae plants probably experiencing both WGD events. Furthermore, the long terminal retrotransposons in E. mollis were active ~0.23MYA, and one of them was inferred to insert into coding sequence of the negative regulatory lipid synthesis gene, EMF2. Through transcriptomic and metabonomic analysis, key genes contributing to the high lipid and vitamin E levels of E. mollis seeds were identified, while miRNA regulation was also considered. This comprehensive work on the E. mollis genome not only provides a solid theoretical foundation and experimental basis for the efficient utilization of seed lipids and vitamin E, but also contributes to the exploration of new genetic resources.
Read full abstract