Nicotiana benthamiana is a model organism widely adopted in plant biology. A complete genome assembly has remained unavailable despite several recent improvements. To further improve its usefulness, we generate and phase the complete 2.85 Gb genome assembly of allotetraploid N. benthamiana. We find that although Solanaceae centromeres are widely dominated by Ty3/Gypsy retrotransposons, satellite-based centromeres are surprisingly common in N. benthamiana, with 11 of 19 centromeres featuring megabase-scale satellite arrays. Interestingly, both the satellite-enriched and the satellite-free centromeres are extensively invaded by distinct Gypsy retrotransposons that the CENH3 protein preferentially occupies, suggesting that these retrotransposons play crucial roles in centromere function. We demonstrate that ribosomal DNA is a major origin of centromeric satellites and that mitochondrial DNA could serve as a core component of the centromere. Subgenome analysis indicates that the emergence of satellite arrays probably drives new centromere formation. Altogether, we propose that N. benthamiana centromeres evolved via neocentromere formation, satellite expansion, retrotransposon enrichment and mtDNA integration.