Pan-evolutionary and regulatory genome architecture delineated by an integrated macro- and microsynteny approach.

Lingling Zhang,Hongwei Yu,Fuyun Liu,Lisui Bao,Yuanting Ma,Wentao Han,Zhenmin Bao,Yuli Li,Shi Wang,Qifan Zeng,Zhongqi Pu

doi:10.1038/s41596-024-00966-4

Abstract

The forthcoming massive genome data generated by the Earth BioGenome Project will open up a new era of comparative genomics, for which genome synteny analysis provides an important framework. Profiling genome synteny represents an essential step in elucidating genome architecture, regulatory blocks/elements and their evolutionary history. Here we describe PanSyn, ( https://github.com/yhw320/PanSyn ), the most comprehensive and up-to-date genome synteny pipeline, providing step-by-step instructions and application examples to demonstrate its usage. PanSyn inherits both basic and advanced functions from existing popular tools, offering a user-friendly, highly customized approach for genome macrosynteny analysis and integrated pan-evolutionary and regulatory analysis of genome architecture, which are not yet available in public synteny software or tools. The advantages of PanSyn include: (i) advanced microsynteny analysis by functional profiling of microsynteny genes and associated regulatory elements; (ii) comprehensive macrosynteny analysis, including the inference of karyotype evolution from ancestors to extant species; and (iii) functional integration of microsynteny and macrosynteny for pan-evolutionary profiling of genome architecture and regulatory blocks, as well as integration with external functional genomics datasets from three- or four-dimensional genome and ENCODE projects. PanSyn requires basic knowledge of the Linux environment and Perl programming language and the ability to access a computer cluster, especially for large-scale genomic comparisons. Our protocol can be easily implemented by a competent graduate student or postdoc and takes several days to weeks to execute for dozens to hundreds of genomes. PanSyn provides yet the most comprehensive and powerful tool for integrated evolutionary and functional genomics.

Full Text