Abstract

The three helical Histone Fold Motif (HFM) of core histone proteins provides an evolutionarily favored site for the protein-DNA interface. Despite significant variation in sequence, the HFM retains a distinctive structural fold that has diversified into several non-histone protein families. In this work, we explore the ancestry of non-histone HFM containing families in the plant kingdom. A sequence search algorithm was developed using iterative profile Hidden Markov Models to identify remote homologs of core-histone proteins. The resulting hits were functionally annotated, classified into families, and subjected to comprehensive phylogenetic analyses via Maximum likelihood and Bayesian methods. We have identified 4390 HFM containing proteins in the plant kingdom that are not histones, mostly existing as diverse transcription factor families, distributed widely within and across taxonomic groups. Patterns of homology suggest that core histone subunit H2A has evolved into newer families like NF-YC and DRAP1, whereas the H2B subunit of core histones shares a common ancestry with NF-YB and DR1 class of TFs. Core histone subunits H3 and H4 were found to have evolved into DPE and TAF proteins, respectively. Taken together these results provide insights into diversification events during the evolution of the HFM, including sub-functionalization and neo-functionalization of the HFM.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call