DNA methylation plays a critical role in gene regulation by modulating the DNA binding of transcription factors (TFs). This study integrates TFs’ ChIP-seq profiles with WGBS profiles to investigate how DNA methylation affects protein interactions. Statistical methods and a 5-letter DNA motif calling model have been developed to characterize DNA sequences bound by proteins, while considering the effects of DNA modifications. By employing these methods, 79 significant universal “stripe” TFs and cofactors (USFs), 2360 co-binding protein pairs, and distinct protein modules associated with various DNA methylation states have been identified. The USFs hint a regulatory hierarchy within these protein interactions. Proteins preferentially bind to non-CpG sites in methylated regions, indicating binding affinity is not solely CpG-dependent. Proteins involved in methylation-specific USFs and cobinding pairs play essential roles in promoting and sustaining DNA methylation through interacting with DNMTs or inhibiting TET binding. These findings underscore the interplay between protein binding and methylation, offering insights into epigenetic regulation in cellular biology.
Read full abstract