Understanding variation in chromatin contact patterns across diverse humans is critical for interpreting noncoding variants and their effects on gene expression and phenotypes. However, experimental determination of chromatin contact patterns across large samples is prohibitively expensive. To overcome this challenge, we develop and validate a machine learning method to quantify the variation in 3D chromatin contacts at 2 kilobase resolution from genome sequence alone. We apply this approach to thousands of human genomes from the 1000 Genomes Project and the inferred hominin ancestral genome. While patterns of 3D contact divergence genome wide are qualitatively similar to patterns of sequence divergence, we find substantial differences in 3D divergence and sequence divergence in local 1 megabase genomic windows. In particular, we identify 392 windows with significantly greater 3D divergence than expected from sequence. Moreover, for 31% of genomic windows, a single individual has a rare divergent 3D contact map pattern. Using in silico mutagenesis, we find that most single nucleotide sequence changes do not result in changes to 3D chromatin contacts. However, in windows with substantial 3D divergence just one or a few variants can lead to divergent 3D chromatin contacts without the individuals carrying those variants having high sequence divergence. In summary, inferring 3D chromatin contact maps across human populations reveals variable contact patterns. We anticipate that these genetically diverse maps of 3D chromatin contact will provide a reference for future work on the function and evolution of 3D chromatin contact variation across human populations.
Read full abstract