DNA methylation is a covalent bond modification that is observed mainly at cytosine bases in the context of CG pairs. DNA methylation patterns reflect the status of individual tissues, such as cell composition, age, and the local environment, in mammals. Genetic factors also impact DNA methylation, and the genetic diversity among various dog breeds provides a valuable platform for exploring this topic. Compared to those in the human genome, studies on the profiling of methylation in the dog genome have been less comprehensive. Our study provides extensive profiling of DNA methylation in the whole blood of three dog breeds using whole-genome bisulfite sequencing. The difference in DNA methylation between breeds was moderate after removing CpGs overlapping with potential genetic variation. However, variance in methylation between individuals was common and often occurred in promoters and CpG islands (CGIs). Moreover, we adopted contextual awareness methodology to characterize DNA primary sequences using natural language processing (NLP). This method could be used to effectively separate unmethylated CGIs from highly methylated CGIs in the sequences that are identified by the conventional criteria. This study presents a comprehensive DNA methylation landscape in the dog blood. Our observations reveal the similar methylation patterns across dog breeds, while CGI regions showed high variations in DNA methylation level between individuals. Our study also highlights the potential of NLP approach for analyzing low-complexity DNA sequences, such as CGIs.
Read full abstract