Abstract

BackgroundDNA methylation is an important epigenetic mark relevant to normal development and disease genesis. A common approach to characterizing genome-wide DNA methylation is using Next Generation Sequencing technology to sequence bisulfite treated DNA. The short sequence reads are mapped to the reference genome to determine the methylation statuses of Cs. However, despite intense effort, a much smaller proportion of the reads derived from bisulfite treated DNA (usually about 40-80%) can be mapped than regular short reads mapping (> 90%), and it is unclear what factors lead to this low mapping efficiency.ResultsTo address this issue, we used the hairpin bisulfite sequencing technology to determine sequences of both DNA double strands simultaneously. This enabled the recovery of the original non-bisulfite-converted sequences. We used Bismark for bisulfite read mapping and Bowtie2 for recovered read mapping. We found that recovering the reads improved unique mapping efficiency by 9-10% compared to the bisulfite reads. Such improvement in mapping efficiency is related to sequence entropy.ConclusionsThe hairpin recovery technique improves mapping efficiency, and sequence entropy relates to mapping efficiency.

Highlights

  • DNA methylation is an important epigenetic mark relevant to normal development and disease genesis

  • The DNA is amplified with polymerase chain reaction (PCR) where the uracil is converted to thymine

  • Differences between a reference genome and the bisulfite treated DNA can be used to search for methylated cytosines

Read more

Summary

Introduction

DNA methylation is an important epigenetic mark relevant to normal development and disease genesis. A common approach to characterizing genome-wide DNA methylation is using Generation Sequencing technology to sequence bisulfite treated DNA. The short sequence reads are mapped to the reference genome to determine the methylation statuses of Cs. despite intense effort, a much smaller proportion of the reads derived from bisulfite treated DNA (usually about 40-80%) can be mapped than regular short reads mapping (> 90%), and it is unclear what factors lead to this low mapping efficiency. DNA methylation is an epigenetic phenomenon that adds a methyl group to the nucleic acid cytosine in DNA. Differences between a reference genome and the bisulfite treated DNA can be used to search for methylated cytosines

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call