Abstract

BackgroundEpigenetics is the study of gene expression changes that are not caused by changes in the deoxyribonucleic acid (DNA) sequence. DNA methylation is an epigenetic mark occurring in C–phosphate–G sites (CpGs) that leads to local or regional gene expression changes. Reduced-representation bisulfite sequencing (RRBS) is a technique that is used to ascertain the DNA methylation of millions of CpGs at single-nucleotide resolution. The genomic coverage of RRBS is given by the restriction enzyme combination used during the library preparation and the throughput capacity of the next-generation sequencer, which is used to read the generated libraries. The four-nucleotide cutters, MspI and TaqαI, are restriction enzymes commonly used in RRBS that, when combined, achieve ~12% genomic coverage. The increase in throughput of next-generation sequencers allows for novel combinations of restriction enzymes that provide higher CpG coverage.ResultsWe performed a near-neighbor analysis of the four nucleotide sequences most frequently found within 50 nt of all genomic CpGs. This resulted in the identification of seven methylation-insensitive restriction enzymes (AluI, BfaI, HaeIII, HpyCH4V, MluCI, MseI, and MspI) that shared similar restriction conditions suitable for RRBS library preparation. We report that the use of two or three enzyme combinations increases the theoretical epigenome coverage to almost half of the human genome.ConclusionsWe provide the enzyme combinations that are more likely to increase the CpG coverage in human, rat, and mouse genomes.Electronic supplementary materialThe online version of this article (doi:10.1186/1756-0500-7-534) contains supplementary material, which is available to authorized users.

Highlights

  • Epigenetics is the study of gene expression changes that are not caused by changes in the deoxyribonucleic acid (DNA) sequence

  • The increase in mass sequencing throughput allows for multiple enzyme combinations to expand representation bisulfite sequencing (RRBS) C–phosphate–G sites (CpGs) coverage

  • In bold are enzymes that may be used for RRBS, and which share similar restriction conditions

Read more

Summary

Introduction

Epigenetics is the study of gene expression changes that are not caused by changes in the deoxyribonucleic acid (DNA) sequence. DNA methylation is an epigenetic mark occurring in C–phosphate–G sites (CpGs) that leads to local or regional gene expression changes. Reduced-representation bisulfite sequencing (RRBS) is a technique that is used to ascertain the DNA methylation of millions of CpGs at single-nucleotide resolution. The genomic coverage of RRBS is given by the restriction enzyme combination used during the library preparation and the throughput capacity of the next-generation sequencer, which is used to read the generated libraries. The increase in throughput of next-generation sequencers allows for novel combinations of restriction enzymes that provide higher CpG coverage. RRBS was originally described as using a DNA methylation-insensitive restriction enzyme with a consensus sequence that is often found in C–phosphate–G (CpG)-rich regions to digest genomic DNA. Please refer to Gu et al for a detail description of the RRBS library preparation process [4]

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call