Abstract

BackgroundDNA rearrangement events have been widely studied in comparative genomic for many years. The importance of these events resides not only in the study about relatedness among different species, but also to determine the mechanisms behind evolution. Although there are many methods to identify genome-rearrangements (GR), the refinement of their borders has become a huge challenge. Until now no accepted method exists to achieve accurate fine-tuning: i.e. the notion of breakpoint (BP) is still an open issue, and despite repeated regions are vital to understand evolution they are not taken into account in most of the GR detection and refinement methods.Methods and resultsWe propose a method to refine the borders of GR including repeated regions. Instead of removing these repetitions to facilitate computation, we take advantage of them using a consensus alignment sequence of the repeated region in between two blocks. Using the concept of identity vectors for Synteny Blocks (SB) and repetitions, a Finite State Machine is designed to detect transition points in the difference between such vectors. The method does not force the BP to be a region or a point but depends on the alignment transitions within the SBs and repetitions.ConclusionThe accurate definition of the borders of SB and repeated genomic regions and consequently the detection of BP might help to understand the evolutionary model of species. In this manuscript we present a new proposal for such a refinement. Features of the SBs borders and BPs are different and fit with what is expected. SBs with more diversity in annotations and BPs short and richer in DNA replication and stress response, which are strongly linked with rearrangements.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-3069-4) contains supplementary material, which is available to authorized users.

Highlights

  • DNA rearrangement events have been widely studied in comparative genomic for many years

  • Calculating consensus identity vector In the case that a group of repetitions are detected, we use the information of the consensus sequence to improve accuracy of the refinement method

  • We have developed a method to refine the borders of Computational Synteny Blocks (CSB) taking into account repetitions and using them to improve the accuracy of the refinement

Read more

Summary

Introduction

DNA rearrangement events have been widely studied in comparative genomic for many years. The importance of these events resides in the study about relatedness among different species, and to determine the mechanisms behind evolution. The presence of repeated regions or small blocks between the SBs increases the complexity of the detection, one of the main reasons why most methods do not take into account such repetitions. These repetitions -mostly associated with mobile elements- have been driving the evolution in many ways [9]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call