Abstract

CpG Island (CGI) is considered to be one of the important segments of deoxyribonucleic acid (DNA) sequences. Out of the various epigenetic events which are associated with CGIs, some such events are like: CGIs are useful in the prediction of promoter region and subsequently for gene prediction, CGIs’ contribution in finding the epigenetic reasons of cancer is of great importance, CGIs can be used to identify chromosome inactivation. Therefore, the exact and maximum number of CGIs hidden in DNA sequences need to explored. A lot of computational, transform-based approaches have been developed and reported in literature for the identification of CGIs in DNA sequences since last many years. The problem associated with transform-based approaches is that the domain of functioning of algorithm requires to be changed which can probably lead to biasing and result in loss of important information in terms of CGIs. Hence, to provide a solution to this issue, a modified P-spectrum-based approach has been proposed here which does not suffer from domain transformation issue. Also, the performance of proposed algorithm has been tested on a large data set of 100 DNA sequences of human species and the performance has been compared with other recently reported methods of CGIs identification in DNA sequences. The results obtained prove that the proposed algorithm is better than the existing methods in terms of identification of more number of CGIs in DNA sequences. Therefore, the proposed algorithm has been considered as an efficient approach to enhance the sensitivity of CGIs identification.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call