Abstract

The intricacy and fractal properties of human DNA sequences are examined in this work. The core of this study is to discern whether complete DNA sequences present distinct complexity and fractal attributes compared with sequences containing exclusively exon regions. In this regard, the entire base pair sequences of DNA are extracted from the NCBI (National Center for Biotechnology Information) database. In order to create a time series representation for the base pair sequence {G,C,T,A}, we use the Chaos Game Representation (CGR) approach and a mapping rule f, which enables us to apply the metric known as the Complexity–Entropy Plane (CEP) and multifractal detrended fluctuation analysis (MF-DFA). To carry out our investigation, we divided human DNA into two groups: the first is composed of the 24 chromosomes, which comprises all the base pairs that form the DNA sequence, and another group that also includes the 24 chromosomes, but the DNA sequences rely only on the exons’ presence. The results show that both sets provide fractal patterns in their structure, as obtained by the CGR approach. Complete DNA sequences show a sharper visual fractal pattern than sequences composed only of exons. Moreover, the sequences occupy distinct areas of the complexity–entropy plane, and the complete DNA sequences lead to greater statistical complexity and lower entropy than the exon sequences. Also, we observed that different fractal parameters between chromosomes indicate diversity in genomic sequences. All these results occur in different scales for all chromosomes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.