Informational laws of genome structures.

Vincenzo Bonnici,Vincenzo Manca

doi:10.1038/srep28840

Abstract

In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.

Highlights

The analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures
Many studies have approached the investigation of DNA strings and genomes by means of algorithms, information theory and formal languages[11,12,13,14,15,16,17,18,19,20,21,22], and methods were developed for investigating whole genome structures
We prove that preferential lengths exist for computing entropies, and in correspondence with these lengths, some informational indexes can be defined that exhibit “informational laws” and characterize an informational structure of genomes

Summary

Introduction

The analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In genome analyses based on dictionaries, concepts from formal language theory, probability, and information theory are naturally combined by providing new perspectives in the investigation of genomes, which may disclose the internal logics of their structures. A point that is crucial in genome analyses based on k-mers is the value of k that is more adequate for specific investigations This issue becomes extremely evident when computing the entropy of a genome. We follow an information theoretic line of investigation based on k-mer dictionaries and entropies[16,26,27,31,32,33], which is aimed at defining and computing informational indexes for a representative set of www.nature.com/scientificreports/

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jun 29, 2016
Citations: 30	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Informational laws of genome structures.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Spatial evolutionary game theory: Hawks and Doves revisited
...
Proceedings of the Royal Society of London. Series B: Biological Sciences | VOL. 263
, et. al. ...
22 Sep 1996
Proceedings of the Royal Society of London. Series B: Biological Sciences | VOL. 263

The Need for Speed and Energy Efficiency in Genome Analysis
Sachin Rawat
GEN Biotechnology | VOL. 2
Sachin RawatSachin Rawat
01 Jun 2023
GEN Biotechnology | VOL. 2

Task Force on Strategic Research Direction: Basic Science Subgroup key science topics report.
Joseph Loscalzo ... Geoffrey S Ginsburg
Circulation | VOL. 106
Joseph Loscalzo, et. al.Joseph Loscalzo ... Geoffrey S Ginsburg
12 Nov 2002
Circulation | VOL. 106

New mathematical measures for apprehending complexity of chiral molecules using information entropy.
Patrick Piras
Chirality | VOL. 34
Patrick PirasPatrick Piras
11 Feb 2022
Chirality | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Informational laws of genome structures.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports