Abstract

C+G content (GC content or G+C content) is known to be correlated with genome/chromosome size in bacteria but the relationship for other kingdoms remains unclear. This study analyzed genome size, chromosome size, and base composition in most of the available sequenced genomes in various kingdoms. Genome size tends to increase during evolution in plants and animals, and the same is likely true for bacteria. The genomic C+G contents were found to vary greatly in microorganisms but were quite similar within each animal or plant subkingdom. In animals and plants, the C+G contents are ranked as follows: monocot plants>mammals>non-mammalian animals>dicot plants. The variation in C+G content between chromosomes within species is greater in animals than in plants. The correlation between average chromosome C+G content and chromosome length was found to be positive in Proteobacteria, Actinobacteria (but not in other analyzed bacterial phyla), Ascomycota fungi, and likely also in some plants; negative in some animals, insignificant in two protist phyla, and likely very weak in Archaea. Clearly, correlations between C+G content and chromosome size can be positive, negative, or not significant depending on the kingdoms/groups or species. Different phyla or species exhibit different patterns of correlation between chromosome-size and C+G content. Most chromosomes within a species have a similar pattern of variation in C+G content but outliers are common. The data presented in this study suggest that the C+G content is under genetic control by both trans- and cis- factors and that the correlation between C+G content and chromosome length can be positive, negative, or not significant in different phyla.

Highlights

  • Base composition is a fundamental property of genomes and has a strong influence on gene function and regulation

  • Flow cytometry measurement is correlated with sequencing data in rice, but the C+G content measured by flow cytometry after staining with DAPI (49,6-diamidino-2-phenylindole) is consistently higher than the content in the sequencing data [6]

  • Microorganisms and higher organisms showed very different patterns of C+G content when the genomes were ranked by size within each large phylogenetic group (Table 1; Figure 1)

Read more

Summary

Introduction

Base composition is a fundamental property of genomes and has a strong influence on gene function and regulation. It is known from Chargaff’s rule [1] that in cellular DNA, the amount of adenine (A) is approximately equal to that of thymine (T) and the amount of cytosine (C) is approximately equal to that of guanine (G). C+G content is one of the major factors affecting melting–annealing profiles, the melting–annealing speed of DNA is not 100% correlated with GC content, because the speed is influenced by the degree of repetition of sequences such as satellite sequences [3,4,5]. A general comparison of genomic base composition between major kingdoms (domains of life) is still lacking

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call