Abstract

To search for evolutionary forces that might act upon transcript length, we use the singular value decomposition (SVD) to identify the length distribution functions of sets and subsets of human and yeast transcripts from profiles of mRNA abundance levels across gel electrophoresis migration distances that were previously measured by DNA microarrays. We show that the SVD identifies the transcript length distribution functions as “asymmetric generalized coherent states” from the DNA microarray data and with no a-priori assumptions. Comparing subsets of human and yeast transcripts of the same gene ontology annotations, we find that in both disparate eukaryotes, transcripts involved in protein synthesis or mitochondrial metabolism are significantly shorter than typical, and in particular, significantly shorter than those involved in glucose metabolism. Comparing the subsets of human transcripts that are overexpressed in glioblastoma multiforme (GBM) or normal brain tissue samples from The Cancer Genome Atlas, we find that GBM maintains normal brain overexpression of significantly short transcripts, enriched in transcripts that are involved in protein synthesis or mitochondrial metabolism, but suppresses normal overexpression of significantly longer transcripts, enriched in transcripts that are involved in glucose metabolism and brain activity. These global relations among transcript length, cellular metabolism and tumor development suggest a previously unrecognized physical mode for tumor and normal cells to differentially regulate metabolism in a transcript length-dependent manner. The identified distribution functions support a previous hypothesis from mathematical modeling of evolutionary forces that act upon transcript length in the manner of the restoring force of the harmonic oscillator.

Highlights

  • Transcription of messenger RNA associates a cell’s genotype with its phenotype in all known organisms

  • We find that in both disparate organisms, transcripts involved in protein synthesis or mitochondrial metabolism are significantly shorter than typical, and in particular, significantly shorter than those involved in glucose metabolism

  • To search for evolutionary forces that might act upon transcript length, we use the singular value decomposition (SVD) to identify the length distribution functions of subsets of human transcripts that are overexpressed in either normal brain or glioblastoma multiforme (GBM) tumor tissue samples from TCGA [16,17]

Read more

Summary

Introduction

Transcription of messenger RNA (mRNA) associates a cell’s genotype with its phenotype in all known organisms. Unlike prokaryotes, multiple possible mRNAs of, e.g., different lengths, can be and usually are produced, i.e., transcribed and processed in the cell’s nucleus and modified in the cytoplasm, which correspond to just a single gene. This diversity of mRNAs has been suggested as a possible origin of the diversity of eukaryotes in general [1] and of neurons in the human brain in particular [2]. Eukaryotic mRNA transcription usually starts with the binding of DNA-dependent RNA polymerase (RNAP) II at one of several possible sites for the 59 end of the synthesized transcript, at the DNA-encoded promoter region upstream of the 59 end of a gene.

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call