Cross-species Conservation Research Articles

Motivations. Recent comparisons of the genomes of different human individuals and of different plant genotypes have led to interesting findings about the differences among individuals, ecotypes and genotypes. Cross-species conservation analysis revealed that many of the genes potentially encoded by novel sequences are conserved across a number of mammals, might be biologically functional, and may therefore be related to differences in gene networks between human individuals. This strongly suggests that genetics and transcriptomics must be performed in the context of individual genomes. NGS technologies provide, for the first time, the opportunity to study the complexity of individual-specific sequences. However, full genome assembly still presents problems because of highly repetitive sequences, which cannot be easily resolved with current technologies.
Methods. The first step in our workflow is de novo assembly based on a de Bruijn graph, followed by an error detection and correction step based on comparison with datasets of annotated proteins. This step was implemented to overcome a limitation of current assembly methods, which rely solely on sequence data and therefore do not prevent frameshift or overassembly errors. The platform determines whether the new genes and transcript isoforms are potentially functional and whether mutations that disrupt the original gene models in the reference genome are compensated for by the new isoform. These data are integrated and linked to expression profiles, functional annotations and network data. This makes it possible to determine whether metabolic pathways are affected or modified by the expression of transcripts alternative to those expressed in the reference genotype, or by the expression of novel genes.
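As a rough illustration of the de Bruijn graph idea mentioned in the Methods, the sketch below builds a graph from overlapping k-mers and extends a contig while the path is unambiguous. It is a toy example under stated assumptions, not the assembler used by the platform: the function names `build_de_bruijn` and `walk_unambiguous`, the reads and the value of `k` are all hypothetical, and sequencing errors, reverse complements and the protein-based correction step are not modelled.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the (k-1)-mers that follow it in some read."""
    graph = defaultdict(list)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Edge from the k-mer's prefix to its suffix.
            graph[kmer[:-1]].append(kmer[1:])
    return graph


def walk_unambiguous(graph, start):
    """Extend a contig from `start` while each node has one distinct successor."""
    contig = start
    node = start
    seen = {start}
    while True:
        successors = set(graph.get(node, []))
        if len(successors) != 1:
            break  # dead end or branch (e.g. a repeat): stop extending
        nxt = successors.pop()
        if nxt in seen:
            break  # cycle: stop rather than loop forever
        contig += nxt[-1]
        seen.add(nxt)
        node = nxt
    return contig


# Hypothetical overlapping reads, for illustration only.
reads = ["ATGGCG", "GGCGTG", "CGTGCA"]
graph = build_de_bruijn(reads, k=4)
contig = walk_unambiguous(graph, "ATG")
print(contig)
```

The walk stops at the first branch or cycle, which is where repetitive sequence makes the path ambiguous. That ambiguity is the kind of assembly problem the abstract attributes to highly repetitive regions.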
From the algorithmic viewpoint, innovative approaches for efficiently comparing reconstructed transcriptomes with the reference genome and for quantifying transcriptome and proteome diversity will be proposed, based on: (i) machine learning techniques for genome reassembly; (ii) functional enrichment based on non-parametric statistical tests; (iii) gene similarity based on common miRNA targeting and RNA editing function; (iv) probabilistic generative models for network analysis. From the computational viewpoint, we propose an innovative infrastructure based on grid/cloud computing and efficient intra-node accelerators (i.e., GP-GPUs and FPGAs). Since complex analysis pipelines made of several stages are characterized by heterogeneous computational requirements, we developed a middleware infrastructure in which specific schedulers and task migration agents will orchestrate task allocation both across nodes and within nodes. The orchestration will be performed by matching the characteristics of the application's computational kernels (obtained through off-line profiling) with the computational capabilities of the nodes. Moreover, transcriptome reconstruction requires processing many biological samples for statistical and comparative reasons, yet current frameworks are not optimized for multi-sample analysis and instead run the samples sequentially; we therefore designed techniques for efficient sample-level allocation on computational nodes. See Figure 1 for a description of the platform.
Results. The solution we propose here improves on existing solutions in two directions. First, efficient algorithms are applied for genome reconstruction and identification. Second, these algorithms are implemented in a pipeline analysis framework in which the processing of multiple samples is optimized to better exploit computational resources.
The infrastructure makes it possible for bioinformaticians, through a web service interface, to build workflows and execute them on a grid/cloud computing platform in an easy-to-use, programming-friendly environment.
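The matching of profiled kernel characteristics to node capabilities described in the Methods can be pictured with a small greedy placement sketch. Everything here is an assumption made for illustration: the `Node` and `Kernel` fields, the scoring rule, the penalty value and the example names are hypothetical and are not taken from the platform's middleware, whose actual scheduling policy the abstract does not specify.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    cpu_cores: int
    has_gpu: bool
    free_memory_gb: float


@dataclass
class Kernel:
    name: str
    needs_gpu: bool
    memory_gb: float
    parallelism: int  # assumed to come from off-line profiling


def score(kernel, node):
    """Return None if the node cannot run the kernel; otherwise lower is better."""
    if kernel.needs_gpu and not node.has_gpu:
        return None
    if kernel.memory_gb > node.free_memory_gb:
        return None
    # Hypothetical cost: mismatch in core count, plus a penalty for
    # occupying a GPU node with a kernel that cannot use the GPU.
    mismatch = abs(node.cpu_cores - kernel.parallelism)
    gpu_penalty = 10 if node.has_gpu and not kernel.needs_gpu else 0
    return mismatch + gpu_penalty


def assign(kernels, nodes):
    """Greedily place each kernel on the feasible node with the lowest score."""
    placement = {}
    for kernel in kernels:
        best = None
        for node in nodes:
            s = score(kernel, node)
            if s is not None and (best is None or s < best[0]):
                best = (s, node)
        if best is None:
            placement[kernel.name] = None  # no feasible node
        else:
            best[1].free_memory_gb -= kernel.memory_gb
            placement[kernel.name] = best[1].name
    return placement


# Hypothetical nodes and pipeline stages.
nodes = [Node("cpu-node", 16, False, 64.0), Node("gpu-node", 8, True, 32.0)]
kernels = [
    Kernel("assembly", False, 48.0, 16),
    Kernel("alignment", True, 8.0, 8),
    Kernel("enrichment", False, 4.0, 2),
]
placement = assign(kernels, nodes)
print(placement)
```

A greedy pass like this is only one possible policy. It ignores task migration and sample-level allocation, both of which the abstract lists as separate parts of the middleware.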


Abstract 2392
There is increasing recognition of the role of small noncoding RNAs in post-transcriptional regulation of gene expression in diverse tissues of eukaryotic organisms, including vertebrates. MicroRNAs (miRNAs) are the best studied among these small RNAs and are thought to act by binding to the 3' untranslated regions (3' UTRs) of mature mRNAs in a sequence-specific fashion, preventing the initiation of peptide translation and/or initiating mRNA degradation. Recent evidence suggests that miRNA-based regulation might involve binding to regions other than 3' UTRs, including coding regions. Current approaches to defining miRNA-mRNA interactions are mostly restricted to bio-informatic prediction, protein down-regulation following in vitro transfection of miRNA precursors, and luciferase assays to determine binding to 3' UTRs. None of these methods, however, shows direct interaction between a specific miRNA and its purported target RNA. Bio-informatics-based approaches are also prone to false-positive and false-negative results, given the short length of sequence matching and their reliance on heuristics and cross-species conservation. Newer genome-wide approaches such as HITS-CLIP (High Throughput Sequencing following Cross Linked Immuno Precipitation, or CLIP-Seq) overcome some of these limitations by directly isolating the miRNA-mRNA interactome bound to argonaute (AGO), a critical component of the RNA-induced silencing complex (RISC) [1]. HITS-CLIP utilizes the ability of ultraviolet (UV) light to cross-link RNAs to proteins in their close proximity. The cross-linked miRNA-mRNA-AGO complexes are then isolated, and the RNA is reverse transcribed into cDNA libraries and sequenced by next generation sequencing (NGS).
Given the widespread role of miRNAs in several vertebrate tissues, we hypothesized that miRNA regulation of gene expression is operant in the hematopoietic microenvironment (ME) and thus contributes to the regulation of hematopoiesis.
We hence used HITS-CLIP to analyze the miRNA-mRNA interactome of three key cellular components of the ME: stromal cells, endothelium and macrophages. We have previously reported on the use of the stromal cell lines Hs27a and Hs5 to define specific functional niches within the ME. Hs27a can functionally support primitive hematopoietic stem and progenitor cells (HSPC) in cobblestone areas (CSAs) and expresses high levels of factors known to support HSPC, such as SDF1, Jagged1 and Angiopoietin1. In contrast, Hs5 drives HSPC to mature lineages and secretes high levels of cytokines such as IL1, IL6 and GCSF. Human umbilical vein endothelial cells (HUVECs) and MCSF-treated CD14+ cells were utilized for the endothelial and macrophage cultures, respectively. The HITS-CLIP datasets from each of these populations were enriched for a putative binding site for miR-9 in the coding region of Matrix Metalloproteinase 2 (MMP2) mRNA. MMP2 belongs to a family of endopeptidases critical in the remodeling of extracellular matrix in several tissues and in the egress/homing of HSPC to their functional niches in the ME. Functional binding of miR-9 to MMP2 was validated by Western blotting of stromal cells transfected with miR-9, which revealed a > 50% reduction in protein levels compared with control-transfected cells. This was also confirmed by gelatin zymography, which showed significantly reduced MMP2 activity in stromal cells transfected with miR-9. Finally, to confirm direct binding of miR-9 to the putative binding region on the MMP2 transcript, we cloned this microRNA responsive region (MRE) downstream of the Renilla luciferase gene and measured its activity by luciferase assays. MiR-9 transfection down-regulated luciferase activity by > 50%, confirming direct binding to the MRE.
Our results show that genome-wide approaches such as HITS-CLIP can be used to define in vivo miRNA-mRNA interactions in the ME and should be considered in studies that define such interactions, given the significant false-positive and false-negative results associated with approaches based on bio-informatics alone. The approach can also define specific interactions between miRNAs and mRNAs, such as MMP2, that are relevant to regulation of the hematopoietic ME.
Disclosures: No relevant conflicts of interest to declare.

Articles published on Cross-species Conservation

RhesusBase: a knowledgebase for the monkey research community

S1P1 inhibits sprouting angiogenesis during vascular development

29 mammalian genomes reveal novel exaptations of mobile elements for likely regulatory functions in the human genome.

PsRobot: a web-based plant small RNA meta-analysis toolbox

The SAM Domain of Human TEL2 Can Abrogate Transcriptional Output from TEL1 (ETV-6) and ETS1/ETS2

Localizing transcriptional regulatory elements at the mouse Dlk1 locus.

Integrated cloud environment for characterization of genotype specific transcriptome from next generation sequencing data

Digital gene expression data, cross-species conservation and noncoding RNA

Novel compound heterozygous mutations of TGM1 gene identified in a Chinese collodion baby

Polar assembly and scaffolding proteins of the virulence-associated ESX-1 secretory apparatus in mycobacteria.

Finding Transcription Factor Binding Motifs for Coregulated Genes by Combining Sequence Overrepresentation with Cross-Species Conservation

Genome-wide association between DNA methylation and alternative splicing in an invertebrate

High Throughput Sequencing Following Cross-Linked Immune Precipitation (HITS-CLIP) of Argonaute (AGO) Identifies Mir-9 As a Regulator of MMP2 in the Marrow Microenvironment (ME)

Rigorous and thorough bioinformatic analyses of olfactory receptor promoters confirm enrichment of O/E and homeodomain binding sites but reveal no new common motifs

Role of CpG context and content in evolutionary signatures of brain DNA methylation

Translation Initiator EIF4G1 Mutations in Familial Parkinson Disease

Cross-Species Conservation of Open-Channel Block by Na Channel β4 Peptides Reveals Structural Features Required for Resurgent Na Current

Combinatorial Regulation of Photoreceptor Differentiation Factor, Neural Retina Leucine Zipper Gene Nrl, Revealed by in Vivo Promoter Analysis

Nuclear Outsourcing of RNA Interference Components to Human Mitochondria

SNPs occur in regions with less genomic sequence conservation.
