Identification Of Regulatory Motifs Research Articles

One of the most difficult challenges in molecular biology as well as computer science is finding patterns in DNA sequences. Identification of regulatory motifs is critical for understanding the gene expression. The essential concept in gene expression is that each a gene encodes the instructions for making a protein. The process of expression begins with the binding of several recognised protein factors. As transcription factors, they bind to enhancer and promoter sequences (Li and Li, 2019). Transcription is the first stage, which involves creating an RNA "copy" of a section of the DNA. This RNA sequence is read and interpreted to create a protein in the second stage of the process, known as translation. Gene expression is the combined result of these two actions. Numerous regulatory transcription factors (TFs), also known as Transcription Factor Binding Sites (TFBS), bind to certain DNA regions to control gene expression. In the past ten years, a significant new method for understanding transcription regulation networks has emerged: the computational identification of TFBS through the study of DNA sequence data (Ruzicka et al., 2017). Finding sequence motifs can be challenging since intergenic regions are extremely long and highly varied, while sequence motifs are small (approximately 6–12 bp). Sequence motifs are frequently repeated and conserved, and they have a fixed size. These patterns are critical for identifying Transcription Factor Binding Sites (TF-BSs), which aids in understanding the mechanisms governing gene expression 3. Motifs can be classified as planted, structured, gapped, sequence, network, and motifs (Hashim et al., 2019). An important issue in computational biology is the finding of weak motifs. It is challenging to solve because there are so many inconsistencies between the actual theme and its altered variants that false signals may mask the real ones. Further, it is challenging to identify and uncover regulatory elements using computer algorithms since they are typically brief and varied. The task of solving the theme finding problem is that of discovering overrepresented motifs as well as conserved motifs from the set of DNA sequences that are good candidates for becoming sites where transcription factors bind. Transcription factor is a protein that functions as a gene expression regulator, specifically regulating the start of the transcription process that produces mRNA using DNA as a template. The common sequence is called a motif. A "pattern" in a transcription factor's binding sites. Finding motifs will aid in the development of illness therapies and comprehension disease susceptibility (Mohanty and Mohanty, 2020). Many techniques for analysing gene function start with the finding of a DNA motif. Finding Transcription Factor Binding Sites (TFBSs), which aid in understanding the mechanisms for controlling gene expression, is a crucial part of motif discovery. The development of quick and precise motif discovery technologies has utilised a variety of algorithms over the years. These algorithms are typically categorised as probabilistic or consensus techniques, and many of them take a lot of time to run and are prone to get stuck in local optimums. Recently, solutions to these issues have been offered using both nature-inspired algorithms and a variety of combinatorial algorithms (Hashim et al., 2019).

Read full abstract

Multiple transcription factors (TFs) coordinately control transcriptional regulation of genes in eukaryotes. Although numerous computational methods focus on the identification of individual TF-binding sites (TFBSs), very few consider the interdependence among these sites. In this article, we studied the relationship between TFBSs and microarray gene expression levels using both family-wise and memberspecific motifs, under various combination of regression models with Bayesian variable selection, as well as motif scoring and sharing conditions, in order to account for the coordination complexity of transcription regulation. We proposed a three-step approach to model the relationship. In the first step, we preprocessed microarray data and used p-values and expression ratios to preselect upregulated and downregulated genes. The second step aimed to identify and score individual TFBSs within DNA sequence of each gene. A method based on the degree of similarity and the number of TFBSs was employed to calculate the score of each TFBS in each gene sequence. In the last step, linear regression and probit regression were used to build a predictive model of gene expression outcomes using these TFBSs as predictors. Given a certain number of predictors to be used, a full search of all possible predictor sets is usually combinatorially prohibitive. Therefore, this article considered the Bayesian variable selection for prediction using either of the regression models. The Bayesian variable selection has been applied in the context of gene selection, missing value estimation, and regulatory motif identification. In our modeling, the regressor was approximated as a linear combination of the TFBSs and a Gibbs sampler was employed to find the strongest TFBSs. We applied these regression models with the Bayesian variable selection on spinal cord injury gene expression data set. These TFs demonstrated intricate regulatory roles either as a family or as individual members in neuroinflammatory events. Our analysis can be applied to create plausible hypotheses for combinatorial regulation by TFBSs and avoiding false-positive candidates in the modeling process at the same time. Such a systematic approach provides the possibility to dissect transcription regulation, from a more comprehensive perspective, through which phenotypical events at cellular and tissue levels are moved forward by molecular events at gene transcription and translation levels.

Read full abstract

Identification Of Regulatory Motifs Research Articles

Related Topics

Articles published on Identification Of Regulatory Motifs

Comparison of different motif discovery algorithms

Signatures of protein expression revealed by secretome analyses of cancer associated fibroblasts and melanoma cell lines

Identification of regulatory motifs in the CHO genome for stable monoclonal antibody production.

In silico identification of regulatory motifs in co-expressed genes under osmotic stress representing their co-regulation

RMOD: A Tool for Regulatory Motif Detection in Signaling Network

An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs

An alternative approach to multiple genome comparison

Identifying functional relationships within sets of co-expressed genes by combining upstream regulatory motif analysis and gene expression information

Functions of Bifans in Context of Multiple Regulatory Motifs in Signaling Networks

Gene ordering in partitive clustering using microarray expressions

PEAKS: identification of regulatory motifs by their position in DNA sequences

Bayesian Variable Selection for Gene Expression Modeling With Regulatory Motif Binding Sites in Neuroinflammatory Events

A novel approach to identifying regulatory motifs in distantly related genomes

A suite of web-based programs to search for transcriptional regulatory motifs.

Methods in comparative genomics: genome correspondence, gene identification and regulatory motif discovery.

Genome wide identification of regulatory motifs in Bacillus subtilis

Oligo-capped cDNAs for promoter identification and annotation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Identification Of Regulatory Motifs Research Articles

Related Topics

Articles published on Identification Of Regulatory Motifs

Comparison of different motif discovery algorithms

Signatures of protein expression revealed by secretome analyses of cancer associated fibroblasts and melanoma cell lines

Identification of regulatory motifs in the CHO genome for stable monoclonal antibody production.

In silico identification of regulatory motifs in co-expressed genes under osmotic stress representing their co-regulation

RMOD: A Tool for Regulatory Motif Detection in Signaling Network

An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs

An alternative approach to multiple genome comparison

Identifying functional relationships within sets of co-expressed genes by combining upstream regulatory motif analysis and gene expression information

Functions of Bifans in Context of Multiple Regulatory Motifs in Signaling Networks

Gene ordering in partitive clustering using microarray expressions

PEAKS: identification of regulatory motifs by their position in DNA sequences

Bayesian Variable Selection for Gene Expression Modeling With Regulatory Motif Binding Sites in Neuroinflammatory Events

A novel approach to identifying regulatory motifs in distantly related genomes

A suite of web-based programs to search for transcriptional regulatory motifs.

Methods in comparative genomics: genome correspondence, gene identification and regulatory motif discovery.

Genome wide identification of regulatory motifs in Bacillus subtilis

Oligo-capped cDNAs for promoter identification and annotation