Iterative Search Strategy Research Articles

The amount of genomic sequence information continues to grow at an exponential rate, while the identification and characterization of genes without known homologs remains a major challenge. For non-model organisms with limited resources for manipulative studies, high-throughput transcriptomic data combined with bioinformatics methods provide a powerful approach to obtain initial insights into the function of unknown genes. In this study, we report the identification and characterization of a novel family of putatively secreted, small, cysteine-rich proteins, herein named Small Cysteine-Rich Proteins (SCRiPs). Their discovery in expressed sequence tag (EST) libraries from the coral Montastraea faveolata required an iterative search strategy based on BLAST and Hidden-Markov-Model algorithms. Although no discernible homolog could be identified in either the genome of the sea anemone Nematostella vectensis or a large EST dataset from the symbiotic sea anemone Aiptasia pallida, we identified SCRiP sequences in multiple scleractinian coral species. Therefore, we postulate that this gene family is an example of lineage-specific gene expansion in reef-building corals. Previously published gene expression microarray data suggest that a sub-group of SCRiPs is highly responsive to thermal stress. Furthermore, data from microarray experiments investigating developmental gene expression in the coral Acropora millepora suggest that different SCRiPs may play distinct roles in the development of corals. The function of these proteins remains to be elucidated, but our results from in silico, transcriptomic, and phylogenetic analyses provide initial insights into the evolution of SCRiPs, a novel, taxonomically restricted gene family that may be responsible for a lineage-specific trait in scleractinian corals.


Because proteins that have diverged beyond significant sequence similarity still retain the three-dimensional (3D) fold of their ancestor (Chothia and Lesk, 1986; Rost, 1997), the recognition of structural similarity between proteins provides powerful clues to ancestry. In fact, a large number of distant homology relationships were identified only after the structures of the proteins had been solved (Murzin, 1998). However, structures are being determined only for a small fraction of the proteins. There is a pressing need for improvement in the performance of sequence-based methods for the detection of proteins with the same fold but scant sequence similarity. Here, we examine how to achieve this goal by combining three kinds of information from a protein sequence. First, it has long been recognized that the use of multiply aligned sequences from a protein family improves the sensitivity of homology detection. This idea is used by many recent computational procedures that exploit evolutionary information to uncover subtle sequence similarity. Examples of such procedures include sequence profiles (Gribskov et al., 1987), consensus templates or motifs (Taylor, 1986; Bairoch, 1991; Tatusov et al., 1994; Yi and Lander, 1994), position-specific scoring matrices (PSSMs) (Henikoff and Henikoff, 1997), profile hidden Markov models (Eddy, 1998), and intermediate sequence methods (Holm and Sander, 1997; Neuwald et al., 1997; Park et al., 1997). PSI-BLAST (Altschul et al., 1997), one of the most widely used of these procedures, employs an iterative profile search strategy that combines the advantages of both PSSM and intermediate sequence methods. This program has been used effectively by several groups to assign 3D folds to predicted genome products (Teichmann et al., 1999). Second, proteins having the same fold also by definition have very similar secondary structures. In the light of the improved accuracy of secondary structure prediction (Rost and Sander, 1993), several groups have attempted to use sequence-derived predictions to improve the sensitivity of fold recognition (Fischer and Eisenberg, 1996; Russel et al., 1996; Di Francesco et al., 1997; Rice and Eisenberg, 1997; Rost et al., 1997). These methods usually represent each protein in a template library by a one-dimensional (1D) string of symbols (profiles) each representing a distinctive 3D structural state, and then use dynamic programming (Needleman and Wunsch, 1970) to align the predicted structural profiles of the query
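As a rough illustration of the dynamic-programming step mentioned in this abstract, the sketch below globally aligns two 1D secondary-structure strings in the style of Needleman and Wunsch. It is not the authors' method: the three-letter alphabet (H = helix, E = strand, C = coil), the function name `align_structure_strings`, and the match, mismatch, and gap scores are all assumptions chosen for the example.

```python
def align_structure_strings(query, template, match=1, mismatch=-1, gap=-2):
    """Globally align two 1D structural-state strings (hypothetical scoring).

    Returns (score, aligned_query, aligned_template), with '-' marking gaps.
    """
    n, m = len(query), len(template)

    def sub(i, j):
        # Substitution score for query[i-1] against template[j-1].
        return match if query[i - 1] == template[j - 1] else mismatch

    # score[i][j] = best score aligning query[:i] with template[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align the two symbols
                score[i - 1][j] + gap,            # gap in the template
                score[i][j - 1] + gap,            # gap in the query
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    i, j = n, m
    aligned_q, aligned_t = [], []
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            aligned_q.append(query[i - 1])
            aligned_t.append(template[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            aligned_q.append(query[i - 1])
            aligned_t.append("-")
            i -= 1
        else:
            aligned_q.append("-")
            aligned_t.append(template[j - 1])
            j -= 1

    return score[n][m], "".join(reversed(aligned_q)), "".join(reversed(aligned_t))


if __name__ == "__main__":
    # Predicted states for a query vs. a template entry (made-up strings).
    print(align_structure_strings("CHHHHCEEEC", "CHHHCEEEEC"))
```

In a fold-recognition setting, a score like this would presumably be computed against every entry in the template library and the top-scoring templates kept as candidate folds; real methods use richer state alphabets and position-dependent scores than this toy scheme.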

Articles published on Iterative Search Strategy

Iterative search strategy with selective bi-directional prediction for low complexity multiview video coding

Minimum Zone Evaluation of Flatness Error Using an Adaptive Iterative Strategy for Coordinate Measuring Machines Data

Swiftly Computing Center Strings

Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions

A metasynthesis of midwives’ experience of hospital practice in publicly funded settings: compliance, resistance and authenticity

Identification and Gene Expression Analysis of a Taxonomically Restricted Cysteine-Rich Protein Family in Reef-Building Corals

Mirador: A Simple Fast Search Interface for Global Remote Sensing Data Sets

ML Symbol Detection Based on the Shortest Path Algorithm for MIMO Systems

Targeted Analysis of Protein Termini

What is the empirical evidence that hospitals with higher-risk adjusted mortality rates provide poorer quality care? A systematic review of the literature.

Cut-and-solve

Cut-and-solve: An iterative search strategy for combinatorial optimization problems

Dynamic Spectrum Quality Assessment and Iterative Computational Analysis of Shotgun Proteomic Data

Comparison of Normal and Breast Cancer Cell Lines Using Proteome, Genome, and Interactome Data

Reducing the Computation Time of Nonlinear Problems by an Adaptive Linear System Tolerance

Sequence-based detection of distantly related proteins with the same fold.

SCHEDULING OPEN SHOPS TO MINIMIZE TOTAL WEIGHTED TARDINESS

SAVVYSEARCH: A Metasearch Engine That Learns Which Search Engines to Query

Changing Representations During Search: A Comparative Study of Delta Coding

Optimal production of secreted protein in fed‐batch reactors

