Sensitive Homology Search Research Articles

Background: A large number of sensitive homology searches are required for mapping DNA sequence fragments to known protein sequences in public and private databases during metagenomic analysis. BLAST is currently used for this purpose, but its calculation speed is insufficient, especially for analyzing the large quantities of sequence data obtained from a next-generation sequencer. However, faster search tools, such as BLAT, do not have sufficient search sensitivity for metagenomic analysis. Thus, a sensitive and efficient homology search tool is in high demand for this type of analysis.

Methodology/Principal Findings: We developed a new, highly efficient homology search algorithm suitable for graphics processing unit (GPU) calculations that was implemented as a GPU system that we called GHOSTM. The system first searches for candidate alignment positions for a sequence from the database using pre-calculated indexes and then calculates local alignments around the candidate positions before calculating alignment scores. We implemented both of these processes on GPUs. The system achieved calculation speeds that were 130 and 407 times faster than BLAST with 1 GPU and 4 GPUs, respectively. The system also showed higher search sensitivity and had a calculation speed that was 4 and 15 times faster than BLAT with 1 GPU and 4 GPUs, respectively.

Conclusions: We developed a GPU-optimized algorithm to perform sensitive sequence homology searches and implemented the system as GHOSTM. Sequencing technology continues to improve, and sequencers are producing increasingly large quantities of data. This explosion of sequence data makes computational analysis with contemporary tools more difficult. We developed GHOSTM, which is a cost-efficient tool, and offer this tool as a potential solution to this problem.
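The two-stage search described in this abstract (index lookup for candidate positions, then local alignment around each candidate) can be illustrated with a minimal CPU-only sketch. This is not the GHOSTM implementation: the function names, the exact-match k-mer index, the simple match/mismatch/gap scores, and the `flank` window size are all assumptions chosen for illustration, and the real system runs both stages on GPUs.

```python
from collections import defaultdict


def build_index(db_seqs, k):
    """Pre-calculate an index mapping each k-mer to its (sequence id, offset) pairs."""
    index = defaultdict(list)
    for seq_id, seq in enumerate(db_seqs):
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].append((seq_id, i))
    return index


def local_score(a, b, match=2, mismatch=-1, gap=-2):
    """Smith-Waterman local alignment score with a linear gap penalty.

    The scoring values are illustrative assumptions, not GHOSTM's parameters.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            cur[j] = max(0, prev[j - 1] + s, prev[j] + gap, cur[j - 1] + gap)
            best = max(best, cur[j])
        prev = cur
    return best


def search(query, db_seqs, index, k, flank=10):
    """Stage 1: find candidate positions via the index.
    Stage 2: score a local alignment in a window around each candidate.
    Returns the best score per database sequence that shares a k-mer with the query.
    """
    hits = {}
    for q in range(len(query) - k + 1):
        for seq_id, d in index.get(query[q:q + k], ()):
            target = db_seqs[seq_id]
            # Window that would contain the query if aligned at this seed, plus a margin.
            start = max(0, d - q - flank)
            end = min(len(target), d - q + len(query) + flank)
            score = local_score(query, target[start:end])
            hits[seq_id] = max(hits.get(seq_id, 0), score)
    return hits


db = ["MKTAYIAKQRQISFVK", "GGGGGGGGGG"]
index = build_index(db, k=3)
hits = search("AYIAKQR", db, index, k=3)
```

Restricting the alignment to a window around each seed hit is what keeps the second stage cheap; sequences with no shared k-mer are never aligned at all.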

Background: RNA interference (RNAi), mediated by 21-nucleotide (nt)-length small interfering RNAs (siRNAs), is a powerful tool not only for studying gene function but also for therapeutic applications. RNAi, requiring perfect complementarity between the siRNA guide strand and the target mRNA, was believed to be extremely specific. However, a recent growing body of evidence has suggested that siRNA could down-regulate unintended genes whose transcripts possess complementarity to the 7-nt siRNA seed region. This off-target gene silencing may often provide incongruous results obtained from knockdown experiments, leading to misinterpretation. Thus, an efficient algorithm for designing functional siRNAs with minimal off-target effect based on the mechanistic features is considered of value.

Results: We present siDirect 2.0, an update of our web-based software siDirect, which provides functional and off-target minimized siRNA design for mammalian RNAi. The previous version of our software designed functional siRNAs by considering the relationship between siRNA sequence and RNAi activity, and provided them along with the enumeration of potential off-target gene candidates by using a fast and sensitive homology search algorithm. In the new version, the siRNA design algorithm is extensively updated to eliminate off-target effects by reflecting our recent finding that the capability of siRNA to induce off-target effect is highly correlated to the thermodynamic stability, or the melting temperature (Tm), of the seed-target duplex, which is formed between the nucleotides positioned at 2-8 from the 5' end of the siRNA guide strand and its target mRNA. Selection of siRNAs with lower seed-target duplex stabilities (benchmark Tm < 21.5°C) followed by the elimination of unrelated transcripts with nearly perfect match should minimize the off-target effects.

Conclusion: siDirect 2.0 provides functional, target-specific siRNA design with the updated algorithm which significantly reduces off-target silencing. When the candidate functional siRNAs could form seed-target duplexes with Tm values below 21.5°C, and their 19-nt regions spanning positions 2-20 of both strands have at least two mismatches to any other non-targeted transcripts, siDirect 2.0 can design at least one qualified siRNA for >94% of human mRNA sequences in RefSeq. siDirect 2.0 is available at http://siDirect2.RNAi.jp/.
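The two selection criteria stated in this abstract (seed-target duplex Tm below 21.5°C, and at least two mismatches between the 19-nt region at positions 2-20 and any non-targeted transcript) can be expressed as a small filter. The sketch below is an illustration only, not siDirect's code: all function names are hypothetical, the Tm value is assumed to be computed elsewhere and passed in, and off-target sites are assumed to be pre-extracted 19-nt strings compared position by position.

```python
def seed_region(guide):
    """Nucleotides at positions 2-8 (1-based) from the 5' end of the guide strand."""
    return guide[1:8]


def region_2_20(strand):
    """The 19-nt region spanning positions 2-20 (1-based) of a strand."""
    return strand[1:20]


def mismatches(a, b):
    """Number of differing positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def passes_filters(seed_tm_celsius, region_19nt, off_target_sites,
                   tm_cutoff=21.5, min_mismatches=2):
    """Apply both criteria from the abstract.

    seed_tm_celsius is assumed to be supplied by the caller; this sketch
    does not compute melting temperatures.
    """
    if seed_tm_celsius >= tm_cutoff:
        return False
    return all(mismatches(region_19nt, site) >= min_mismatches
               for site in off_target_sites)


guide = "UACGUACGUACGUACGUACGU"   # made-up 21-nt guide strand
region = region_2_20(guide)
far_site = "GG" + region[2:]       # differs from region at two positions
near_site = "G" + region[1:]       # differs from region at one position
```

With these made-up inputs, `passes_filters(20.0, region, [far_site])` accepts the candidate, while a Tm of 25.0 or the presence of `near_site` among the off-target sites rejects it.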

Articles published on Sensitive Homology Search

VOGDB-Database of Virus Orthologous Groups.

Lambda3: homology search for protein, nucleotide, and bisulfite-converted sequences.

Tracing the evolution of the plant meiotic molecular machinery

A family of unusual immunoglobulin superfamily genes in an invertebrate histocompatibility complex

Computational Structural Genomics Unravels Common Folds and Novel Families in the Secretome of Fungal Phytopathogen Magnaporthe oryzae.

COMER2: GPU-accelerated sensitive and specific homology searches

Improve homology search sensitivity of PacBio data by correcting frameshifts.

Rapid similarity search of proteins using alignments of domain arrangements

The product of C9orf72, a gene strongly implicated in neurodegeneration, is structurally related to DENN Rab-GEFs.

GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics

SiDirect 2.0: updated software for designing functional siRNA with reduced seed-dependent off-target effect

PatternHunter: faster and more sensitive homology search

A human protein containing multiple types of protease-inhibitory modules.

Highly Sensitive Homology Search Methods on Parallel Computer

Oligopeptide biases in protein sequences and their use in predicting protein coding regions in nucleotide sequences.
