Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application).

Peng Zhou,Kevin At Silverstein,Sumitha Nallu,Jonathan D Walton,Nevin D Young,Joseph Guhlin,Liangliang Gao

doi:10.1186/1471-2105-14-335

Abstract

BackgroundSmall peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. However, popular similarity search tools and gene prediction techniques generally fail to identify most members belonging to this class of genes. This is largely due to the high sequence divergence among family members and the limited availability of experimentally verified small peptides to use as training sets for homology search and ab initio prediction. Consequently, there is an urgent need for both experimental and computational studies in order to further advance the accurate prediction of small peptides.ResultsWe present here a homology-based gene prediction program to accurately predict small peptides at the genome level. Given a high-quality profile alignment, SPADA identifies and annotates nearly all family members in tested genomes with better performance than all general-purpose gene prediction programs surveyed. We find numerous mis-annotations in the current Arabidopsis thaliana and Medicago truncatula genome databases using SPADA, most of which have RNA-Seq expression support. We also show that SPADA works well on other classes of small secreted peptides in plants (e.g., self-incompatibility protein homologues) as well as non-secreted peptides outside the plant kingdom (e.g., the alpha-amanitin toxin gene family in the mushroom, Amanita bisporigera).ConclusionsSPADA is a free software tool that accurately identifies and predicts the gene structure for short peptides with one or two exons. SPADA is able to incorporate information from profile alignments into the model prediction process and makes use of it to score different candidate models. SPADA achieves high sensitivity and specificity in predicting small plant peptides such as the cysteine-rich peptide families. A systematic application of SPADA to other classes of small peptides by research communities will greatly improve the genome annotation of different protein families in public genome databases.

Highlights

Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses
Our approach focuses on finding all related paralogous genes within a target gene family and using signals from the corresponding multiple sequence alignment to aid in refining the model predictions
Performance evaluation of SPADA on plant Cysteine Rich Peptide (CRP) families SPADA performance under different search E-value thresholds Using our manually-curated high-quality Cysteine-Rich Peptide (CRP) test set from Arabidopsis and Medicago, we first evaluated the performance of SPADA under different search E-value thresholds

Summary

Introduction

Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. Our approach focuses on finding all related paralogous genes within a target gene family and using signals from the corresponding multiple sequence alignment to aid in refining the model predictions We have implemented this approach in an open-source and freely available application called SPADA (Small Peptide Alignment Discovery Application). SPADA can be used directly with a user’s own protein family alignments or with a comprehensive set of protein family alignments from public sources such as Pfam [8], InterPro [9] or PROSITE [10], enabling the exhaustive discovery of essentially all members of the input families within a given genome sequence Because these public resources continue to expand and include new and novel protein families, SPADA’s ability to comprehensively identify arbitrarily large families of small peptides in genomes will steadily grow

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Nov 20, 2013
Citations: 137	License type: cc-by

R Discovery Prime

R Discovery Prime

Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application).

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Research Progress of Small Plant Peptides on the Regulation of Plant Growth, Development, and Abiotic Stress.
Guocheng Ren ... Zengting Chen
International Journal of Molecular Sciences | VOL. 25
Guocheng Ren, et. al.Guocheng Ren ... Zengting Chen
08 Apr 2024
International Journal of Molecular Sciences | VOL. 25

Shining in the dark: the big world of small peptides in plants.
Yan-Zhao Feng ... Yang Yu
aBIOTECH | VOL. 4
Yan-Zhao Feng, et. al.Yan-Zhao Feng ... Yang Yu
08 Apr 2023
aBIOTECH | VOL. 4

A benchmark study of ab initio gene prediction methods in diverse eukaryotic organisms
Nicolas Scalzitti ... Pierre Collet
BMC Genomics | VOL. 21
Nicolas Scalzitti, et. al.Nicolas Scalzitti ... Pierre Collet
09 Apr 2020
BMC Genomics | VOL. 21

Annotating Viral Genomes - A Cannon is Needed to Kill Mosquitoes
Shiliang Wang
Current Bioinformatics | VOL. 9
Shiliang WangShiliang Wang
31 Mar 2014
Current Bioinformatics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application).

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics