Abstract

Backgroundlarge scale and reliable proteins' functional annotation is a major challenge in modern biology. Phylogenetic analyses have been shown to be important for such tasks. However, up to now, phylogenetic annotation did not take into account expression data (i.e. ESTs, Microarrays, SAGE, ...). Therefore, integrating such data, like ESTs in phylogenetic annotation could be a major advance in post genomic analyses. We developed an approach enabling the combination of expression data and phylogenetic analysis. To illustrate our method, we used an example protein family, the peptidyl arginine deiminases (PADs), probably implied in Rheumatoid Arthritis.Resultsthe analysis was performed as follows: we built a phylogeny of PAD proteins from the NCBI's NR protein database. We completed the phylogenetic reconstruction of PADs using an enlarged sequence database containing translations of ESTs contigs. We then extracted all corresponding expression data contained in EST database This analysis allowed us 1/To extend the spectrum of homologs-containing species and to improve the reconstruction of genes' evolutionary history. 2/To deduce an accurate gene expression pattern for each member of this protein family. 3/To show a correlation between paralogous sequences' evolution rate and pattern of tissular expression.Conclusioncoupling phylogenetic reconstruction and expression data is a promising way of analysis that could be applied to all multigenic families to investigate the relationship between molecular and transcriptional evolution and to improve functional annotation.

Highlights

  • The "in silico" functional annotation of proteins generated by large scale sequencing projects is an important challenge in biology

  • This tree is the fusion on the NJ topology, of three phylogenetic trees built based on Neighbour Joining (NJ) [36], Maximum Parsimony (MP) [39], and Maximum Likelihood (ML) [40] methods

  • When performing analysis with reciprocal best BLAST hit on NR protein database; EST from Amphioxus and Teleostean is classified as an ortholog of peptidyl arginine deiminases (PADs)-2 only, but not as a coortholog of all PADs as is shown by phylogenetic analysis

Read more

Summary

Methodology article

A rigorous method for multigenic families' functional annotation: the peptidyl arginine deiminase (PADs) proteins family example. N Balandraud†1,2, P Gouret†1, EGJ Danchin, M Blanc, D Zinn, J Roudier and P Pontarotti*1. Address: 1EA 3781, Evolution Biologique, Université de Provence, 3 pl. V. Hugo, 13331 Marseille Cedex 03, France, 2INSERM UMR 639, Laboratoire d'Immunogénétique de la polyarthirte rhumatoïde, faculté de médecine la Timone, 13005 Marseille, France and 3AFMB-UMR 6098CNRS (U1 – U2) Glycogenomics and Biomedical Structural Biology, Case 932, 163 Avenue de Luminy, 13288 Marseille cedex 09, France. Published: 04 November 2005 BMC Genomics 2005, 6:153 doi:10.1186/1471-2164-6-153

Background
Results
Discussion
Conclusion
B16 F10Y cells Oocytes Trophoblast stem cell
Eisen JA
Sankoff D
34. Eddy SR
Findings
43. Kishino HHM
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.