Abstract

BackgroundDaphnia pulex (Water flea) is the first fully sequenced crustacean genome. The crustaceans and insects have diverged from a common ancestor. It is a model organism for studying the molecular makeup for coping with the environmental challenges. In the complete proteome, there are 30,550 putative proteins. However, about 10,000 of them have no known homologues. Currently, the UniProtoKB reports on 95% of the Daphnia's proteins as putative and uncharacterized proteins.ResultsWe have applied ProtoNet, an unsupervised hierarchical protein clustering method that covers about 10 million sequences, for automatic annotation of the Daphnia's proteome. 98.7% (26,625) of the Daphnia full-length proteins were successfully mapped to 13,880 ProtoNet stable clusters, and only 1.3% remained unmapped. We compared the properties of the Daphnia's protein families with those of the mouse and the fruitfly proteomes. Functional annotations were successfully assigned for 86% of the proteins. Most proteins (61%) were mapped to only 2953 clusters that contain Daphnia's duplicated genes. We focused on the functionality of maximally amplified paralogs. Cuticle structure components and a variety of ion channels protein families were associated with a maximal level of gene amplification. We focused on gene amplification as a leading strategy of the Daphnia in coping with environmental toxicity.ConclusionsAutomatic inference is achieved through mapping of sequences to the protein family tree of ProtoNet 6.0. Applying a careful inference protocol resulted in functional assignments for over 86% of the complete proteome. We conclude that the scaffold of ProtoNet can be used as an alignment-free protocol for large-scale annotation task of uncharacterized proteomes.

Highlights

  • IntroductionThe crustaceans and insects have diverged from a common ancestor

  • Daphnia pulex (Water flea) is the first fully sequenced crustacean genome

  • We limited the analysis to 26,968 (88%) proteins that are full length. We mapped these proteins to the ProtoNet tree that was pruned to ensure high confidence clusters

Read more

Summary

Introduction

The crustaceans and insects have diverged from a common ancestor It is a model organism for studying the molecular makeup for coping with the environmental challenges. Daphnia pulex is a key player in the aquatic ecosystems and an important component in the food web. It is a model organism for studying environmental challenges including toxic conditions [1]. They exhibit extraordinary levels of Comparative genomics approaches are useful for the discovery of functional elements from newly sequenced genomes [3]. The value of a comparative genomics research from its related proteomes (i.e., insects) might be somewhat limited

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call