The proteome is complex, dynamic, and functionally diverse. Functional proteomics aims to characterize the functions of proteins in biological systems. However, there is a delay in annotating the function of proteins, even in model organisms. This gap is even greater in other organisms, including Trypanosoma cruzi, the causative agent of the parasitic, systemic, and sometimes fatal disease called Chagas disease. About 99.8% of Trypanosoma cruzi proteome is not manually annotated (unreviewed), among which>25% are conserved hypothetical proteins (CHPs), calling attention to the knowledge gap on the protein content of this organism. CHPs are conserved proteins among different species of various evolutionary lineages; however, they lack functional validation. This study describes a bioinformatics pipeline applied to public proteomic data to infer possible biological functions of conserved hypothetical Trypanosoma cruzi proteins. Here, the adopted strategy consisted of collecting differentially expressed proteins between the epimastigote and metacyclic trypomastigotes stages of Trypanosoma cruzi; followed by the functional characterization of these CHPs applying a manifold learning technique for dimension reduction and 3D structure homology analysis (Spalog). We found a panel of 25 and 26 upregulated proteins in the epimastigote and metacyclic trypomastigote stages, respectively; among these, 18 CHPs (8 in the epimastigote stage and 10 in the metacyclic stage) were characterized. The data generated corroborate the literature and complement the functional analyses of differentially regulated proteins at each stage, as they attribute potential functions to CHPs, which are frequently identified in Trypanosoma cruzi proteomics studies. However, it is important to point out that experimental validation is required to deepen our understanding of the CHPs.
Read full abstract