Abstract
The comprehensive mapping of protein-protein interactions (PPIs) is highly desired for one to gain deep insights into both fundamental cell biology processes and the pathology of diseases. Finely-set small-scale experiments are not only very expensive but also inefficient to identify numerous interactomes despite their high accuracy. High-throughput screening techniques enable efficient identification of PPIs; yet the desire to further extract useful knowledge from these data leads to the problem of binary interactome mapping. Network topology-based approaches prove to be highly efficient in addressing this problem; however, their performance deteriorates significantly on sparse putative PPI networks. Motivated by the success of collaborative filtering (CF)-based approaches to the problem of personalized-recommendation on large, sparse rating matrices, this work aims at implementing a highly efficient CF-based approach to binary interactome mapping. To achieve this, we first propose a CF framework for it. Under this framework, we model the given data into an interactome weight matrix, where the feature-vectors of involved proteins are extracted. With them, we design the rescaled cosine coefficient to model the inter-neighborhood similarity among involved proteins, for taking the mapping process. Experimental results on three large, sparse datasets demonstrate that the proposed approach outperforms several sophisticated topology-based approaches significantly.
Highlights
The proposed rescaled cosine coefficient (RCC)-based collaborative filtering (CF) (RCF) approach to BIM is highly flexible, and is able to work depending on binary high-throughput screening (HTS)-protein-protein interactions (PPIs) data solely
It is fair and reasonable to compare the proposed RCF against sophisticated topology-based methods, which are well known for their efficiency and dependence on HTS-PPI data only
From the Results Section, we see that the efficiency of the proposed RCF in addressing the BIM problem is supported by the experimental results
Summary
High-throughput screening techniques enable efficient identification of PPIs; yet the desire to further extract useful knowledge from these data leads to the problem of binary interactome mapping. The progress of high-throughput screening (HTS) techniques, e.g., canonical yeast two-hybrid assay, tandem affinity purification and mass spectrometric, mass spectrometric protein complex identification, and protein fragment complementation, has resulted in rapid accumulation of data describing global networks of PPIs in organisms. Several HTS-PPI datasets were published for various organisms, such as humans (Homo sapiens), worms (Caenorhabditis elegans), yeast (Saccharomyces cerevisiae), fly (Drosophila melanogaster), and plants. Several HTS-PPI datasets were published for various organisms, such as humans (Homo sapiens), worms (Caenorhabditis elegans), yeast (Saccharomyces cerevisiae), fly (Drosophila melanogaster), and plants9 With these obtained HTS-PPI data, great opportunities in studying biological events are unprecedented. HTS-PPI data have made advances to identify the PPI networks, it is desired to extract more useful knowledge from them.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.