Boost-RS: boosted embeddings for recommender systems and its application to enzyme-substrate interaction prediction.

Xinmeng Li, Soha Hassoun, Li-Ping Liu

doi:10.1093/bioinformatics/btac201

Abstract

Motivation: Despite experimental and curation efforts, the extent of enzyme promiscuity on substrates continues to be largely unexplored and underdocumented. Providing computational tools for the exploration of the enzyme–substrate interaction space can expedite experimentation and benefit applications such as constructing synthesis pathways for novel biomolecules, identifying products of metabolism on ingested compounds, and elucidating xenobiotic metabolism. Recommender systems (RS), which are currently unexplored for the enzyme–substrate interaction prediction problem, can be utilized to provide enzyme recommendations for substrates, and vice versa. The performance of Collaborative-Filtering (CF) RSs, however, hinges on the quality of the embedding vectors of users and items (enzymes and substrates in our case). Importantly, enhancing CF embeddings with heterogeneous auxiliary data, especially relational data (e.g. hierarchical, pairwise or groupings), remains a challenge.

Results: We propose an innovative general RS framework, termed Boost-RS, that enhances RS performance by ‘boosting’ embedding vectors through auxiliary data. Specifically, Boost-RS is trained and dynamically tuned on multiple relevant auxiliary learning tasks. Boost-RS utilizes contrastive learning tasks to exploit relational data. To show the efficacy of Boost-RS for the enzyme–substrate interaction prediction problem, we apply the Boost-RS framework to several baseline CF models. We show that each of our auxiliary tasks boosts learning of the embedding vectors, and that contrastive learning using Boost-RS outperforms attribute concatenation and multi-label learning. We also show that Boost-RS outperforms similarity-based models. Ablation studies and visualization of learned representations highlight the importance of using contrastive learning on some of the auxiliary data in boosting the embedding vectors.

Availability and implementation: A Python implementation of Boost-RS is provided at https://github.com/HassounLab/Boost-RS. The enzyme–substrate interaction data are available from the KEGG database (https://www.genome.jp/kegg/).
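To make the idea of boosting CF embeddings with a contrastive auxiliary task more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a dot-product CF score trained with binary cross-entropy as the main task and a triplet margin loss as the contrastive auxiliary task. The function names, the toy embedding values, and the fixed `aux_weight` are illustrative assumptions; the abstract states that Boost-RS tunes its auxiliary tasks dynamically, which this sketch does not attempt.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def sq_dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

def interaction_loss(enzyme_vec, substrate_vec, label):
    """Main task (assumed form): binary cross-entropy on a dot-product score."""
    p = 1.0 / (1.0 + math.exp(-dot(enzyme_vec, substrate_vec)))
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Auxiliary task (assumed form): pull related embeddings together and
    push unrelated ones at least `margin` further away."""
    return max(0.0, sq_dist(anchor, positive) - sq_dist(anchor, negative) + margin)

def combined_loss(main_terms, aux_terms, aux_weight=0.5):
    """Mean main loss plus a weighted mean auxiliary loss.
    The fixed weight is a simplification of this sketch."""
    main = sum(main_terms) / len(main_terms)
    aux = sum(aux_terms) / len(aux_terms)
    return main + aux_weight * aux

# Hypothetical 2-d embeddings: e1 and e2 are assumed to be related enzymes
# (for example, sharing a group in some relational data); e3 is unrelated.
e1, e2, e3 = [1.0, 0.0], [0.9, 0.1], [-1.0, 0.5]
s1 = [1.0, 1.0]  # a substrate assumed to interact with e1

main = interaction_loss(e1, s1, label=1)
aux_ok = triplet_loss(e1, e2, e3)   # related pair is already closer: no penalty
aux_bad = triplet_loss(e1, e3, e2)  # roles swapped: penalised
total = combined_loss([main], [aux_bad])
```

In this toy setting the auxiliary term is zero when the related enzymes are already closer to each other than to the unrelated one, so only embeddings that contradict the relational data contribute to the combined loss.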

Journal: Bioinformatics (Oxford, England)
Publication Date: Apr 12, 2022
License type: CC BY-NC 4.0

Similar Papers

Deep Transfer Learning via Restricted Boltzmann Machine for Document Classification
Jian Zhang
01 Dec 2011

A collaborative filtering recommendation algorithm based on DeepWalk and self-attention
Jiaming Guo ... Weihong Huang
International Journal of Computational Science and Engineering | VOL. 26
01 Jan 2023

Transfer learning in collaborative filtering
Weike Pan
23 Dec 2014

Benchmarking big data recommendation algorithms using Hadoop or Apache Spark
Dinesh Kumar Saini ... Kashif Zia
04 Jul 2019
