NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations

Shaojun Wang,Ronghui You,Yunjia Liu,Yi Xiong,Shanfeng Zhu

doi:10.1016/j.gpb.2023.04.001

Shaojun Wang, Ronghui You + Show 3 more

Open Access

https://doi.org/10.1016/j.gpb.2023.04.001

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

As one of the state-of-the-art automated function prediction (AFP) methods, NetGO 2.0 integrates multi-source information to improve the performance. However, it mainly utilizes the proteins with experimentally supported functional annotations without leveraging valuable information from a vast number of unannotated proteins. Recently, protein language models have been proposed to learn informative representations [e.g., Evolutionary Scale Modeling (ESM)-1b embedding] from protein sequences based on self-supervision. Here, we represented each protein by ESM-1b and used logistic regression (LR) to train a new model, LR-ESM, for AFP. The experimental results showed that LR-ESM achieved comparable performance with the best-performing component of NetGO 2.0. Therefore, by incorporating LR-ESM into NetGO 2.0, we developed NetGO 3.0 to improve the performance of AFP extensively. NetGO 3.0 is freely accessible at https://dmiip.sjtu.edu.cn/ng3.0.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Genomics, Proteomics & Bioinformatics	Publication Date: Apr 1, 2023
Citations: 28	License type: CC BY 4.0

R Discovery Prime

NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations

Abstract

Published Version

Talk to us

Similar Papers

More From: Genomics, Proteomics & Bioinformatics

Lead the way for us

Similar Papers

The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.
Ishita K Khan ... Samuel Chapman
GigaScience | VOL. 4
Ishita K Khan, et. al.Ishita K Khan ... Samuel Chapman
14 Sep 2015
The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.
Ishita K Khan ... Samuel Chapman

New avenues in protein function prediction
Iddo Friedberg ... Adam Godzik
Protein Science | VOL. 15
Iddo Friedberg, et. al.Iddo Friedberg ... Adam Godzik
01 Jun 2006
Protein Science | VOL. 15

Extensive complementarity between gene function prediction methods.
Vedrana Vidulin ... Fran Supek
Bioinformatics | VOL. 32
Vedrana Vidulin, et. al.Vedrana Vidulin ... Fran Supek
13 Aug 2016
Bioinformatics | VOL. 32

Enhanced automated function prediction using distantly related sequences and contextual association by PFP
Troy Hawkins ... Stanislav Luban
Protein Science | VOL. 15
Troy Hawkins, et. al.Troy Hawkins ... Stanislav Luban
01 Jun 2006
Protein Science | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations

Abstract

Published Version

Talk to us

Similar Papers

More From: Genomics, Proteomics &amp; Bioinformatics

More From: Genomics, Proteomics & Bioinformatics