Learning peptide properties with positive examples only.

Mehrad Ansari, Andrew D White

doi:10.1039/d3dd00218g

Abstract

Deep learning can create accurate predictive models by exploiting existing large-scale experimental data, and guide the design of molecules. However, a major barrier is the requirement of both positive and negative examples in classical supervised learning frameworks. Notably, most peptide databases come with missing information and a low number of observations on negative examples, as such sequences are hard to obtain using high-throughput screening methods. To address this challenge, we solely exploit the limited known positive examples in a semi-supervised setting, and discover peptide sequences that are likely to map to certain antimicrobial properties via positive-unlabeled (PU) learning. In particular, we use two learning strategies, adapting the base classifier and reliable negative identification, to build deep learning models for inferring solubility, hemolysis, binding against SHP-2, and non-fouling activity of peptides, given their sequence. We evaluate the predictive performance of our PU learning method and show that, by using only the positive data, it can achieve competitive performance compared with the classical positive-negative (PN) classification approach, which has access to both positive and negative examples.
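The reliable negative identification strategy named in the abstract can be illustrated with a generic two-step sketch. This is not the authors' implementation: the abstract describes deep learning models on peptide sequences, whereas the sketch below assumes a plain logistic regression on toy 2-D features, and every name in it (`fit_logistic`, `pu_two_step`, `neg_fraction`) is hypothetical. The idea is to first train a classifier that treats all unlabeled examples as negative, then keep only the lowest-scoring unlabeled examples as "reliable negatives" and retrain on positives versus those.

```python
import numpy as np

def fit_logistic(X, y, lr=0.1, epochs=500):
    """Gradient-descent logistic regression; returns (weights, bias)."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        grad = p - y                      # gradient of the log loss w.r.t. the logit
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b

def predict_proba(X, w, b):
    """Probability of the positive class under the fitted model."""
    return 1.0 / (1.0 + np.exp(-(X @ w + b)))

def pu_two_step(X_pos, X_unl, neg_fraction=0.3):
    """Two-step PU sketch (assumed variant): pick reliable negatives, then retrain."""
    # Step 1: temporarily treat every unlabeled example as negative.
    X = np.vstack([X_pos, X_unl])
    y = np.concatenate([np.ones(len(X_pos)), np.zeros(len(X_unl))])
    w, b = fit_logistic(X, y)

    # Step 2: the lowest-scoring unlabeled examples become reliable negatives.
    scores = predict_proba(X_unl, w, b)
    n_neg = max(1, int(neg_fraction * len(X_unl)))
    reliable_idx = np.argsort(scores)[:n_neg]

    # Step 3: retrain on known positives versus reliable negatives only.
    X_rn = X_unl[reliable_idx]
    X2 = np.vstack([X_pos, X_rn])
    y2 = np.concatenate([np.ones(len(X_pos)), np.zeros(len(X_rn))])
    w2, b2 = fit_logistic(X2, y2)
    return w2, b2, reliable_idx

# Toy data (an assumption for illustration): positives near (2, 2),
# unlabeled pool = 30 hidden positives followed by 70 hidden negatives near (-2, -2).
rng = np.random.default_rng(0)
X_pos = rng.normal(loc=2.0, size=(50, 2))
hidden_pos = rng.normal(loc=2.0, size=(30, 2))
hidden_neg = rng.normal(loc=-2.0, size=(70, 2))
X_unl = np.vstack([hidden_pos, hidden_neg])

w, b, reliable_idx = pu_two_step(X_pos, X_unl)
```

The `neg_fraction` threshold is the main design choice in this sketch: a smaller value yields fewer but cleaner negatives, while a larger value risks pulling hidden positives into the negative set.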


Journal: Digital Discovery
Publication Date: Jan 1, 2024
License type: CC BY 3.0

Similar Papers

Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning.
Hyebin Song ... Bennett J Bremer
Cell systems | VOL. 12
18 Nov 2020

Positive-Unlabelled learning for identifying new candidate Dietary Restriction-related genes among ageing-related genes
Jorge Paz-Ruza ... Bertha Guijarro-Berdiñas
Computers in Biology and Medicine | VOL. 180
12 Aug 2024

Similarity-based approach for positive and unlabelled learning
16 Jul 2011

Enhancing Knowledge Graph Completion with Positive Unlabeled Learning
Jinghao Niu ... Wensheng Zhang
01 Aug 2018
