Improving VAE based molecular representations for compound property prediction

Ani Tevosyan, Nelly Babayan, Lusine Khondkaryan, Hrant Khachatrian, Lilit Apresyan, Helga Stopper, Zaven Navoyan, Gohar Tadevosyan

doi:10.1186/s13321-022-00648-x

Abstract

Collecting labeled data for many important tasks in chemoinformatics is time-consuming and requires expensive experiments. In recent years, machine learning has been used to learn rich representations of molecules from large-scale unlabeled molecular datasets and to transfer that knowledge to more challenging tasks with limited data. Variational autoencoders are one of the tools that have been proposed to perform the transfer for both chemical property prediction and molecular generation tasks. In this work, we propose a simple method to improve the chemical property prediction performance of machine learning models by incorporating additional information on correlated molecular descriptors into the representations learned by variational autoencoders. We verify the method on three property prediction tasks. We explore the impact of the number of incorporated descriptors, the correlation between the descriptors and the target properties, and the sizes of the datasets, among other factors. Finally, we show the relation between the performance of property prediction models and the distance between the property prediction dataset and the larger unlabeled dataset in the representation space.
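
The abstract refers to descriptors that are correlated with the target property but does not say how that correlation is measured. As a rough illustration only, the sketch below assumes Pearson correlation and ranks candidate descriptors by its absolute value. The descriptor names (`desc_a`, `desc_b`, `desc_c`), the toy values, and the helper functions are hypothetical and are not taken from the paper.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no correlation signal
    return cov / math.sqrt(var_x * var_y)


def rank_descriptors(descriptors, target):
    """Return (name, correlation) pairs, strongest |correlation| first."""
    scores = {name: pearson(values, target) for name, values in descriptors.items()}
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


# Made-up descriptor columns and target values for four molecules (assumption).
descriptors = {
    "desc_a": [1.0, 2.0, 3.0, 4.0],
    "desc_b": [4.0, 1.0, 3.0, 2.0],
    "desc_c": [5.0, 5.0, 5.0, 5.0],
}
target = [2.0, 4.0, 6.0, 8.0]

ranking = rank_descriptors(descriptors, target)
for name, corr in ranking:
    print(f"{name}: {corr:+.2f}")
```

A ranking of this kind could be one way to decide which descriptors to incorporate and how many; the abstract itself does not state the selection criterion the authors used.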

Journal: Journal of Cheminformatics
Publication Date: Oct 14, 2022
Citations: 5
License type: open-access
