Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning

Maged Nasser,Idris Rabiu,Hentabli Hamza,Muaadh A Alsoufi,Naomie Salim,Shadi Basurra,Faisal Saeed

doi:10.3390/biom12040508

Abstract

The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate the similarity of molecules towards a target query. However, these descriptors include redundant and irrelevant features that might impact the performance of similarity searching methods. Thus, this study proposed a new approach for identifying the important features of molecules in chemical datasets based on the representation of the molecular features using Autoencoder (AE), with the aim of removing irrelevant and redundant features. The proposed approach experimented using the MDL Data Drug Report standard dataset (MDDR). Based on experimental findings, the proposed approach performed better than several existing benchmark similarity methods such as Tanimoto Similarity Method (TAN), Adapted Similarity Measure of Text Processing (ASMTP), and Quantum-Based Similarity Method (SQB). The results demonstrated that the performance achieved by the proposed approach has proven to be superior, particularly with the use of structurally heterogeneous datasets, where it yielded improved results compared to other previously used methods with the similar goal of improving molecular similarity searching.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biomolecules	Publication Date: Mar 27, 2022
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning

Abstract

Talk to us

Similar Papers

More From: Biomolecules

Lead the way for us

Similar Papers

A binary Krill Herd approach based feature selection for high dimensional data
V Preeja ... A H Shahana
-
V Preeja, et. al.V Preeja ... A H Shahana
01 Aug 2016
01 Aug 2016

SVM for network anomaly detection using ACO feature subset
Tahir Mehmood ... Helmi B Md Rais
-
Tahir Mehmood, et. al.Tahir Mehmood ... Helmi B Md Rais
01 May 2015
01 May 2015

Feature subset selection for irrelevant data removal using Decision Tree Algorithm
D Preetha Evangeline ... P Anandhakumar
-
D Preetha Evangeline, et. al.D Preetha Evangeline ... P Anandhakumar
01 Dec 2013
01 Dec 2013

Mutual Information-based Feature Selection Approach to Reduce High Dimension of Big Data
Thee Zin Win ... Nang Saing Moon Kham
-
Thee Zin Win, et. al.Thee Zin Win ... Nang Saing Moon Kham
28 Sep 2018
28 Sep 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature Reduction for Molecular Similarity Searching Based on Autoencoder Deep Learning

Abstract

Talk to us

Similar Papers

More From: Biomolecules