Abstract

Recent years have brought big advances in the field of Data mining; this is due to the immense advances in technologies which facilitate the analysis of data. In particular, binary classification techniques are a subdomain of Data Mining which is used to classify data into two classes according to desired criteria. Different algorithms have been developed and they can be categorized into statistical and machine learning. Each category has its own limits. In this paper, a set of molecules is chosen to illustrate the proposed approach of binary classification. The considered molecules are described by means of size, shape, functionality and other expert descriptors. This paper defines new measure functions to quantify the similarities between the molecules and then combines them in a new approach which differs from actual algorithms by its reliability computations. Results of the proposed approach exceeded most common classification techniques with a f-measure exceeding 70% on this molecules Dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.