Comparative Analysis of Different Data Representations for the Task of Chemical Compound Extraction

Basel Alshaikhdeeb, Kamsuriah Ahmad

doi:10.18517/ijaseit.8.5.6432

Abstract

Chemical compound extraction refers to the task of recognizing chemical instances such as oxygen, nitrogen, and others. Most studies that have addressed this task used machine-learning techniques. The key challenge in using machine-learning techniques lies in employing a robust set of features. Indeed, the literature shows that numerous types of features have been used for chemical compound extraction. The dimensionality of such features is determined by the data representation. Some researchers have used N-gram representation for biomedical named entity recognition, where the most significant terms are represented as features. Others have used detailed-attribute representation, in which the features are generalized. As a result, identifying the best combination of features to yield high-accuracy classification becomes challenging. This paper applies the Wrapper Subset Selection approach using two data representations: N-gram and detailed-attribute. Since each data representation suits a specific classification algorithm, two classifiers were used: Naïve Bayes (for detailed-attribute) and Support Vector Machine (for N-gram). The results show that feature selection using the detailed-attribute representation outperformed that using the N-gram representation, achieving an F-measure of 0.722. In addition to the higher classification accuracy, the features selected using the detailed-attribute representation are more meaningful and can be applied to further datasets.
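
To make the idea of wrapper-based feature selection over a detailed-attribute representation concrete, the following is a minimal sketch only. It assumes a greedy forward search (the abstract does not say which search strategy was used), a tiny hand-written Bernoulli Naive Bayes, leave-one-out scoring by F-measure, and a toy token list. The attribute names, tokens, and labels are hypothetical and are not the paper's feature set, corpus, or experimental setup.

```python
import math
from collections import Counter

def detailed_attributes(token):
    """Hypothetical generalized attributes of a token (illustrative only)."""
    return {
        "has_digit": any(c.isdigit() for c in token),
        "has_hyphen": "-" in token,
        "has_inner_upper": any(c.isupper() for c in token[1:]),
        "ends_with_ol_or_ene": token.lower().endswith(("ol", "ene")),
    }

def f_measure(gold, pred):
    """Balanced F-measure for boolean labels (True = chemical)."""
    tp = sum(1 for g, p in zip(gold, pred) if g and p)
    fp = sum(1 for g, p in zip(gold, pred) if not g and p)
    fn = sum(1 for g, p in zip(gold, pred) if g and not p)
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def train_nb(rows, labels, features):
    """Count, per class, how often each selected boolean feature is True."""
    class_counts = Counter(labels)
    true_counts = {c: Counter() for c in class_counts}
    for row, y in zip(rows, labels):
        for f in features:
            if row[f]:
                true_counts[y][f] += 1
    return class_counts, true_counts

def predict_nb(model, row, features):
    """Bernoulli Naive Bayes prediction with add-one smoothing."""
    class_counts, true_counts = model
    total = sum(class_counts.values())
    best_label, best_lp = None, -math.inf
    for c, n in class_counts.items():
        lp = math.log(n / total)
        for f in features:
            p_true = (true_counts[c][f] + 1) / (n + 2)
            lp += math.log(p_true if row[f] else 1 - p_true)
        if lp > best_lp:
            best_label, best_lp = c, lp
    return best_label

def loo_score(rows, labels, features):
    """Leave-one-out F-measure of Naive Bayes on the given feature subset."""
    preds = []
    for i in range(len(rows)):
        model = train_nb(rows[:i] + rows[i + 1:], labels[:i] + labels[i + 1:], features)
        preds.append(predict_nb(model, rows[i], features))
    return f_measure(labels, preds)

def wrapper_forward_selection(feature_names, score):
    """Greedily add the feature that most improves the classifier's score."""
    selected, best = [], 0.0
    while True:
        scored = [(score(selected + [f]), f) for f in feature_names if f not in selected]
        if not scored:
            break
        top_score, top_feature = max(scored)
        if top_score <= best:  # stop when no candidate improves the score
            break
        selected.append(top_feature)
        best = top_score
    return selected, best

# Toy data (hypothetical): tokens with chemical / non-chemical labels.
tokens = ["2-propanol", "NaCl", "H2O", "benzene", "ethanol",
          "the", "protein", "was", "Table", "and"]
labels = [True, True, True, True, True, False, False, False, False, False]
rows = [detailed_attributes(t) for t in tokens]
names = list(rows[0])

selected, best = wrapper_forward_selection(
    names, lambda feats: loo_score(rows, labels, feats)
)
print("selected attributes:", selected, "F-measure:", round(best, 3))
```

The same `wrapper_forward_selection` function could in principle be reused with an N-gram representation by passing a different scoring function, since the wrapper approach only needs a way to score a candidate feature subset with the chosen classifier.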


Journal: International Journal on Advanced Science, Engineering and Information Technology
Publication Date: Oct 31, 2018
License type: cc-by-sa

Similar Papers

Feature selection for chemical compound extraction using wrapper approach with Naive Bayes classifier
Basel Alshaikhdeeb ... Kamsuriah Ahmad
01 Nov 2017

Machine Learning Techniques for Modelling Short Term Land-Use Change
Mileva Samardžić-Petrović ... Branislav Bajat
ISPRS International Journal of Geo-Information | VOL. 6
29 Nov 2017

An efficient Analysis based on the Internet of Things, SVM and KNN for Operative Diabetic Retinopathy Classification
... Raenu Kolandaisamy
Journal of Intelligent Systems and Internet of Things | VOL. 13
01 Jan 2024

Tutorial I: Neural networks and support vector machines
C Chandra Sekhar ... C N Rao
01 Dec 2011
