Abstract

Rapid technological development has led to a growing number of research papers submitted to journals and conferences. Submitting an article can be difficult for authors, however, because submission systems cover a wide range of subjects; the Association for Computing Machinery, for example, lists 2,000 subjects. The difficulty lies in having to categorize the manuscript accurately into the appropriate subject area before submission. To address this issue, this article proposes an automatic solution that extracts information from scientific papers and categorizes them into relevant topics. The proposed approach combines pre-processing, extraction, and vectorization with classification using three machine learning methods: support vector machines, Naïve Bayes, and decision trees. Experiments conducted on a dataset of articles published in the Tra Vinh University Journal of Science show promising results. Support vector machines, in particular, achieved an accuracy of over 75%, demonstrating their potential as a basis for an automatic classification system for scientific papers.
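
The pipeline described above (pre-processing, vectorization, classification) can be illustrated with a minimal sketch. It is not the authors' implementation: of the three methods named, it uses Naïve Bayes only because it fits in a few lines of standard-library Python, and the `tokenize` helper, the `NaiveBayesClassifier` class, the two topic labels, and the four training sentences are all hypothetical, not taken from the Tra Vinh University Journal of Science dataset.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Pre-processing (assumed): lowercase and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesClassifier:
    """Multinomial Naive Bayes over word counts with add-one smoothing."""

    def fit(self, documents, labels):
        self.class_counts = Counter(labels)      # documents per topic
        self.word_counts = defaultdict(Counter)  # topic -> word frequencies
        self.vocabulary = set()
        for text, label in zip(documents, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, text):
        # Words never seen in training carry no information, so skip them.
        tokens = [t for t in tokenize(text) if t in self.vocabulary]
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(doc_count / self.total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data, invented for illustration only.
train_docs = [
    "rice yield improves with organic fertilizer in the delta",
    "shrimp farming and water quality in coastal ponds",
    "neural network model for image recognition",
    "database indexing algorithm for fast query processing",
]
train_labels = ["agriculture", "agriculture", "computing", "computing"]

classifier = NaiveBayesClassifier().fit(train_docs, train_labels)
print(classifier.predict("a fast algorithm for neural network training"))
print(classifier.predict("fertilizer and water for rice ponds"))
```

A realistic system would train on full abstracts or article bodies, would likely weight terms (for example with TF-IDF) before classification, and would compare all three classifiers on held-out data, as the experiments summarized above do.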
