An ensemble approach for research article classification: a case study in artificial intelligence

Min Lu,Lie Tang,Xianke Zhou

doi:10.7717/peerj-cs.2521

Abstract

Text classification of research articles in emerging fields poses significant challenges due to their complex boundaries, interdisciplinary nature, and rapid evolution. Traditional methods, which rely on manually curated search terms and keyword matching, often lack recall due to the inherent incompleteness of keyword lists. In response to this limitation, this study introduces a deep learning-based ensemble approach that addresses the challenges of article classification in dynamic research areas, using the field of artificial intelligence (AI) as a case study. Our approach included using decision tree, sciBERT and regular expression matching on different fields of the articles, and a support vector machine (SVM) to merge the results from different models. We evaluated the effectiveness of our method on a manually labeled dataset, finding that our combined approach captured around 97% of AI-related articles in the web of science (WoS) corpus with a precision of 0.92. This presents a 0.15 increase in F1-score compared with existing search term based approach. Following this, we performed an ablation study to prove that each component in the ensemble model contributes to the overall performance, and that sciBERT outperforms other pre-trained BERT models in this case.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An ensemble approach for research article classification: a case study in artificial intelligence

Abstract

Talk to us

Similar Papers

More From: PeerJ Computer Science

Lead the way for us

Journal: PeerJ Computer Science	Publication Date: Dec 10, 2024
License type: CC BY 4.0

Similar Papers

BSEFNet: bidirectional self-attention edge fusion network salient object detection based on deep fusion of edge features
Gan Gao ... Rugang Wang
PeerJ Computer Science | VOL. 10
Gan Gao, et. al.Gan Gao ... Rugang Wang
10 Dec 2024
PeerJ Computer Science | VOL. 10

Label dependency modeling in Multi-Label Naïve Bayes through input space expansion
Pka Chitra ... Mhd Omar Al-Kadri
PeerJ Computer Science | VOL. 10
Pka Chitra, et. al.Pka Chitra ... Mhd Omar Al-Kadri
10 Dec 2024
PeerJ Computer Science | VOL. 10

An ensemble approach for research article classification: a case study in artificial intelligence
Min Lu ... Xianke Zhou
PeerJ Computer Science | VOL. 10
Min Lu, et. al.Min Lu ... Xianke Zhou
10 Dec 2024
PeerJ Computer Science | VOL. 10

ADHDP-based robust self-learning 3D trajectory tracking control for underactuated UUVs
Chunbo Zhao ... Deyi Gao
PeerJ Computer Science | VOL. 10
Chunbo Zhao, et. al.Chunbo Zhao ... Deyi Gao
10 Dec 2024
PeerJ Computer Science | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An ensemble approach for research article classification: a case study in artificial intelligence

Abstract

Talk to us

Similar Papers

More From: PeerJ Computer Science