An Automated Text Document Classification Framework using BERT

Momna Ali Shah, Iftikhar Ahmed, Muhammad Javed Iqbal, Neelum Noreen

doi:10.14569/ijacsa.2023.0140332

Open Access

https://doi.org/10.14569/ijacsa.2023.0140332

Abstract

Due to the rapid advancement of technology, the volume of online text data from many disciplines is increasing significantly over time. More work is therefore needed to create systems that can effectively classify text data according to its content, facilitating processing and the extraction of crucial information. Traditional, non-automated approaches rely on manual feature extraction and classification, which requires choosing the most appropriate algorithm for each step and is error-prone and time-consuming. As a result, these approaches are typically resource intensive (computational, human, etc.) and are not a viable solution. To address the shortcomings of traditional approaches, we propose a novel text categorization strategy based on BERT, a well-known deep learning (DL) model. The proposed framework is trained and tested on benchmark text datasets: the UCI email dataset, which includes spam and non-spam emails, and the BBC News dataset, which includes multiple categories such as tech, sports, politics, business, and entertainment. The effectiveness of the proposed framework is evaluated using multiple metrics, namely Accuracy, Precision, and Recall. The system achieved a highest accuracy of 91.4% and can be used by different organizations to classify text-based data with high performance.
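
The abstract names Accuracy, Precision, and Recall as evaluation metrics but does not say how they are aggregated across classes. The following is a minimal sketch of how such metrics could be computed for a multi-class task like the BBC News categories, assuming macro-averaging (an assumption, not confirmed by the abstract). The function name `classification_report` and the toy labels are hypothetical and are not the paper's data or results.

```python
from collections import Counter


def classification_report(y_true, y_pred):
    """Return accuracy plus macro-averaged precision and recall.

    Macro-averaging is an assumption here; the abstract does not
    state which averaging scheme the authors used.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class was wrong
            fn[truth] += 1   # true class was missed

    accuracy = sum(tp.values()) / len(y_true)
    # A class with no predictions (or no true samples) contributes 0.0.
    precisions = [
        tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0 for c in labels
    ]
    recalls = [
        tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0 for c in labels
    ]
    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / len(labels),
        "recall": sum(recalls) / len(labels),
    }


# Toy, made-up labels using the category names listed in the abstract.
y_true = ["tech", "sports", "politics", "business", "entertainment", "tech"]
y_pred = ["tech", "sports", "business", "business", "entertainment", "sports"]
report = classification_report(y_true, y_pred)
print(report)
```

In practice the predicted labels would come from a fine-tuned BERT classifier; the metric computation is independent of the model that produced them.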

Journal: International Journal of Advanced Computer Science and Applications
Publication Date: Jan 1, 2023
Citations: 2
License type: cc-by
