Comparison of Classification Algorithm and Language Model in Accounting Financial Transaction Record: A Natural Language Processing Approach

Bagas Adi Makayasa,Agung Fatwanto,Maria Ulfah Siregar,Bambang Sugiantoro

doi:10.18517/ijaseit.14.3.19179

Bagas Adi Makayasa, Agung Fatwanto + Show 2 more

Open Access

https://doi.org/10.18517/ijaseit.14.3.19179

Copy DOI

Abstract

The problem of financial recording not following the principles of accounting science has the potential to cause unnecessary problems. However, micro, small, and medium enterprises with their distinctive characteristics, though not all, still face many obstacles in writing financial reports. Even though there is already much financial software available, our study aims to investigate opportunities for implementing automation of accounting financial transaction records using the NLP approach, to interpret financial transactions based on text written on the transaction form into accounting journals (debits and credits). Experiments were carried out by comparing the performance of three classification algorithms, namely SVM, K-Nearest Neighbor, and Random Forest, with traditional (TF-IDF and BOW) and contextual (Word2Vec) Language Models. There are 200 financial transaction datasets consisting of ten classes. The data is divided into two parts, namely, the balance dataset and the imbalance dataset. The pair SVM and Word2Vec in the balanced dataset gave the highest accuracy (92.5%), precision (92.5%), recall/sensitivity (93.33%), and F1 score (92%). However, compared with the results of related semantic research (the average performance reaches 95%), the results obtained in this study are still lower. One point that may have a significant effect is the amount of data in the corpus, which is still lacking. Researchers suggest increasing the number of datasets and using a combination of other language models such as Glove, Bert etc. This study can also be used as a model for more complex financial transaction cases in future research.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison of Classification Algorithm and Language Model in Accounting Financial Transaction Record: A Natural Language Processing Approach

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology

Lead the way for us

Journal: International Journal on Advanced Science, Engineering and Information Technology	Publication Date: Jun 15, 2024
License type: CC BY 4.0

Similar Papers

Soil textural class modeling using digital soil mapping approaches: Effect of resampling strategies on imbalanced dataset predictions
Fereshteh Mirzaei ... Ruth Kerry
Geoderma Regional | VOL. 38
Fereshteh Mirzaei, et. al.Fereshteh Mirzaei ... Ruth Kerry
15 Jun 2024
Geoderma Regional | VOL. 38

Comparative Analysis of Machine Learning Models for Fitness Level Prediction with Imbalanced Dataset
Stephanie Chua ... Chia Inn Sii
-
Stephanie Chua, et. al.Stephanie Chua ... Chia Inn Sii
01 Dec 2022
01 Dec 2022

Predicting Spine Surgery Complications Using Machine Learning
Mohamad Hoda ... Philippe Phan
-
Mohamad Hoda, et. al.Mohamad Hoda ... Philippe Phan
01 Jul 2019
01 Jul 2019

Machine Learning for Malware Detection on Balanced and Imbalanced Datasets
Manish Goyal ... Raman Kumar
-
Manish Goyal, et. al.Manish Goyal ... Raman Kumar
08 Nov 2020
08 Nov 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of Classification Algorithm and Language Model in Accounting Financial Transaction Record: A Natural Language Processing Approach

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology