SrBERT: automatic article classification model for systematic review using BERT

Sungmin Aum,Seon Choe

doi:10.1186/s13643-021-01763-w

Abstract

BackgroundSystematic reviews (SRs) are recognized as reliable evidence, which enables evidence-based medicine to be applied to clinical practice. However, owing to the significant efforts required for an SR, its creation is time-consuming, which often leads to out-of-date results. To support SR tasks, tools for automating these SR tasks have been considered; however, applying a general natural language processing model to domain-specific articles and insufficient text data for training poses challenges.MethodsThe research objective is to automate the classification of included articles using the Bidirectional Encoder Representations from Transformers (BERT) algorithm. In particular, srBERT models based on the BERT algorithm are pre-trained using abstracts of articles from two types of datasets, and the resulting model is then fine-tuned using the article titles. The performances of our proposed models are compared with those of existing general machine-learning models.ResultsOur results indicate that the proposed srBERTmy model, pre-trained with abstracts of articles and a generated vocabulary, achieved state-of-the-art performance in both classification and relation-extraction tasks; for the first task, it achieved an accuracy of 94.35% (89.38%), F1 score of 66.12 (78.64), and area under the receiver operating characteristic curve of 0.77 (0.9) on the original and (generated) datasets, respectively. In the second task, the model achieved an accuracy of 93.5% with a loss of 27%, thereby outperforming the other evaluated models, including the original BERT model.ConclusionsOur research shows the possibility of automatic article classification using machine-learning approaches to support SR tasks and its broad applicability. However, because the performance of our model depends on the size and class ratio of the training dataset, it is important to secure a dataset of sufficient quality, which may pose challenges.

Highlights

Systematic reviews (SRs) are recognized as reliable evidence, which enables evidence-based medicine to be applied to clinical practice
Fine‐tuning model hyperparameters The proposed srBERT was pre-trained using the Google Cloud Platform, which is typically used for large-scale experiments that need to be run on Tensor Processing Units (TPUs)
Among the tasks where automation tools could be supported for systematic review (SR) creation, we focused on the appraisal stage for srBERTmy250K srBERTmix Original Bidirectional Encoder Representations from Transformers (BERT) K-neighbors Support vector classification (SVC) DecisionTree RandomForest Adaboost MultinomialNB

Summary

Introduction

Systematic reviews (SRs) are recognized as reliable evidence, which enables evidence-based medicine to be applied to clinical practice. A systematic review (SR) is a literature review that involves evaluating the quality of previous research and reporting comprehensive results from all suitable works on a topic [1]. It is an efficient and reliable approach that. SRs involve robust analyses, which require significant time and effort; these requirements prevent the application of up-to-date results in clinical practice. Tools to automate parts of the SR process have been suggested based on the recent advances in natural language processing (NLP).

Methods

Results

Discussion

Conclusion