The Application of Projection Word Embeddings on Medical Records Scoring System.

Chin Lin,Chia-Jung Hsu,Chia-Cheng Lee,Dung-Jang Tsai,Feng-Jen Wu,Yung-Tsai Lee,Wen-Hui Fang,Shing-An Lin

doi:10.3390/healthcare9101298

Abstract

Medical records scoring is important in a health care system. Artificial intelligence (AI) with projection word embeddings has been validated in its performance disease coding tasks, which maintain the vocabulary diversity of open internet databases and the medical terminology understanding of electronic health records (EHRs). We considered that an AI-enhanced system might be also applied to automatically score medical records. This study aimed to develop a series of deep learning models (DLMs) and validated their performance in medical records scoring task. We also analyzed the practical value of the best model. We used the admission medical records from the Tri-Services General Hospital during January 2016 to May 2020, which were scored by our visiting staffs with different levels from different departments. The medical records were scored ranged 0 to 10. All samples were divided into a training set (n = 74,959) and testing set (n = 152,730) based on time, which were used to train and validate the DLMs, respectively. The mean absolute error (MAE) was used to evaluate each DLM performance. In original AI medical record scoring, the predicted score by BERT architecture is closer to the actual reviewer score than the projection word embedding and LSTM architecture. The original MAE is 0.84 ± 0.27 using the BERT model, and the MAE is 1.00 ± 0.32 using the LSTM model. Linear mixed model can be used to improve the model performance, and the adjusted predicted score was closer compared to the original score. However, the project word embedding with the LSTM model (0.66 ± 0.39) provided better performance compared to BERT (0.70 ± 0.33) after linear mixed model enhancement (p < 0.001). In addition to comparing different architectures to score the medical records, this study further uses a mixed linear model to successfully adjust the AI medical record score to make it closer to the actual physician’s score.

Highlights

With the increasing advancement of technology, the data amount generated by humans is growing explosively [1]
In Artificial intelligence (AI) model training, the medical records were divided into the training set and testing set based on year, where 74,959 records were used to establish Bidirectional Encoder Representations from Transformers (BERT)
The AI system is already capable of accurate classification to level 3 ICD-10 coding, combined with results from previous studies

Summary

Introduction

With the increasing advancement of technology, the data amount generated by humans is growing explosively [1]. Taking advantage of these growing data may bring valuable information, which many successful cases from different industries [2] have already proved. The majority of these data are not structured [3], which cannot be directly used by traditional analytical methods. It is expected to employ new algorithms to use these data to allow for stronger decision-making capacity [4,5]. With the breakthrough developments of the deep neural network in diverse fields, we are already capable of directly analyzing data in the forms of videos, texts, and voices. The focus of researches is to develop applications to solve practical problems

Objectives

Methods

Results

Conclusion