Abstract

The National Health and Family Planning Commission requires medical institutions to use the International Classification of Diseases (ICD) codes. However, due to many commonly used words in clinical disease descriptions, the direct mapping matching rate between the diagnosis names entered in the electronic medical records and the ICD codes is low. In this paper, based on the actual diagnostic data on the regional health platform, a disease term map incorporating standard terms was constructed. Specifically, based on the rule algorithm based on the components of the disease, a data-enhanced BERT (bidirectional encoder representation from transformers) upper and lower relationship recognition algorithm is proposed. Synonymous upper and lower relationships identify diseases, and the hierarchical structure is further integrated. In addition, a task assignment based on the association map of disease departments is also proposed. Methods were used for manual verification, and finally, 94,478 disease entities formed a large-scale disease term map, including 1,460 synonymous relationships and 46,508 hyponymous relationships. Evaluation experiments show that, based on the disease term map and clinical diagnosis, the coverage rate of diagnostic data is 75.31% higher than direct mapping coding based on ICD. In addition, using the disease term map to code diseases automatically will shorten the coding time by about 59.75% compared with manual coding by doctors, and the correct rate is 85%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call