Abstract

Background
Clinical notes are unstructured text documents generated by clinicians during patient encounters. They are generally annotated with International Classification of Diseases (ICD) codes, which provide formatted information about the diagnosis and treatment. ICD codes have shown their potential in many fields, but manual coding is labor-intensive and error-prone, which has led to research on automatic coding. Two specific challenges of this task are: (1) given an annotated clinical note, the reasons behind specific diagnoses and treatments are implicit; (2) explainability is important for a practical automatic coding method, which should not only explain its prediction output but also have explainable internal mechanics. This study aims to develop an explainable CNN approach that addresses these two challenges.

Method
Our key idea is that, for the automatic ICD coding task, the presence of informative snippets in the clinical text that correlate with each code plays an important role in the prediction of codes, and an informative snippet can be considered a local and low-level feature. We infer that there exists a correspondence between a convolution filter and a local and low-level feature. Based on this inference, we propose the Shallow and Wide Attention convolutional Mechanism (SWAM) to improve the ability of CNN-based models to learn local and low-level features for each label.

Results
We evaluate our approach on MIMIC-III, an open-access dataset of ICU medical records. Our approach substantially outperforms previous results on top-50 medical code prediction on the MIMIC-III dataset: the precision of the worst-performing 10% of labels in previous work is increased from 0% to 53% on average. We attribute this improvement to SWAM, whose wide architecture with an attention mechanism gives the model the ability to learn the unique features of different codes more extensively, and we verify this with an ablation experiment. In addition, we perform a manual analysis of the performance imbalance between different codes and draw preliminary conclusions about the characteristics that determine the difficulty of learning specific codes.

Conclusions
Our main contributions can be summarized as follows: (1) we show that local and low-level features, i.e., informative snippets, play an important role in the automatic ICD coding task, and that the informative snippets extracted from the clinical text provide explanations for each code; (2) we propose that there exists a correspondence between a convolution filter and a local and low-level feature, and that a combination of a wide and shallow convolutional layer and an attention layer can help CNN-based models better learn local and low-level features; (3) we improve the precision of the worst-performing 10% of labels from 0% to 53% on average.
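To make the Method concrete, below is a minimal PyTorch sketch of a shallow-and-wide convolutional layer combined with per-label attention, in the spirit of SWAM. All class and variable names (SWAMSketch, num_filters, etc.) and layer sizes are illustrative assumptions, not the authors' released implementation.

```python
# A minimal sketch, assuming a shallow (single) Conv1d layer made "wide"
# via a large filter count, followed by per-label attention over positions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SWAMSketch(nn.Module):
    def __init__(self, vocab_size, embed_dim=100, num_filters=500,
                 kernel_size=4, num_labels=50):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # One shallow convolutional layer; a large num_filters makes it
        # wide, so individual filters can specialize in local, low-level
        # features (informative snippet patterns).
        self.conv = nn.Conv1d(embed_dim, num_filters, kernel_size,
                              padding=kernel_size // 2)
        # Per-label attention: one query vector per ICD code.
        self.attn = nn.Linear(num_filters, num_labels)
        # Per-label output weights for the final binary decisions.
        self.out = nn.Linear(num_filters, num_labels)

    def forward(self, tokens):                          # tokens: (B, T)
        x = self.embed(tokens).transpose(1, 2)          # (B, E, T)
        h = torch.tanh(self.conv(x)).transpose(1, 2)    # (B, T', F)
        # Attention over positions, computed separately for each label:
        # alpha[b, l, t] weights how much position t matters for code l.
        alpha = F.softmax(self.attn(h).transpose(1, 2), dim=2)  # (B, L, T')
        m = alpha @ h                                   # (B, L, F) label-specific vectors
        logits = (self.out.weight * m).sum(dim=2) + self.out.bias
        return torch.sigmoid(logits), alpha
```

The returned attention weights alpha can be mapped back to token positions, which is one way the informative snippets behind each predicted code could be surfaced as explanations.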

Highlights

  • Clinical notes are unstructured text documents generated by clinicians during patient encounters; they are generally annotated with International Classification of Diseases (ICD) codes, which provide formatted information about the diagnosis and treatment

  • We attribute this improvement to the Shallow and Wide Attention convolutional Mechanism (SWAM), whose wide architecture with an attention mechanism gives the model the ability to learn the unique features of different codes more extensively; we verify this with an ablation experiment

  • Our main contributions can be summarized as follows: (1) we show that local and low-level features, i.e., informative snippets, play an important role in the automatic ICD coding task, and that the informative snippets extracted from the clinical text provide explanations for each code



Introduction

Clinical notes are unstructured text documents generated by clinicians during patient encounters. They are generally annotated with International Classification of Diseases (ICD) codes, which provide formatted information about the diagnosis and treatment. Automatic ICD coding poses two specific challenges. First, given an annotated clinical note, the reasons behind specific diagnoses and treatments are implicit: the connections between a code and its corresponding informative snippets are lost in the annotated text, so the model has to learn the reasons behind specific diagnoses and treatments. Second, explainability is a crucial obstacle for practical automatic coding, from the perspective of both inference and internal mechanics: the method should explain its prediction output as well as have explainable internal mechanics


