Generalized Zero-Shot Text Classification for ICD Coding

Congzheng Song, Eric Xing, Shanghang Zhang, Najmeh Sadoughi, Pengtao Xie

doi:10.24963/ijcai.2020/556

Abstract

The International Classification of Diseases (ICD) is a list of classification codes for diagnoses. Automatic ICD coding is a multi-label text classification problem with noisy clinical document inputs and a long-tailed label distribution, which makes it difficult to perform fine-grained classification on both frequent and zero-shot codes at the same time, i.e., generalized zero-shot ICD coding. In this paper, we propose a latent feature generation framework that improves prediction on unseen codes without compromising performance on seen codes. Our framework generates semantically meaningful features for zero-shot codes by exploiting the hierarchical structure of ICD codes and by reconstructing the code-relevant keywords with a novel cycle architecture. To the best of our knowledge, this is the first adversarial generative model for generalized zero-shot learning on multi-label text classification. Extensive experiments demonstrate the effectiveness of our approach. On the public MIMIC-III dataset, our methods improve the F1 score on zero-shot codes from nearly 0 to 20.91% and increase the AUC score by 3% (absolute improvement) over the previous state of the art. Code is available at https://github.com/csong27/gzsl_text.
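To make the generalized zero-shot setting concrete, the sketch below scores multi-label predictions separately on seen and zero-shot codes. It is an illustration only, not the authors' evaluation code: the function name `micro_f1`, the placeholder code names, and the toy data are all hypothetical, and micro-averaging is an assumption, since the abstract does not say how F1 is averaged.

```python
from typing import List, Set


def micro_f1(gold: List[Set[str]], pred: List[Set[str]], codes: Set[str]) -> float:
    """Micro-averaged F1 restricted to a subset of codes (hypothetical helper)."""
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        # Keep only the codes that belong to the subset being evaluated.
        g, p = g & codes, p & codes
        tp += len(g & p)   # codes predicted and present
        fp += len(p - g)   # codes predicted but absent
        fn += len(g - p)   # codes present but missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Placeholder code names, not real ICD codes.
seen_codes = {"seen_a", "seen_b"}      # codes with training examples
zero_shot_codes = {"unseen_x"}         # codes never observed in training

# Toy gold labels and predictions for three documents (made up for illustration).
gold = [{"seen_a", "unseen_x"}, {"seen_b"}, {"seen_a", "seen_b"}]
pred = [{"seen_a"}, {"seen_b", "unseen_x"}, {"seen_a"}]

seen_f1 = micro_f1(gold, pred, seen_codes)
zero_shot_f1 = micro_f1(gold, pred, zero_shot_codes)
print(f"seen F1: {seen_f1:.3f}, zero-shot F1: {zero_shot_f1:.3f}")
```

In this toy case the classifier does well on seen codes but never gets the unseen code right, which mirrors the near-zero zero-shot F1 that the abstract attributes to prior methods.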
