Abstract

Accurate clinical coding using the International Classification of Diseases (ICD) standard is essential for healthcare analytics. ICD-11 introduces new coding guidelines and cluster structures, posing challenges for existing coding tools. This research presents an automated approach to generate valid ICD-11 cluster codes from medical text. Natural language records are represented as vectors and compared to an ICD-11 corpus using cosine similarity. A bidirectional matching technique then refines similarity estimation. Experiments demonstrate the method yields up to 0.91 F1 score in coding accuracy, significantly outperforming a baseline tool. This work enables efficient high-quality ICD-11 coding to support healthcare informatics.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call