Automatic ICD-10 multi-class classification of cause of death from plaintext autopsy reports through expert-driven feature selection.

Ghulam Mujtaba,Khairunisa Shaikh,Ram Gopal Raj,Retnagowri Rajandram,Liyana Shuib,Mohammed Ali Al-Garadi

doi:10.1371/journal.pone.0170242

Ghulam Mujtaba, Khairunisa Shaikh + Show 4 more

Open Access

PDF Available

https://doi.org/10.1371/journal.pone.0170242

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

ObjectivesWidespread implementation of electronic databases has improved the accessibility of plaintext clinical information for supplementary use. Numerous machine learning techniques, such as supervised machine learning approaches or ontology-based approaches, have been employed to obtain useful information from plaintext clinical data. This study proposes an automatic multi-class classification system to predict accident-related causes of death from plaintext autopsy reports through expert-driven feature selection with supervised automatic text classification decision models.MethodsAccident-related autopsy reports were obtained from one of the largest hospital in Kuala Lumpur. These reports belong to nine different accident-related causes of death. Master feature vector was prepared by extracting features from the collected autopsy reports by using unigram with lexical categorization. This master feature vector was used to detect cause of death [according to internal classification of disease version 10 (ICD-10) classification system] through five automated feature selection schemes, proposed expert-driven approach, five subset sizes of features, and five machine learning classifiers. Model performance was evaluated using precisionM, recallM, F-measureM, accuracy, and area under ROC curve. Four baselines were used to compare the results with the proposed system.ResultsRandom forest and J48 decision models parameterized using expert-driven feature selection yielded the highest evaluation measure approaching (85% to 90%) for most metrics by using a feature subset size of 30. The proposed system also showed approximately 14% to 16% improvement in the overall accuracy compared with the existing techniques and four baselines.ConclusionThe proposed system is feasible and practical to use for automatic classification of ICD-10-related cause of death from autopsy reports. The proposed system assists pathologists to accurately and rapidly determine underlying cause of death based on autopsy findings. Furthermore, the proposed expert-driven feature selection approach and the findings are generally applicable to other kinds of plaintext clinical reports.

Highlights

Autopsy or postmortem examination provides useful contribution to health-related education and improves the quality of the healthcare industry [1,2]
J48 classifier outperformed in all automated feature selection schemes excluding Fisher Markov Selector (FMS) by producing the highest accuracy of 75.94%, followed by support vector machine (SVM) (73.15%, with feature subset size 30) and random forest (RF) (73.05%, with feature subset size 40)
The lowest performance was observed in Naive Bayes (NB) classifier which produced 67.31% accuracy in the “all” feature subset size, followed by k-nearest neighbor (KNN) (70.36%, with feature subset sizes of 20 and 10)

Summary

Introduction

Autopsy or postmortem examination provides useful contribution to health-related education and improves the quality of the healthcare industry [1,2]. The collected autopsy findings are correlated with medical history, premortem and postmortem laboratory studies, microscopic findings of tissues, toxicology, and other related medical procedures and documents to determine and assign the cause of death according to the World Health Organization internal classification of disease version 10 (ICD10) coding standard [4]. Clinical autopsy is performed to discover the medical cause of death. Forensic autopsy is performed to discover the cause of death in criminal matter [5]. An interview is conducted from the relatives or witnesses of the deceased person to discover the cause of death. This method is common in low economical countries, where health facilities are insufficient [6]

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Feb 6, 2017
Citations: 40	License type: CC BY 4.0

R Discovery Prime

Automatic ICD-10 multi-class classification of cause of death from plaintext autopsy reports through expert-driven feature selection.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach.
Danila Azzolina ... Silvia Bressan
JMIR Public Health and Surveillance | VOL. 9
Danila Azzolina, et. al.Danila Azzolina ... Silvia Bressan
12 Jul 2023
JMIR Public Health and Surveillance | VOL. 9

The death of Stalin - was it a natural death or poisoning?

Surgical Neurology International | VOL. 6

01 Jan 2015
Surgical Neurology International | VOL. 6

542 Current Practices Regarding the Investigation of Sudden Unexpected Death in the Young (SUDY) in Canada
L.J Dewar ... S Sanatani
Canadian Journal of Cardiology | VOL. 28
L.J Dewar, et. al.L.J Dewar ... S Sanatani
01 Sep 2012
Canadian Journal of Cardiology | VOL. 28

Autopsy and Malpractice Considerations
Jerri L Mclemore ... A Julian Garvin
Anesthesiology | VOL. 115
Jerri L Mclemore, et. al.Jerri L Mclemore ... A Julian Garvin
01 Oct 2011
Anesthesiology | VOL. 115

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Automatic ICD-10 multi-class classification of cause of death from plaintext autopsy reports through expert-driven feature selection.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLOS ONE