Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010

Berry De Bruijn, Xiaodan Zhu, Colin Cherry, Svetlana Kiritchenko, Joel Martin

doi:10.1136/amiajnl-2011-000150

Open Access

https://doi.org/10.1136/amiajnl-2011-000150

Abstract

Objective: As clinical text mining continues to mature, its potential as an enabling technology for innovations in patient care and clinical research is becoming a reality. A critical part of that process is rigid benchmark testing of natural language processing methods on realistic clinical narrative. In this paper, the authors describe the design and performance of three state-of-the-art text-mining applications from the National Research Council of Canada on evaluations within the 2010 i2b2 challenge.

Design: The three systems perform three key steps in clinical information extraction: (1) extraction of medical problems, tests, and treatments from discharge summaries and progress notes; (2) classification of assertions made on the medical problems; (3) classification of relations between medical concepts. Machine learning systems performed these tasks using large-dimensional bags of features, as derived from both the text itself and from external sources: UMLS, cTAKES, and Medline.

Measurements: Performance was measured per subtask, using micro-averaged F-scores, as calculated by comparing system annotations with ground-truth annotations on a test set.

Results: The systems ranked high among all submitted systems in the competition, with the following F-scores: concept extraction 0.8523 (ranked first); assertion detection 0.9362 (ranked first); relationship detection 0.7313 (ranked second).

Conclusion: For all tasks, we found that the introduction of a wide range of features was crucial to success. Importantly, our choice of machine learning algorithms allowed us to be versatile in our feature design, and to introduce a large number of features without overfitting and without encountering computing-resource bottlenecks.
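As an illustration of the micro-averaged F-score named as the evaluation measure, the following is a minimal sketch and not the challenge's official scorer. It assumes each annotation is a hypothetical `(document_id, start, end, label)` tuple and that a system annotation counts as correct only when it matches a ground-truth annotation exactly; the abstract does not state the matching criteria, so both are assumptions. Micro-averaging pools true positives, false positives, and false negatives over all classes before computing precision and recall.

```python
from collections import Counter


def micro_f_score(gold, system):
    """Micro-averaged F-score over sets of annotation tuples.

    Assumption: an annotation is (document_id, start, end, label) and
    only exact matches count as correct.
    """
    tp, fp, fn = Counter(), Counter(), Counter()

    # Count per-class true and false positives among system annotations.
    for ann in system:
        label = ann[-1]
        if ann in gold:
            tp[label] += 1
        else:
            fp[label] += 1

    # Count per-class false negatives: gold annotations the system missed.
    for ann in gold:
        if ann not in system:
            fn[ann[-1]] += 1

    # Micro-averaging: pool the counts over all classes first.
    total_tp = sum(tp.values())
    total_fp = sum(fp.values())
    total_fn = sum(fn.values())

    precision = total_tp / (total_tp + total_fp) if total_tp + total_fp else 0.0
    recall = total_tp / (total_tp + total_fn) if total_tp + total_fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data (invented for illustration only).
gold = {
    ("d1", 0, 2, "problem"),
    ("d1", 5, 6, "test"),
    ("d1", 9, 11, "treatment"),
    ("d2", 3, 4, "problem"),
}
system = {
    ("d1", 0, 2, "problem"),
    ("d1", 5, 6, "treatment"),   # right span, wrong label
    ("d1", 9, 11, "treatment"),
    ("d2", 7, 8, "test"),        # spurious annotation
}
print(micro_f_score(gold, system))
```

On the toy data, two of four system annotations are correct and two of four gold annotations are found, so pooled precision and recall are both 0.5.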

Journal: Journal of the American Medical Informatics Association
Publication Date: Sep 1, 2011
Citations: 227
License type: cc-by-nc-nd

Similar Papers

Application of Machine Learning Techniques in Clinical Information Extraction
Ruchi Patel ... Sanjay Tanwani
01 Jan 2019

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text
Özlem Uzuner ... Scott L Duvall
Journal of the American Medical Informatics Association | VOL. 18
16 Jun 2011

COVID-19 classification by CCSHNet with deep fusion using transfer learning and discriminant correlation analysis
Shui-Hua Wang ... Yu-Dong Zhang
Information Fusion | VOL. 68
13 Nov 2020

Enhancing clinical concept extraction with distributional semantics
Siddhartha Jonnalagadda ... Graciela Gonzalez
Journal of Biomedical Informatics | VOL. 45
07 Nov 2011
