Using natural language processing to extract mammographic findings

Hongyuan Gao,Erin J Aiello Bowles,David Carrell,Diana S.M Buist

doi:10.1016/j.jbi.2015.01.010

Hongyuan Gao, Erin J Aiello Bowles + Show 2 more

Open Access

https://doi.org/10.1016/j.jbi.2015.01.010

Copy DOI

Abstract

ObjectiveStructured data on mammographic findings are difficult to obtain without manual review. We developed and evaluated a rule-based natural language processing (NLP) system to extract mammographic findings from free-text mammography reports. Materials and MethodsThe NLP system extracted four mammographic findings: mass, calcification, asymmetry, and architectural distortion, using a dictionary look-up method on 93,705 mammography reports from Group Health. Status annotations and anatomical location annotation were associated to each NLP detected finding through association rules. After excluding negated, uncertain, and historical findings, affirmative mentions of detected findings were summarized. Confidence flags were developed to denote reports with highly confident NLP results and reports with possible NLP errors. A random sample of 100 reports was manually abstracted to evaluate the accuracy of the system. ResultsThe NLP system correctly coded 96–99 out of our sample of 100 reports depending on findings. Measures of sensitivity, specificity and negative predictive values exceeded 0.92 for all findings. Positive predictive values were relatively low for some findings due to their low prevalence. DiscussionOur NLP system was implemented entirely in SAS Base, which makes it portable and easy to implement. It performed reasonably well with multiple applications, such as using confidence flags as a filter to improve the efficiency of manual review. Refinements of library and association rules, and testing on more diverse samples may further improve its performance. ConclusionOur NLP system successfully extracts clinically useful information from mammography reports. Moreover, SAS is a feasible platform for implementing NLP algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Biomedical Informatics	Publication Date: Feb 3, 2015
Citations: 31	License type: elsevier-specific: oa user license

R Discovery Prime

R Discovery Prime

Using natural language processing to extract mammographic findings

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics

Lead the way for us

Similar Papers

Validation of natural language processing to extract breast cancer pathology procedures and results
Arika E Wieneke ... Diana S.M Buist
Journal of Pathology Informatics | VOL. 6
Arika E Wieneke, et. al.Arika E Wieneke ... Diana S.M Buist
01 Jan 2015
Journal of Pathology Informatics | VOL. 6

Abstract 10374: Ankle and Toe-Brachial Index for Peripheral Artery Disease Identification: Unlocking Clinical Data Through Novel Methods
Julia Friberg ... Carrie Franciscus
Circulation | VOL. 144
Julia Friberg, et. al.Julia Friberg ... Carrie Franciscus
16 Nov 2021
Circulation | VOL. 144

Facilitating cancer research using natural language processing of pathology reports.
Kristin Anderson ... Victor R Grann
Studies in health technology and informatics | VOL. 107
Kristin Anderson, et. al.Kristin Anderson ... Victor R Grann
25 Jun 2015
Studies in health technology and informatics | VOL. 107

Evaluation of Natural Language Processing (NLP) systems to annotate drug product labeling with MedDRA terminology
Thomas Ly ... Robert Ball
Journal of Biomedical Informatics | VOL. 83
Thomas Ly, et. al.Thomas Ly ... Robert Ball
01 Jun 2018
Journal of Biomedical Informatics | VOL. 83

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using natural language processing to extract mammographic findings

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics