Abstract

Detecting protected health information in electronic health record systems is often an early step in health care analytics, and it is a nontrivial problem. Specific challenges include finding clinician names and diseases, which lack a fixed format and are often context-dependent. The general problem of finding entities, termed named-entity recognition, has received a substantial amount of attention in the natural language processing and deep learning communities. This paper begins by outlining recent methods for finding protected health information, and it then introduces a hybrid system which combines regular expressions with a natural language processing framework called FLAIR. FLAIR is open-source, it includes state-of-the-art deep learning models, and it supports straightforward development of new models for language tasks including named-entity recognition. Finally, there is a discussion of how to apply the system to structured text in a database table as well as unstructured text in clinical notes.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.