An OCR Post-Correction Approach Using Deep Learning for Processing Medical Reports

Srinidhi Karthikeyan,Alba G Seco De Herrera,Faiyaz Doctor,Asim Mirza

doi:10.1109/tcsvt.2021.3087641

Abstract

According to a recent Deloitte study, the COVID-19 pandemic continues to place a huge strain on the global health care sector. Covid-19 has also catalysed digital transformation across the sector for improving operational efficiencies. As a result, the amount of digitally stored patient data such as discharge letters, scan images, test results or free text entries by doctors has grown significantly. In 2020, 2314 exabytes of medical data was generated globally. This medical data does not conform to a generic structure and is mostly in the form of unstructured digitally generated or scanned paper documents stored as part of a patient’s medical reports. This unstructured data is digitised using Optical Character Recognition (OCR) process. A key challenge here is that the accuracy of the OCR process varies due to the inability of current OCR engines to correctly transcribe scanned or handwritten documents in which text may be skewed, obscured or illegible. This is compounded by the fact that processed text is comprised of specific medical terminologies that do not necessarily form part of general language lexicons. The proposed work uses a deep neural network based self-supervised pre-training technique: Robustly Optimized Bidirectional Encoder Representations from Transformers (RoBERTa) that can learn to predict hidden (masked) sections of text to fill in the gaps of non-transcribable parts of the documents being processed. Evaluating the proposed method on domain-specific datasets which include real medical documents, shows a significantly reduced word error rate demonstrating the effectiveness of the approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An OCR Post-Correction Approach Using Deep Learning for Processing Medical Reports

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: May 1, 2022
Citations: 16

Similar Papers

Probabilistic management of OCR data using an RDBMS
Arun Kumar ... Christopher Ré
Proceedings of the VLDB Endowment | VOL. 5
Arun Kumar, et. al.Arun Kumar ... Christopher Ré
01 Dec 2011
Proceedings of the VLDB Endowment | VOL. 5

Facsimile‐to‐speech systems
Allen E Milewski
The Journal of the Acoustical Society of America | VOL. 94
Allen E MilewskiAllen E Milewski
01 Sep 1993
The Journal of the Acoustical Society of America | VOL. 94

An Overview of Neural OCR Networks
Herbert F Schantz
Journal of Information Systems Management | VOL. 8
Herbert F SchantzHerbert F Schantz
01 Jan 1991
Journal of Information Systems Management | VOL. 8

A survey of text detection and recognition algorithms based on deep learning technology
Xiao-Feng Wang ... Zhi-Ze Wu
Neurocomputing | VOL. 556
Xiao-Feng Wang, et. al.Xiao-Feng Wang ... Zhi-Ze Wu
18 Aug 2023
Neurocomputing | VOL. 556

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An OCR Post-Correction Approach Using Deep Learning for Processing Medical Reports

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology