Improving information extraction from visually rich documents using visual span representations

Ritesh Sarkhel,Arnab Nandi

doi:10.14778/3446095.3446104

Abstract

Along with textual content, visual features play an essential role in the semantics of visually rich documents. Information extraction (IE) tasks perform poorly on these documents if these visual cues are not taken into account. In this paper, we present Artemis - a visually aware, machine-learning-based IE method for heterogeneous visually rich documents. Artemis represents a visual span in a document by jointly encoding its visual and textual context for IE tasks. Our main contribution is two-fold. First, we develop a deep-learning model that identifies the local context boundary of a visual span with minimal human-labeling. Second, we describe a deep neural network that encodes the multimodal context of a visual span into a fixed-length vector by taking its textual and layout-specific features into account. It identifies the visual span(s) containing a named entity by leveraging this learned representation followed by an inference task. We evaluate Artemis on four heterogeneous datasets from different domains over a suite of information extraction tasks. Results show that it outperforms state-of-the-art text-based methods by up to 17 points in F1-score.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving information extraction from visually rich documents using visual span representations

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment

Lead the way for us

Journal: Proceedings of the VLDB Endowment	Publication Date: Jan 1, 2021
Citations: 5

Similar Papers

Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.
Mohammed Alawad ... J Blair Christian
Journal of the American Medical Informatics Association | VOL. 27
Mohammed Alawad, et. al.Mohammed Alawad ... J Blair Christian
09 Nov 2019
Journal of the American Medical Informatics Association | VOL. 27

A Review of Open Information Extraction Techniques
Sally Ali ... M Hussien
IJCI. International Journal of Computers and Information | VOL. 6
Sally Ali, et. al.Sally Ali ... M Hussien
01 Jan 2019
IJCI. International Journal of Computers and Information | VOL. 6

Plumber: A Modular Framework to Create Information Extraction Pipelines
Mohamad Yaser Jaradeh ... Sören Auer
-
Mohamad Yaser Jaradeh, et. al.Mohamad Yaser Jaradeh ... Sören Auer
19 Apr 2021
19 Apr 2021

Incorporating information extraction in the relational database model
Yoav Nahshon ... Stijn Vansummeren
-
Yoav Nahshon, et. al.Yoav Nahshon ... Stijn Vansummeren
26 Jun 2016
26 Jun 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving information extraction from visually rich documents using visual span representations

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment