Abstract

Visually rich documents, such as forms, invoices, receipts, and ID cards, are ubiquitous in daily business and life. Such documents convey information through diverse cues, including text content, layout, font size, and text position, and combining these cues can improve information-extraction performance. However, previous works have not effectively exploited the interplay between these rich information sources: text detection and recognition are typically performed without semantic supervision (e.g., entity-name annotations), while text information extraction operates only on serialized plain text, ignoring rich visual information. This paper presents a method for extracting information from such documents that integrates textual, non-spatial, and spatial visual features. The method consists of two main steps and uses three deep neural networks. The first step, Text Reading, employs two CNN-based models, Lightweight DB and C-PREN, for OCR. They build on the state-of-the-art DB and PREN models with two improvements: removing the SE block of DB to reduce noise, and integrating both context and position features in PREN. The second step, Text Information Extraction, uses a relational graph convolutional network, RGCN, for named entity recognition. Experiments on a self-collected dataset and two public datasets demonstrate that our method improves the performance of the original models and outperforms other state-of-the-art methods.
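To make the second step more concrete, the sketch below shows a minimal relational graph convolution layer of the kind RGCN denotes, where each detected text segment is a graph node and edges carry spatial relation types. The layer dimensions, relation vocabulary, normalization scheme, and toy usage are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn as nn


class RGCNLayer(nn.Module):
    """Minimal relational graph convolution layer (sketch).

    Each node is one detected/recognized text segment; each edge carries a
    relation id (e.g. "left-of", "above"). Relation vocabulary, feature
    dimensions, and normalization are assumptions for illustration only.
    """

    def __init__(self, in_dim: int, out_dim: int, num_relations: int):
        super().__init__()
        # one weight matrix per relation type, plus a self-loop transform
        self.rel_weights = nn.Parameter(torch.empty(num_relations, in_dim, out_dim))
        self.self_loop = nn.Linear(in_dim, out_dim)
        nn.init.xavier_uniform_(self.rel_weights)

    def forward(self, x, edge_index, edge_type):
        # x: (N, in_dim) fused textual + visual features per segment
        # edge_index: (2, E) source/target node indices
        # edge_type: (E,) relation id of each edge
        src, dst = edge_index
        out = self.self_loop(x)

        # relation-specific transform of each incoming message
        msgs = torch.bmm(x[src].unsqueeze(1), self.rel_weights[edge_type]).squeeze(1)

        # normalize by in-degree (a simplification of per-relation normalization)
        deg = torch.zeros(x.size(0), dtype=x.dtype, device=x.device)
        deg.index_add_(0, dst, torch.ones(dst.size(0), dtype=x.dtype, device=x.device))
        msgs = msgs / deg[dst].clamp(min=1.0).unsqueeze(1)

        return torch.relu(out.index_add(0, dst, msgs))


# Toy usage: 4 segments, 64-d fused features, 3 hypothetical spatial relations
x = torch.randn(4, 64)
edge_index = torch.tensor([[0, 1, 2], [1, 2, 3]])
edge_type = torch.tensor([0, 1, 2])
h = RGCNLayer(64, 128, num_relations=3)(x, edge_index, edge_type)
print(h.shape)  # torch.Size([4, 128])
```

Node features produced this way would then feed a per-node classifier that assigns entity labels such as invoice number or total amount.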
