LORE++: Logical location regression network for table structure recognition with pre-training

Rujiao Long,Hangdi Xing,Zhibo Yang,Qi Zheng,Zhi Yu,Fei Huang,Cong Yao

doi:10.1016/j.patcog.2024.110816

Abstract

Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Current approaches address this issue by either predicting the adjacency of detected cells or direct generation of structural sequences. Nonetheless, these approaches either count on additional heuristic rules for post-processing, or involve the generation of extremely long-range sequences that lead to computational intricacy. In this paper, We redefine TSR as a LOgical location REgression paradigm, which effectively captures inherent logical dependencies and constraints among table cells. Correspondingly, we propose LORE, a novel approach for TSR. LORE simultaneously predicts accurate geometric coordinates of table cells and the logical structures of the entire table. Our proposed LORE is conceptually simpler, easier to train, and more accurate than other TSR paradigms. Moreover, to enhance the model’s spatial and logical representation capabilities, we propose two pre-training tasks, resulting in an upgraded version named LORE++. The incorporation of pre-training is proven to enjoy significant advantages, leading to a substantial enhancement in terms of accuracy, generalization, and few-shot capability compared to its predecessor. Experiments on standard benchmarks demonstrate the superiority of LORE++, which highlights the potential and promising prospect of the logical location regression paradigm for TSR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LORE++: Logical location regression network for table structure recognition with pre-training

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Jul 23, 2024
Citations: 2

Similar Papers

LORE: Logical Location Regression Network for Table Structure Recognition
Hangdi Xing ... Qi Zheng
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37
Hangdi Xing, et. al.Hangdi Xing ... Qi Zheng
26 Jun 2023
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37

Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing
Xiao-Hui Li ... Cheng-Lin Liu
Pattern Recognition | VOL. 132
Xiao-Hui Li, et. al.Xiao-Hui Li ... Cheng-Lin Liu
26 Jul 2022
Pattern Recognition | VOL. 132

DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images
Sebastian Schreiber ... Sheraz Ahmed
-
Sebastian Schreiber, et. al.Sebastian Schreiber ... Sheraz Ahmed
01 Nov 2017
01 Nov 2017

Table Structure Recognition Based on Cell Relationship, a Bottom-Up Approach
Darshan Adiga ... Shabir Bhat
-
Darshan Adiga, et. al.Darshan Adiga ... Shabir Bhat
22 Oct 2019
22 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LORE++: Logical location regression network for table structure recognition with pre-training

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition