Table representation learning using heterogeneous graph embedding

Willy Carlos Tchuitcheu,Tan Lu,Ann Dooms

doi:10.1016/j.patcog.2024.110734

Abstract

Tables, especially when having complex layouts, contain rich semantic information. However, effectively learning from tables to uncover such semantic information remains challenging. The rapid progress in natural language processing does not necessarily correspond to equivalent advancements in table parsing, which often requires joint visual and language modeling. Indeed, humans can quickly derive semantic meaning from table entries by associating them with corresponding column and/or row headers. Motivated by this observation, we propose a new heterogeneous Graph-based Table Representation Learning (GTRL) framework. GTRL combines graph-based visual modeling with sequence-based language modeling to learn granular per-cell embeddings that are sensitive to the semantic meaning of cells within their corresponding table context. We systematically evaluate the proposed GTRL framework using two datasets: a new adhesive table benchmark comprising complex tables extracted from industrial documents for learning per-entry semantics, and a publicly available large-scale dataset that enables learning header semantics from column tables. Experimental results demonstrate the competitive performance of the proposed GTRL, which often exhibits reduced computational complexity compared to state-of-the-art table representation learning models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Table representation learning using heterogeneous graph embedding

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Jul 1, 2024
License type: cc-by-nc-nd

Similar Papers

Are AI language models such as ChatGPT ready to improve the care of individuals with epilepsy?
Christian M Boßelmann ... Dennis Lal
Epilepsia | VOL. 64
Christian M Boßelmann, et. al.Christian M Boßelmann ... Dennis Lal
13 Mar 2023
Epilepsia | VOL. 64

Towards an Enhanced Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Anoop K ... Lajish V L
-
Anoop K, et. al. Anoop K ... Lajish V L
01 Jan 2021
01 Jan 2021

Geoscience language models and their intrinsic evaluation
Christopher J.M Lawley ... Geneviève Marquis
Applied Computing and Geosciences | VOL. 14
Christopher J.M Lawley, et. al.Christopher J.M Lawley ... Geneviève Marquis
04 May 2022
Applied Computing and Geosciences | VOL. 14

FETILDA: Evaluation Framework for Effective Representations of Long Financial Documents
Bolun (Namir) Xia ... Aparna Gupta
ACM Transactions on Knowledge Discovery from Data | VOL. 18
Bolun (Namir) Xia, et. al.Bolun (Namir) Xia ... Aparna Gupta
19 Jun 2024
ACM Transactions on Knowledge Discovery from Data | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Table representation learning using heterogeneous graph embedding

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition