Table Transformers for imputing textual attributes

Ting-Ruen Wei,Yuan Wang,Yoshitaka Inoue,Hsin-Tai Wu,Yi Fang

doi:10.1016/j.patrec.2024.09.023

Abstract

Missing data in tabular dataset is a common issue as the performance of downstream tasks usually depends on the completeness of the training dataset. Previous missing data imputation methods focus on numeric and categorical columns, but we propose a novel end-to-end approach called Table Transformers for Imputing Textual Attributes (TTITA) based on the transformer to impute unstructured textual columns using other columns in the table. We conduct extensive experiments on three datasets, and our approach shows competitive performance outperforming baseline models such as recurrent neural networks and Llama2. The performance improvement is more significant when the target sequence has a longer length. Additionally, we incorporate multi-task learning to simultaneously impute for heterogeneous columns, boosting the performance for text imputation. We also qualitatively compare with ChatGPT for realistic applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Table Transformers for imputing textual attributes

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Similar Papers

Gradient-Based Learning of Compositional Dynamics with Modular RNNs
Sebastian Otte ... Patricia Rubisch
-
Sebastian Otte, et. al.Sebastian Otte ... Patricia Rubisch
01 Jan 2019
01 Jan 2019

Perturbation of deep autoencoder weights for model compression and classification of tabular data
Sakib Abrar ... Manar D Samad
Neural networks : the official journal of the International Neural Network Society | VOL. 156
Sakib Abrar, et. al.Sakib Abrar ... Manar D Samad
27 Sep 2022
Neural networks : the official journal of the International Neural Network Society | VOL. 156

Recurrent convolutional neural network for answer selection in community question answering
Xiaoqiang Zhou ... Baotian Hu
Neurocomputing | VOL. 274
Xiaoqiang Zhou, et. al.Xiaoqiang Zhou ... Baotian Hu
11 Apr 2017
Neurocomputing | VOL. 274

Gated Recurrent Neural Tensor Network
Andros Tjandra ... Sakriani Sakti
-
Andros Tjandra, et. al.Andros Tjandra ... Sakriani Sakti
01 Jul 2016
01 Jul 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Table Transformers for imputing textual attributes

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters