DeepTLF: robust deep neural networks for heterogeneous tabular data

Vadim Borisov,Klaus Broelemann,Enkelejda Kasneci,Gjergji Kasneci

doi:10.1007/s41060-022-00350-z

Vadim Borisov, Klaus Broelemann + Show 2 more

Open Access

https://doi.org/10.1007/s41060-022-00350-z

Copy DOI

Abstract

Although deep neural networks (DNNs) constitute the state of the art in many tasks based on visual, audio, or text data, their performance on heterogeneous, tabular data is typically inferior to that of decision tree ensembles. To bridge the gap between the difficulty of DNNs to handle tabular data and leverage the flexibility of deep learning under input heterogeneity, we propose DeepTLF, a framework for deep tabular learning. The core idea of our method is to transform the heterogeneous input data into homogeneous data to boost the performance of DNNs considerably. For the transformation step, we develop a novel knowledge distillations approach, TreeDrivenEncoder, which exploits the structure of decision trees trained on the available heterogeneous data to map the original input vectors onto homogeneous vectors that a DNN can use to improve the predictive performance. Within the proposed framework, we also address the issue of the multimodal learning, since it is challenging to apply decision tree ensemble methods when other data modalities are present. Through extensive and challenging experiments on various real-world datasets, we demonstrate that the DeepTLF pipeline leads to higher predictive performance. On average, our framework shows 19.6% performance improvement in comparison to DNNs. The DeepTLF code is publicly available.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Data Science and Analytics	Publication Date: Aug 23, 2022
Citations: 6	License type: open-access

R Discovery Prime

R Discovery Prime

DeepTLF: robust deep neural networks for heterogeneous tabular data

Abstract

Talk to us

Similar Papers

More From: International Journal of Data Science and Analytics

Lead the way for us

Similar Papers

Deep Neural Networks and Tabular Data: A Survey.
Vadim Borisov ... Tobias Leemann
IEEE transactions on neural networks and learning systems | VOL. 35
Vadim Borisov, et. al.Vadim Borisov ... Tobias Leemann
01 Jun 2024
IEEE transactions on neural networks and learning systems | VOL. 35

Deep transformation models for functional outcome prediction after acute ischemic stroke.
Lisa Herzog ... Lucas Kook
Biometrical journal. Biometrische Zeitschrift | VOL. 65
Lisa Herzog, et. al.Lisa Herzog ... Lucas Kook
09 Dec 2022
Biometrical journal. Biometrische Zeitschrift | VOL. 65

Artificial intelligence in interdisciplinary life science and drug discovery research.
Jürgen Bajorath
Future science OA | VOL. 8
Jürgen BajorathJürgen Bajorath
08 Mar 2022
Future science OA | VOL. 8

SAINTENS: Self-Attention and Intersample Attention Transformer for Digital Biomarker Development Using Tabular Healthcare Real World Data
Julian Gutheil ... Klaus Donsa
-
Julian Gutheil, et. al.Julian Gutheil ... Klaus Donsa
16 May 2022
16 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DeepTLF: robust deep neural networks for heterogeneous tabular data

Abstract

Talk to us

Similar Papers

More From: International Journal of Data Science and Analytics