Evaluating Deep Learning Techniques for Natural Language Inference

Petros Eleftheriadis,Isidoros Perikos,Ioannis Hatzilygeroudis

doi:10.3390/app13042577

Petros Eleftheriadis, Isidoros Perikos + Show 1 more

Open Access

https://doi.org/10.3390/app13042577

Copy DOI

Journal: Applied Sciences	Publication Date: Feb 16, 2023
Citations: 4	License type: CC BY 4.0

Affiliation: University of Patras

Abstract

Natural language inference (NLI) is one of the most important natural language understanding (NLU) tasks. NLI expresses the ability to infer information during spoken or written communication. The NLI task concerns the determination of the entailment relation of a pair of sentences, called the premise and hypothesis. If the premise entails the hypothesis, the pair is labeled as an “entailment”. If the hypothesis contradicts the premise, the pair is labeled a “contradiction”, and if there is not enough information to infer a relationship, the pair is labeled as “neutral”. In this paper, we present experimentation results of using modern deep learning (DL) models, such as the pre-trained transformer BERT, as well as additional models that relay on LSTM networks, for the NLI task. We compare five DL models (and variations of them) on eight widely used NLI datasets. We trained and fine-tuned the hyperparameters for each model to achieve the best performance for each dataset, where we achieved some state-of-the-art results. Next, we examined the inference ability of the models on the BreakingNLI dataset, which evaluates the model’s ability to recognize lexical inferences. Finally, we tested the generalization power of our models across all the NLI datasets. The results of the study are quite interesting. In the first part of our experimentation, the results indicate the performance advantage of the pre-trained transformers BERT, RoBERTa, and ALBERT over other deep learning models. This became more evident when they were tested on the BreakingNLI dataset. We also see a pattern of improved performance when the larger models are used. However, ALBERT, given that it has 18 times fewer parameters, achieved quite remarkable performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluating Deep Learning Techniques for Natural Language Inference

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning
...
-
, et. al. ...
01 Aug 2021
01 Aug 2021

Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning
Benjamin Minixhofer ... Milan Gritta
-
Benjamin Minixhofer, et. al.Benjamin Minixhofer ... Milan Gritta
01 Jan 2020
01 Jan 2020

Don’t Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference
Yonatan Belinkov ... Benjamin Van Durme
-
Yonatan Belinkov, et. al.Yonatan Belinkov ... Benjamin Van Durme
01 Jan 2019
01 Jan 2019

Simple Data Transformations for Mitigating the Syntactic Similarity to Improve Sentence Embeddings at Supervised Contrastive Learning
Minji Kim ... Soohyeong Kim
Advanced Intelligent Systems | VOL. -
Minji Kim, et. al.Minji Kim ... Soohyeong Kim
15 Jul 2024
Advanced Intelligent Systems | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluating Deep Learning Techniques for Natural Language Inference

Abstract

Talk to us

Similar Papers

More From: Applied Sciences