Decomposing Generalization: Models of Generic, Habitual, and Episodic Statements

Venkata Govindarajan,Benjamin Van Durme,Aaron Steven White

doi:10.1162/tacl_a_00285

Abstract

We present a novel semantic framework for modeling linguistic expressions of generalization— generic, habitual, and episodic statements—as combinations of simple, real-valued referential properties of predicates and their arguments. We use this framework to construct a dataset covering the entirety of the Universal Dependencies English Web Treebank. We use this dataset to probe the efficacy of type-level and token-level information—including hand-engineered features and static (GloVe) and contextual (ELMo) word embeddings—for predicting expressions of generalization.

Highlights

Natural language allows us to convey information about particular individuals and events, as in (1), and generalizations about those individuals and events, as in (2).(1) a
Taking inspiration from decompositional semantics (Reisinger et al, 2015; White et al, 2016), we suggest that linguistic expressions of generalization should be captured in a continuous multilabel system, rather than a multi-class system
The ACE-2005 Multilingual Training Corpus (Walker et al, 2006) extends these annotation guidelines, providing two additional classes: (i) negatively quantified entries (NEG) for referring to empty sets and (ii) underspecified entries (USP), where the referent is ambiguous between GENERIC and SPECIFIC

Summary

Introduction

Natural language allows us to convey information about particular individuals and events, as in (1), and generalizations about those individuals and events, as in (2). One obstacle to further progress on generalization is that current frameworks tend to take standard descriptive categories as sharp classes— e.g. EPISODIC, GENERIC, HABITUAL for statements and KIND, INDIVIDUAL for noun phrases This may seem reasonable for sentences like (1a), where Mary clearly refers to a particular individual, or (3a), where Bishops clearly refers to a kind; but natural text is less forgiving (Grimm, 2014, 2016, 2018). Taking inspiration from decompositional semantics (Reisinger et al, 2015; White et al, 2016), we suggest that linguistic expressions of generalization should be captured in a continuous multilabel system, rather than a multi-class system We do this by decomposing categories such as EPISODIC, HABITUAL, and GENERIC into simple referential properties of predicates and their arguments. We find that (i) referential properties of arguments are easier to predict than those of predicates; and that (ii) contextual learned representations contain most of the relevant information for both arguments and predicates (§9)

Background

Annotation Framework

Framework Validation

Comparison to Standard Ontology

Bulk Annotation

Exploratory Analysis

Models

Results

10 Analysis

11 Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Nov 1, 2019
Citations: 38	License type: cc-by

R Discovery Prime

R Discovery Prime

Decomposing Generalization: Models of Generic, Habitual, and Episodic Statements

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

RadioBERT: A deep learning-based system for medical report generation from chest X-ray images using contextual embeddings
Navdeep Kaur ... Ajay Mittal
Journal of Biomedical Informatics | VOL. 135
Navdeep Kaur, et. al.Navdeep Kaur ... Ajay Mittal
10 Oct 2022
Journal of Biomedical Informatics | VOL. 135

Personalized Query Expansion with Contextual Word Embeddings
Elias Bassani ... Nicola Tonellotto
ACM Transactions on Information Systems | VOL. 42
Elias Bassani, et. al.Elias Bassani ... Nicola Tonellotto
11 Dec 2023
ACM Transactions on Information Systems | VOL. 42

Assessing the Impact of Contextual Embeddings for Portuguese Named Entity Recognition
Joaquim Santos ... Renata Vieira
-
Joaquim Santos, et. al.Joaquim Santos ... Renata Vieira
01 Oct 2019
01 Oct 2019

Contextual Word Embeddings and Topic Modeling in Healthy Dieting and Obesity.
Vijaya Kumari Yeruva ... Sidrah Junaid
Journal of Healthcare Informatics Research | VOL. 3
Vijaya Kumari Yeruva, et. al.Vijaya Kumari Yeruva ... Sidrah Junaid
01 Jun 2019
Journal of Healthcare Informatics Research | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decomposing Generalization: Models of Generic, Habitual, and Episodic Statements

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics