Abstract

LSTM-based recurrent neural networks are the state of the art for many natural language processing (NLP) tasks. Despite their performance, it is unclear whether, or how, LSTMs learn structural features of natural languages such as subject-verb number agreement in English. Lacking this understanding, the generality of LSTM performance on this task and their suitability for related tasks remain uncertain. Further, errors cannot be properly attributed to a lack of structural capability, training data omissions, or other exceptional faults. We introduce *influence paths*, a causal account of structural properties as carried by paths across gates and neurons of a recurrent neural network. The approach refines the notion of influence (e.g., the subject's grammatical number influences the grammatical number of the subsequent verb) into a set of gate- or neuron-level paths. The set localizes and segments the concept (e.g., subject-verb agreement), its constituent elements (e.g., the subject), and related or interfering elements (e.g., attractors). We exemplify the methodology on a widely studied multi-layer LSTM language model, demonstrating how it accounts for subject-verb number agreement. The results offer both a finer and a more complete view of an LSTM's handling of this structural aspect of the English language than prior results based on diagnostic classifiers and ablation.
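To make the aggregate notion of influence concrete, the following is a minimal sketch of one common gradient-based measure: the sensitivity of the model's plural-versus-singular verb preference to the subject's embedding. This is not the paper's exact formulation (which decomposes influence along gate- and neuron-level paths); the names `embed`, `lstm`, `decoder`, and `vocab` are hypothetical stand-ins for the components of a word-level LSTM language model.

```python
# Minimal sketch, assuming a standard PyTorch decomposition of a word-level
# LSTM language model into an embedding layer, a batch-first nn.LSTM, and a
# linear decoder over the vocabulary. All names here are illustrative.
import torch

def subject_influence(embed, lstm, decoder, vocab, prefix_words, subj_pos,
                      plural_verb, singular_verb):
    """Gradient of the plural-vs-singular verb preference at the position
    after the prefix, taken with respect to the subject's embedding."""
    ids = torch.tensor([[vocab[w] for w in prefix_words]])     # (1, T) word ids
    emb = embed(ids).detach().requires_grad_(True)             # leaf, (1, T, d)
    out, _ = lstm(emb)                                         # (1, T, h)
    logits = decoder(out[0, -1])                               # next-word scores
    pref = logits[vocab[plural_verb]] - logits[vocab[singular_verb]]
    pref.backward()
    return emb.grad[0, subj_pos]                               # (d,) sensitivity
```

Influence paths, in contrast, attribute such an aggregate quantity to individual paths through the LSTM's gates and neurons rather than reporting only a single number.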

Highlights

  • Traditional rule-based natural language processing (NLP) techniques can capture syntactic structures, while statistical NLP techniques, such as n-gram models, can heuristically integrate the semantics of a natural language

  • In both, we investigate high-attribution neurons along primary paths, allowing us to compare our results to prior work

  • We study the exact combination of language model and number agreement (NA) datasets used in the closely related prior work of Lakretz et al. (2019)

Summary

Introduction

Traditional rule-based NLP techniques can capture syntactic structures, while statistical NLP techniques, such as n-gram models, can heuristically integrate the semantics of a natural language. Modern RNN-based models such as Long Short-Term Memory (LSTM) models are tasked with incorporating both semantic features from the statistical associations in their training corpus and structural features generalized from the same.

Number Agreement in Language Models

The number agreement (NA) task, as described by Linzen et al. (2016), evaluates a language model's ability to match the grammatical number of the verb with that of its subject. The evaluation is performed on sentences designed for the exercise, with zero or more words between the subject and the main verb, termed the context. The task for sentences with non-empty contexts is referred to as long-term number agreement.
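As an illustration, the NA evaluation described above can be phrased as a comparison of the model's next-word scores for the two verb forms after the prefix (subject plus context). This is a sketch under assumptions: `lstm_lm` stands for any pretrained word-level LSTM language model returning next-word logits, and `vocab` for its word-to-index mapping; neither name comes from the paper.

```python
# Minimal sketch of the number-agreement (NA) check: does the model assign a
# higher next-word score to the correctly numbered verb than to the wrong one?
import torch

def na_correct(lstm_lm, vocab, prefix_words, correct_verb, wrong_verb):
    ids = torch.tensor([[vocab[w] for w in prefix_words]])   # (1, T) word ids
    with torch.no_grad():
        logits, _ = lstm_lm(ids)                             # (1, T, |V|) scores
    next_scores = logits[0, -1]                              # scores after prefix
    return bool(next_scores[vocab[correct_verb]] > next_scores[vocab[wrong_verb]])

# Long-term NA example with a singular attractor ("car") in the context:
#   na_correct(lstm_lm, vocab, "the boys near the car".split(), "are", "is")
```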
