Abstract

This paper addresses the usefulness of speech pauses for determining whether third person neuter gender singular pronouns refer to individual or abstract entities in Danish spoken language. The annotations of dyadic map task dialogues and spontaneous first encounters are analyzed and used in machine learning experiments act to automatically identify the anaphoric functions of pronouns and the type of abstract reference. The analysis of the data shows that abstract reference is more often performed by marked (stressed or demonstrative pronouns) than by unmarked personal pronouns in Danish speech as in English, and therefore previous studies of abstract reference in the former language are corrected. The data also show that silent and filled pauses precede significantly more often third person singular neuter gender pronouns when they refer to abstract entities than when they refer to individual entities. Since abstract entities are not the most salient ones and referring to them is cognitively more hard than referring to individual entities, pauses signal this complex processes. This is in line with perception studies, which connect pauses with the expression of abstract or complex concepts. We also found that unmarked pronouns referring to an entity type usually referred to by a marked pronoun are significantly more often preceded by a speech pause than marked pronouns with the same referent type. This indicates that speech pauses can also signal that the referent of a pronoun of a certain type is not the most expected one. Finally, language models were produced from the annotated map task and first encounter dialogues in order to train machine learning experiments to predict the function of third person neuter gender singular pronouns as a first step toward the identification of the anaphoric antecedents. The language models from the map task dialogues were also used for training classifiers to determine the referent type (speech act, event, fact or proposition) of abstract anaphors. In all cases, the best results were obtained by a multilayer perceptron with an F1-score between 0.52 and 0.67 for the three-class function prediction task and of 0.73 for the referential type prediction.

Highlights

  • This paper addresses the role of speech pauses for determining what third person neuter gender singular pronouns, such as the English it, this and that refer to

  • The analysis of the DanPASS dialogues shows that both silent and filled pauses precede more frequently abstract anaphors than individual anaphors. This indicates that speech pauses can signal that the speaker is going to utter a difficult concept, since according to all theories of reference tpngs pronouns with abstract antecedents are less salient/accessible to the addressee than pronouns with individual antecedents, and they are more difficult to express for the speaker

  • The best measure we have in order to compare our classification results with the current state of art is given by the results reported by Marasovic et al (2017) for the resolution of English tpngs pronouns in the WSJ part of the ARRAU corpus

Read more

Summary

INTRODUCTION

This paper addresses the role of speech pauses for determining what third person neuter gender singular pronouns, such as the English it, this and that refer to. We analyze the relation between stress, speech pauses and tpngs pronouns in Danish annotated dialogues and use this relation in language models on which we train classifiers to discriminate individual and abstract anaphors from other uses of these pronouns, and to predict the referent type of abstract anaphors. The occurrences of abstract and individual tpngs pronouns in the two dialogue types and the role of stress and speech pauses for discriminating their uses are analyzed.

INDIVIDUAL AND ABSTRACT PRONOMINAL REFERENCE
RELEVANT STUDIES ON SPEECH PAUSES
The Annotated Corpora
The Annotations
Data Analysis
IDENTIFYING PRONOMINAL ANAPHORIC FUNCTIONS AND ABSTRACT REFERENT TYPES
Findings
DISCUSSION
CONCLUSIONS AND FUTURE STUDIES
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call