Anisotropic span embeddings and the negative impact of higher-order inference for coreference resolution: An empirical analysis

Feng Hou,Fangyi Zhu,See-Kiong Ng,Lily Chen,Michael Witbrock,Xiaoyun Jia,Steven F Cahan,Ruili Wang

doi:10.1017/s1351324924000019

Abstract

Abstract Coreference resolution is the task of identifying and clustering mentions that refer to the same entity in a document. Based on state-of-the-art deep learning approaches, end-to-end coreference resolution considers all spans as candidate mentions and tackles mention detection and coreference resolution simultaneously. Recently, researchers have attempted to incorporate document-level context using higher-order inference (HOI) to improve end-to-end coreference resolution. However, HOI methods have been shown to have marginal or even negative impact on coreference resolution. In this paper, we reveal the reasons for the negative impact of HOI coreference resolution. Contextualized representations (e.g., those produced by BERT) for building span embeddings have been shown to be highly anisotropic. We show that HOI actually increases and thus worsens the anisotropy of span embeddings and makes it difficult to distinguish between related but distinct entities (e.g., pilots and flight attendants). Instead of using HOI, we propose two methods, Less-Anisotropic Internal Representations (LAIR) and Data Augmentation with Document Synthesis and Mention Swap (DSMS), to learn less-anisotropic span embeddings for coreference resolution. LAIR uses a linear aggregation of the first layer and the topmost layer of contextualized embeddings. DSMS generates more diversified examples of related but distinct entities by synthesizing documents and by mention swapping. Our experiments show that less-anisotropic span embeddings improve the performance significantly (+2.8 F1 gain on the OntoNotes benchmark) reaching new state-of-the-art performance on the GAP dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Anisotropic span embeddings and the negative impact of higher-order inference for coreference resolution: An empirical analysis

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering

Lead the way for us

Journal: Natural Language Engineering	Publication Date: Jan 25, 2024
License type: CC BY 4.0

Similar Papers

Revealing the Myth of Higher-Order Inference in Coreference Resolution
Liyan Xu ... Jinho D Choi
-
Liyan Xu, et. al.Liyan Xu ... Jinho D Choi
01 Jan 2020
01 Jan 2020

Sequential Cross-Document Coreference Resolution
...
-
, et. al. ...
15 Oct 2021
15 Oct 2021

Co-reference Resolution in Prompt Engineering
Mridusmita Das ... Apurbalal Senapati
Procedia Computer Science | VOL. 244
Mridusmita Das, et. al.Mridusmita Das ... Apurbalal Senapati
01 Jan 2024
Procedia Computer Science | VOL. 244

BERT for Coreference Resolution: Baselines and Analysis
Mandar Joshi ... Daniel Weld
-
Mandar Joshi, et. al.Mandar Joshi ... Daniel Weld
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Anisotropic span embeddings and the negative impact of higher-order inference for coreference resolution: An empirical analysis

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering