Enhancing Turkish Coreference Resolution: Insights from deep learning, dropped pronouns, and multilingual transfer learning

Tuğba Pamay Arslan,Gülşen Eryiğit

doi:10.1016/j.csl.2024.101681

Abstract

Coreference resolution (CR), which is the identification of in-text mentions that refer to the same entity, is a crucial step in natural language understanding. While CR in English has been studied for quite a long time, studies for pro-dropped and morphologically rich languages is an active research area which has yet to reach sufficient maturity. Turkish, a morphologically highly-rich language, poses interesting challenges for natural language processing tasks, including CR, due to its agglutinative nature and consequent pronoun-dropping phenomenon. This article explores the use of different neural CR architectures (i.e., mention-pair, mention-ranking, and end-to-end) on Turkish, a morphologically highly-rich language, by formulating multiple research questions around the impacts of dropped pronouns, data quality, and interlingual transfer. The preparations made to explore these research questions and the findings obtained as a result of our explorations revealed the first Turkish CR dataset that includes dropped pronoun annotations (of size 4K entities/22K mentions), new state-of-the-art results on Turkish CR, the first neural end-to-end Turkish CR results (70.4% F-score), the first multilingual end-to-end CR results including Turkish (yielding 1.0 percentage points improvement on Turkish) and the demonstration of the positive impact of dropped pronouns on CR of pro-dropped and morphologically rich languages, for the first time in the literature. Our research has brought Turkish end-to-end CR performances (72.0% F-score) to similar levels with other languages, surpassing the baseline scores by 32.1 percentage points.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing Turkish Coreference Resolution: Insights from deep learning, dropped pronouns, and multilingual transfer learning

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Similar Papers

Structured Representations for Coreference Resolution

-

01 Jan 2017
01 Jan 2017

Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.
Halil Kilicoglu ... Dina Demner-Fushman
PloS one | VOL. 11
Halil Kilicoglu, et. al.Halil Kilicoglu ... Dina Demner-Fushman
02 Mar 2016
PloS one | VOL. 11

Developments in The Field of Natural Language Processing

International Journal of Advanced Research in Computer Science | VOL. 8

30 Apr 2017
International Journal of Advanced Research in Computer Science | VOL. 8

Coreference Resolution in Research Papers from Multiple Domains
Arthur Brack ... Daniel Uwe Müller
-
Arthur Brack, et. al.Arthur Brack ... Daniel Uwe Müller
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing Turkish Coreference Resolution: Insights from deep learning, dropped pronouns, and multilingual transfer learning

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language