Quantitative Aspects of PDTB-Style Discourse Relations across Languages

Kun Sun, Lili Zhang

doi:10.1080/09296174.2017.1390934

Abstract

The frequency distributions of words, syntactic units and semantic units in many languages abide by certain laws. However, because discourse corpora are scarce, few studies have examined whether the frequency of discourse relations follows a distributional pattern. Some research has drawn on the Rhetorical Structure Theory Discourse Treebank (RST-DT), but each of these studies is limited to a single language. The Penn Discourse Treebank (PDTB), which adopts a different annotation system from the RST-DT, has had an enormous influence on the study of discourse structure and discourse annotation. Discourse corpora in other languages, such as Chinese, Hindi, Turkish, Czech and Arabic, have been annotated in the PDTB style. Using the data from these discourse treebanks, we find that the rank-frequency distribution of discourse relations follows the same pattern in each language and that these languages share significant similarities in how they use semantic relations to organize discourse. Our results indicate that, where fewer connectives are used, humans assume that the relationship between two consecutive sentences is a causal connection or an expansion link, whereas contrast is the relation most often marked by connectives. This research will be of significance for understanding the homogeneity of discourse structure across languages.
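To make the notion of a rank-frequency pattern concrete, the following is a minimal sketch of how relation labels could be ranked by frequency and compared against a simple power-law curve. It is not the authors' procedure: the abstract does not name the model that was fitted, so the power law, the helper names `rank_frequency` and `fit_power_law`, and the toy counts are all assumptions made for illustration. Only the four label names are borrowed from the PDTB top-level sense classes.

```python
import math
from collections import Counter


def rank_frequency(labels):
    """Return (rank, label, count) triples, most frequent label first."""
    counts = Counter(labels)
    # Sort by descending count; break ties alphabetically for determinism.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rank, label, count)
            for rank, (label, count) in enumerate(ordered, start=1)]


def fit_power_law(table):
    """Least-squares fit of log(count) = log(c) - a * log(rank).

    Returns (a, c). The power-law form is an illustrative assumption,
    not the model reported in the paper.
    """
    xs = [math.log(rank) for rank, _, _ in table]
    ys = [math.log(count) for _, _, count in table]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)


# Hypothetical toy counts, NOT taken from any treebank.
toy_labels = (["Expansion"] * 40 + ["Contingency"] * 20
              + ["Comparison"] * 13 + ["Temporal"] * 10)

table = rank_frequency(toy_labels)
exponent, scale = fit_power_law(table)

for rank, label, count in table:
    print(rank, label, count)
print("fitted exponent:", round(exponent, 3), "scale:", round(scale, 1))
```

Running the same two functions on the relation labels of each treebank and comparing the fitted parameters would be one rough way to check whether the languages follow a common pattern. A real analysis would also need a goodness-of-fit test and a comparison with alternative models.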

Journal: Journal of Quantitative Linguistics
Publication Date: Jan 5, 2018
Citations: 3

Similar Papers

How compatible are our discourse annotation frameworks? Insights from mapping RST-DT and PDTB annotations
Vera Demberg ... Merel C.J Scholman
Dialogue & Discourse | VOL. 10
14 Jun 2019

Semi-supervised Discourse Relation Classification with Structural Learning
Hugo Hernault ... Danushka Bollegala
01 Jan 2010

A CDT-Styled End-to-End Chinese Discourse Parser
Fang Kong ... Guodong Zhou
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 16
13 Jul 2017

A computational model for measuring discourse complexity
Kun Sun ... Wenxin Xiong
Discourse Studies | VOL. 21
02 Aug 2019
