The Thai Discourse Treebank: Annotating and Classifying Thai Discourse Connectives

Ponrawee Prasertsom, Attapol T Rutherford, Apiwat Jaroonpol

doi:10.1162/tacl_a_00650

Abstract

Discourse analysis is a highly applicable area of natural language processing. In English and other languages, resources for discourse-based tasks are widely available. Thai, however, has hitherto lacked such resources. We present the Thai Discourse Treebank, the first large Thai corpus annotated in the style of the Penn Discourse Treebank. The resulting corpus has over 10,000 sentences and 18,000 instances of connectives in 33 different relations. To facilitate future research, we release the corpus alongside our list of 148 potentially polysemous discourse connectives, with a total of 340 form-sense pairs, and their classification criteria. We also develop models for the connective identification and sense classification tasks. Our best models achieve an F1 of 0.96 on the identification task and 0.46 on the sense classification task. Our results serve as benchmarks for future models for Thai discourse tasks.
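
For readers unfamiliar with the metric, the following minimal sketch shows how an F1 score can be computed for a binary identification task. It is an illustration only: the function name `f1_score`, the 0/1 label encoding, and the toy data are assumptions, and the abstract does not state the exact evaluation protocol (for example, the averaging scheme used for the multi-class sense classification task).

```python
def f1_score(gold, predicted):
    """Binary F1 over parallel lists of 0/1 labels.

    Assumed encoding (hypothetical): 1 = candidate is a discourse
    connective, 0 = it is not.
    """
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == 1 and p == 1)  # true positives
    fp = sum(1 for g, p in pairs if g == 0 and p == 1)  # false positives
    fn = sum(1 for g, p in pairs if g == 1 and p == 0)  # false negatives
    if tp == 0:
        # No correct positive predictions: F1 is defined as 0 here.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Toy data, not taken from the corpus.
gold = [1, 0, 0, 1, 1, 0]
pred = [1, 0, 1, 1, 0, 0]
score = f1_score(gold, pred)
print(f"F1 = {score:.3f}")
```

In this toy case there are two true positives, one false positive, and one false negative, so precision and recall are both 2/3 and so is their harmonic mean.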

Journal: Transactions of the Association for Computational Linguistics
Publication Date: May 6, 2024
License type: CC BY 4.0
