HILDA: A Discourse Parser Using Support Vector Machine Classification

Hugo Hernault,David A Du Verle,Helmut Prendinger,Mitsuru Ishizuka

doi:10.5087/dad.2010.003

Abstract

Discourse structures have a central role in several computational tasks, such as question-answering or dialogue generation. In particular, the framework of the Rhetorical Structure Theory (RST) offers a sound formalism for hierarchical text organization. In this article, we present HILDA, an implemented discourse parser based on RST and Support Vector Machine (SVM) classification. SVM classifiers are trained and applied to discourse segmentation and relation labeling. By combining labeling with a greedy bottom-up tree building approach, we are able to create accurate discourse trees in linear time complexity. Importantly, our parser can parse entire texts, whereas the publicly available parser SPADE (Soricut and Marcu 2003) is limited to sentence level analysis. HILDA outperforms other discourse parsers for tree structure construction and discourse relation labeling. For the discourse parsing task, our system reaches 78.3% of the performance level of human annotators. Compared to a state-of-the-art rule-based discourse parser, our system achieves a performance increase of 11.6%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Dialogue & Discourse	Publication Date: Dec 10, 2010
Citations: 87	License type: cc-by

R Discovery Prime

R Discovery Prime

HILDA: A Discourse Parser Using Support Vector Machine Classification

Abstract

Talk to us

Similar Papers

More From: Dialogue & Discourse

Lead the way for us

Similar Papers

A Symbolic Corpus-based Approach to Detect and Solve the Ambiguity of Discourse Markers
Iria Da Cunha
Research in Computing Science | VOL. 70
Iria Da CunhaIria Da Cunha
31 Dec 2014
Research in Computing Science | VOL. 70

A Dependency Perspective on RST Discourse Parsing and Evaluation
Mathieu Morey ... Philippe Muller
Computational Linguistics | VOL. 44
Mathieu Morey, et. al.Mathieu Morey ... Philippe Muller
01 Jun 2018
Computational Linguistics | VOL. 44

A CDT-Styled End-to-End Chinese Discourse Parser
Fang Kong ... Guodong Zhou
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 16
Fang Kong, et. al.Fang Kong ... Guodong Zhou
13 Jul 2017
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 16

Measuring the coherence of healthy and aphasic discourse production in Chinese using Rhetorical Structure Theory (RST)
Kong Anthony Pak Hin ... Linnik Anastasia
Frontiers in Psychology | VOL. 5
Kong Anthony Pak Hin, et. al.Kong Anthony Pak Hin ... Linnik Anastasia
01 Jan 2014
Frontiers in Psychology | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HILDA: A Discourse Parser Using Support Vector Machine Classification

Abstract

Talk to us

Similar Papers

More From: Dialogue &amp; Discourse

More From: Dialogue & Discourse