Mongolian prosodic phrase prediction using suffix segmentation

Rui Liu, Feilong Bao, Guanglai Gao, Weihua Wang

doi:10.1109/ialp.2016.7875979

Abstract

Accurate prosodic phrase prediction can improve the naturalness of speech synthesis. Prosodic phrase prediction can be regarded as a sequence labeling problem, and a Conditional Random Field (CRF) is typically used to solve it. Mongolian is an agglutinative language in which a very large number of words can be formed by concatenating stems and suffixes. This characteristic makes it difficult to build a high-performance CRF-based Mongolian prosodic phrase prediction system. We introduce a new method that segments each Mongolian word into its stem and suffix and treats each as an individual token. The proposed method integrates multiple features according to the characteristics of Mongolian word formation. We conduct comparative experiments with the following features: word, multi-level Part-of-Speech (POS), multi-level lexical features for the suffix, and the existence of a suffix. The experimental results show that our method significantly improves the performance of the Mongolian prosodic phrase prediction system compared with the conventional method, which treats each Mongolian word directly as a token. The word feature, the level-one lexical feature for the suffix, and the suffix-existence feature are effective. The best result, measured by F1-measure, is 82.49%.
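
The tokenization idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that suffix boundaries are already marked in the input with a hypothetical "+" delimiter, that every piece after the stem becomes its own suffix token, and that features are emitted as per-token dictionaries of the kind many CRF toolkits accept. The function names, feature names, and example strings are placeholders chosen for illustration; the POS and multi-level lexical features named in the abstract are omitted because their definitions are not given here.

```python
from typing import Dict, List, Tuple

# Hypothetical boundary marker between a stem and its suffixes (an assumption,
# not taken from the paper).
SUFFIX_MARK = "+"

Token = Tuple[str, bool]  # (text, is_suffix)


def segment(word: str) -> List[Token]:
    """Split 'stem+suffix' into a stem token followed by suffix tokens."""
    parts = [p for p in word.split(SUFFIX_MARK) if p]
    # The first piece is treated as the stem; later pieces are suffixes.
    return [(part, index > 0) for index, part in enumerate(parts)]


def tokenize(sentence: List[str]) -> List[Token]:
    """Flatten a list of words into one stem/suffix token sequence."""
    tokens: List[Token] = []
    for word in sentence:
        tokens.extend(segment(word))
    return tokens


def token_features(tokens: List[Token], i: int) -> Dict[str, object]:
    """Build features for token i: its text, a suffix flag, and neighbours."""
    text, is_suffix = tokens[i]
    feats: Dict[str, object] = {"token": text, "is_suffix": is_suffix}
    if i > 0:
        feats["prev_token"] = tokens[i - 1][0]
    else:
        feats["BOS"] = True  # beginning of sentence
    if i < len(tokens) - 1:
        feats["next_token"] = tokens[i + 1][0]
    else:
        feats["EOS"] = True  # end of sentence
    return feats


def sentence_features(sentence: List[str]) -> List[Dict[str, object]]:
    """Feature dictionaries for every stem/suffix token in the sentence."""
    tokens = tokenize(sentence)
    return [token_features(tokens, i) for i in range(len(tokens))]


if __name__ == "__main__":
    # Placeholder strings, not real annotated Mongolian data.
    example = ["ger+tei", "hun", "ireb+e"]
    for feats in sentence_features(example):
        print(feats)
```

In a full system, each feature dictionary would be paired with a prosodic boundary label and passed to a CRF trainer. Under the conventional word-as-token baseline described in the abstract, `segment` would instead return the whole word as a single token.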
