Unsupervised Neural Dialect Translation with Commonality and Diversity Modeling

Yu Wan, Derek F. Wong, Haihua Du, Ben C.H. Ao, Baosong Yang, Lidia S. Chao

doi:10.1609/aaai.v34i05.6448

Abstract

As a special machine translation task, dialect translation has two main characteristics: 1) a lack of parallel training corpora; and 2) similar grammar on the two sides of the translation. In this paper, we investigate how to exploit the commonality and diversity between dialects in order to build unsupervised translation models that access only monolingual data. Specifically, we leverage pivot-private embedding, layer coordination, and parameter sharing to model the commonality and diversity between source and target at the lexical, syntactic, and semantic levels. To examine the effectiveness of the proposed models, we collect a monolingual corpus of size 20 million for each of Mandarin and Cantonese, which are, respectively, the official language and the most widely used dialect in China. Experimental results show that our methods outperform both rule-based simplified-traditional Chinese conversion and conventional unsupervised translation models by more than 12 BLEU points.

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: Apr 3, 2020
Citations: 16

Similar Papers

Polygon-Net: A General Framework for Jointly Boosting Multiple Unsupervised Neural Machine Translation Models
Chang Xu ... Tie-Yan Liu
01 Aug 2019

Unsupervised English Intelligent Machine Translation in Wireless Network Environment
Bing Zhang ... Muhammad Arif
Security and Communication Networks | VOL. 2022
21 May 2022

A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation
Shuo Ren ... Shujie Liu
01 Jan 2020

Unsupervised Image Super-Resolution with an Indirect Supervised Path
Shuaijun Chen ... Zhen Han
01 Jun 2020
