정렬기법을 이용한 미등록 대역어의 자동 추출

Jae-Hoon Kim,Sung-Il Yang

doi:10.3745/kipstb.2007.14-b.3.231

Abstract

이 논문은 정렬 기법을 이용한 미등록 대역어 추출 모델을 제안하고 그 추출 시스템을 구현한다. 제안된 미등록 대역어 추출 모델은 일종의 구절정렬 모델로서 경계모델과 언어모델 그리고 번역 모델로 구성된다. 제안된 추출 시스템은 병렬말뭉치 구축, 단어정렬, 미등록어 추출로 구성된다. 이 논문에서는 제안된 시스템을 평가하기 위해서 약 1,500여 개의 미등록어가 포함된 2,200문장의 평가말뭉치를 구축하여 다양한 실험을 수행하였다. 실험을 통해서 제안된 모델이 미등록 대역어 추출에 매우 유용함을 알 수 있었다. 앞으로 좀 더 객관적인 평가를 위해 대량의 평가말뭉치 구축이 선행되어야 하며 좀 더 양질의 병렬말뭉치의 구축이 필요할 것이다. 또한 미등록어 추출 모델을 개선하기 다양한 연구가 추진되어야 할 것이다. In this paper, we propose an automatic extraction model for unknown translations and implement an unknown translation extraction system using the proposed model. The proposed model as a phrase-alignment model is incorporated with three models: a phrase-boundary model, a language model, and a translation model. Using the proposed model we implement the system for extracting unknown translations, which consists of three parts: construction of parallel corpora, alignment of Korean and English words, extraction of unknown translations. To evaluate the performance of the proposed system we have established the reference corpus for extracting unknown translation, which comprises of 2,220 parallel sentences including about 1,500 unknown translations. Through several experiments, we have observed that the proposed model is very useful for extracting unknown translations. In the future, researches on objective evaluation and establishment of parallel corpora with good quality should be performed and studies on improving the performance of unknown translation extraction should be kept up.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

정렬기법을 이용한 미등록 대역어의 자동 추출

Abstract

Talk to us

Similar Papers

More From: The KIPS Transactions:PartB

Lead the way for us

Similar Papers

적응형 채도 향상 알고리즘을 이용한 컬러 영상 처리 기법
Kyoung-Ok Yang ... Jong-Ho Yun
The KIPS Transactions:PartB | VOL. 14B
Kyoung-Ok Yang, et. al.Kyoung-Ok Yang ... Jong-Ho Yun
30 Jun 2007
The KIPS Transactions:PartB | VOL. 14B

Dynamic Fusion: Attentional Language Model for Neural Machine Translation
Michiki Kurosawa ... Mamoru Komachi
-
Michiki Kurosawa, et. al.Michiki Kurosawa ... Mamoru Komachi
01 Jan 2020
01 Jan 2020

Indonesian to Bengkulu Malay Statistical Machine Translation System
Bella Okta Sari Miranda ... Herman Yuliansyah
International Journal of Advances in Data and Information Systems | VOL. 5
Bella Okta Sari Miranda, et. al.Bella Okta Sari Miranda ... Herman Yuliansyah
29 Sep 2024
International Journal of Advances in Data and Information Systems | VOL. 5

A projection extension algorithm for statistical machine translation
Christoph Tillmann
-
Christoph TillmannChristoph Tillmann
01 Jan 2003
01 Jan 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

정렬기법을 이용한 미등록 대역어의 자동 추출

Abstract

Talk to us

Similar Papers

More From: The KIPS Transactions:PartB