Learn More Manchu Words with A New Visual-Language Framework

Wang Wang,Qi Qi,Su Su,Lu Lu,Wei Wei

doi:10.1145/3652992

Abstract

Manchu language, a minority language of China, is of significant historical and research value. An increasing number of Manchu documents are digitized into image format for better preservation and study. Recently, many researchers focused on identifying Manchu words in digitized documents. In previous approaches, a variety of Manchu words are recognized based on visual cues. However, we notice that visual-based approaches have some obvious drawbacks. On one hand, it is difficult to distinguish between similar and distorted letters. On the other hand, portions of letters obscured by breakage and stains are hard to identify. To cope with these two challenges, we propose a visual-language framework, namely the Visual-Language framework for Manchu word Recognition (VLMR), which fuses visual and semantic information to accurately recognize Manchu words. Whenever visual information is not available, the language model can automatically associate the semantics of words. The performance of our method is further enhanced by introducing a self-knowledge distillation network. In addition, we created a new handwritten Manchu word dataset named (HMW), which contains 6,721 handwritten Manchu words. The novel approach is evaluated on WMW and HMW. The experiments show that our proposed method achieves state-of-the-art performance on both datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learn More Manchu Words with A New Visual-Language Framework

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Similar Papers

Visual Targeting of Forelimbs in Ladder-Walking Locusts
Jeremy E Niven ... Simon B Laughlin
Current Biology | VOL. 20
Jeremy E Niven, et. al.Jeremy E Niven ... Simon B Laughlin
31 Dec 2009
Current Biology | VOL. 20

Multimodal signalling: the relative importance of chemical and visual cues from females to the behaviour of male wolf spiders (Lycosidae)
Ann L Rypstra ... Matthew H Persons
Animal Behaviour | VOL. 77
Ann L Rypstra, et. al.Ann L Rypstra ... Matthew H Persons
26 Feb 2009
Animal Behaviour | VOL. 77

Young Children Do Not Integrate Visual and Haptic Form Information
Monica Gori ... David C Burr
Current Biology | VOL. 18
Monica Gori, et. al.Monica Gori ... David C Burr
01 May 2008
Current Biology | VOL. 18

Egocentric perception through interaction among many sensory systems
Masao Ohmi
Cognitive Brain Research | VOL. 5
Masao OhmiMasao Ohmi
01 Dec 1996
Cognitive Brain Research | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learn More Manchu Words with A New Visual-Language Framework

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing