Abstract

The selection of the speech recognition modeling unit is the primary problem of acoustic modeling in speech recognition, and different acoustic modeling units will directly affect the overall performance of speech recognition. This paper designs the Tibetan character segmentation and labeling model and algorithm flow for the purpose of solving the problem of selecting the acoustic modeling unit in Tibetan speech recognition by studying and analyzing the deficiencies of the existing acoustic modeling units in Tibetan speech recognition. After experimental verification, the Tibetan character segmentation and labeling model and algorithm achieved good performance of character segmentation and labeling, and the accuracy of Tibetan character segmentation and labeling reached 99.98%, respectively.

Highlights

  • Automatic speech recognition technology is a key technology for human-computer interaction

  • The Tibetan character segmentation and labeling model in this article is mainly composed of preprocessing, segmentation and labeling modules

  • This paper proposes the algorithm flow of Tibetan character segmentation and labeling by designing the Tibetan character segmentation and labeling model

Read more

Summary

Introduction

Automatic speech recognition technology is a key technology for human-computer interaction. In Tibetan speech recognition system, researchers have considered modeling units with different granularity, including words and syllables [4], vowels [5,6,7,8] and phonemes [9,10,11], respectively. If words or syllables are used as modeling units, the requirements of the corpus are too high and can lead to data sparsity. In order to solve the above problems, the paper proposes a method of using Tibetan character as the modeling unit, and presents the flow of its segmentation and labeling algorithm. Tibetan character is defined as all single characters and Tibetan stacked combination symbols including the base character, head letter, subjoined letter, and vowel. The Tibetan syllable བ ིགས in བ, ི, ག and ས are each one character, and the syllable is composed of four characters

Tibetan character segmentation and labeling model
Tibetan character segmentation and labeling algorithm flow
Experiments
Experiment and result analysis
Summary

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.