Fractional Lower-order Statistics for Yangzhou Dialectal Speech Recognition

Huimin Lu,Shiyuan Yang,Seiichi Serikawa,Yujie Li,Xuelong Hu

doi:10.12792/iciae2015.068

Abstract

With the appearances of information time based on digital techniques and methods, people often concert on many kinds of machines in order to receive, transact and transfer information. As computers are wildly used, it is becoming true that the natural communication between people and machines without using keyboard or mouse, which is the goal by people for a long time. Multimedia era requests speech recognition system to put into practical from laboratory. Isolated word speech recognition system will bring advantages for people in daily life. However, because of the ambient noise, such as Gaussian noise and non-Gaussian noise, the product capability of isolated word speech recognition system is hard to gain a good demand. Even the isolated word recognition systems are quite mature, there are lots of problems existed and many fields need to be improved. This paper focuses on the problems of isolated word speech recognition systems as follows: 1) The problem of Pre-treatment in noisy environment. Generally, researchers consider the Gaussian Noise, but usually in our life the non-Gaussian noise are not neglected. Then we can do a good endpoint. Studies showed that a speech system utilizing an isolated word recognizer, more than 50% of error rate was credited to the endpoint detector. 2) The problem of Yangzhou dialectal. To do speech recognition of Yangzhou language by way of phonetic introduction and to establish common-used model is practical for information-exchange between dialects and speech recognition.

Highlights

The world has a variety of natural languages, and most of natural languages contain many dialects
As the developed of the information communication technologies (ICTs), many speech recognition software and synthesis systems were designed in the last decade
In this paper, we will concern about this need and propose a novel isolated words recognition system with non-Gaussian noise

Summary

Introduction

The world has a variety of natural languages, and most of natural languages contain many dialects. As the developed of the information communication technologies (ICTs), many speech recognition software and synthesis systems were designed in the last decade. Most of these are designed for large population speakers. The less scale population of language or dialect speech recognition systems and the speech recognition models are rarely. If we take the speech input method to identify Yangzhou dialect and to establish the speech models, can automatic exchange information between dialects. It will make the speech recognition technology for practical usage.

Endpoint Detection

Short-time Average Energy and Zero-Crossing Rate

Linear Prediction Coding

Dynamic Time Wrapping

Summarize and Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fractional Lower-order Statistics for Yangzhou Dialectal Speech Recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An Alternative sEMG based Isolated Word Subvocal Speech Recognition System based on Interpolation Functions
Meng Yang ... Ming Zhang
-
Meng Yang, et. al.Meng Yang ... Ming Zhang
01 Oct 2020
01 Oct 2020

Noise adaptation in a hidden Markov model speech recognition system
Dirk Van Compernolle
Computer Speech & Language | VOL. 3
Dirk Van CompernolleDirk Van Compernolle
01 Apr 1989
Computer Speech & Language | VOL. 3

Audio Visual Technique for Enhancing the Isolated Word Speech Recognition System
...
International Journal of Advanced Research in Computer Science | VOL. 8
, et. al. ...
30 Apr 2017
International Journal of Advanced Research in Computer Science | VOL. 8

Robust speech parameter based on a psychoacoustical rule-base model
Syunji Nishimura ... Yoshifumi Chisaki
The Journal of the Acoustical Society of America | VOL. 100
Syunji Nishimura, et. al.Syunji Nishimura ... Yoshifumi Chisaki
01 Oct 1996
The Journal of the Acoustical Society of America | VOL. 100

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fractional Lower-order Statistics for Yangzhou Dialectal Speech Recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers