Abstract

With the arrival of the information age, built on digital techniques and methods, people rely on many kinds of machines to receive, process, and transfer information. As computers become widely used, natural communication between people and machines without a keyboard or mouse, a long-standing goal, is becoming a reality. The multimedia era demands that speech recognition systems move from the laboratory into practical use. Isolated word speech recognition systems can benefit people in daily life. However, because of ambient noise, both Gaussian and non-Gaussian, the practical performance of such systems often falls short of requirements. Even though isolated word recognition systems are quite mature, many problems remain and many aspects need improvement. This paper focuses on two problems of isolated word speech recognition systems: 1) Preprocessing in noisy environments. Researchers generally consider only Gaussian noise, but in everyday life non-Gaussian noise cannot be neglected. Accounting for it makes good endpoint detection possible. Studies have shown that, in a speech system using an isolated word recognizer, more than 50% of the errors are attributable to the endpoint detector. 2) Recognition of the Yangzhou dialect. Recognizing Yangzhou speech by means of speech input and establishing a commonly used model is practical for information exchange between dialects and for speech recognition.

Highlights

  • The world has a variety of natural languages, and most natural languages contain many dialects

  • With the development of information and communication technologies (ICTs), many speech recognition and synthesis systems were designed in the last decade

  • In this paper, we address this need and propose a novel isolated word recognition system for environments with non-Gaussian noise

Summary

Introduction

The world has a variety of natural languages, and most natural languages contain many dialects. With the development of information and communication technologies (ICTs), many speech recognition and synthesis systems were designed in the last decade. Most of these are designed for languages with large speaker populations. Speech recognition systems and models for languages or dialects with smaller speaker populations are rare. If speech input is used to recognize the Yangzhou dialect and to build speech models, information can be exchanged automatically between dialects. This would make speech recognition technology more practical.

Endpoint Detection
Short-time Average Energy and Zero-Crossing Rate
Linear Prediction Coding
Dynamic Time Warping
Summary and Conclusions
