Abstract

This paper is aimed to implement the speech to text conversion system for Myanmar alphabet. The Myanmar alphabet consists of 33 characters from ?ka? to ?ah?. The proposed system is software architecture which allows the user to speak against the computer in Myanmar language and the corresponding character is printed on the screen in the Microsoft Office Word Document Format. The system is emphasized on Speaker Independent Isolated Word Recognition System. The proposed system directly acquires and converts speech to text. This system contains two main modules: feature extraction and feature matching. Mel Frequency Cepstrum Coefficients (MFCC) is applied for feature extraction which extracts a small amount of data from the voice signal that can later be used to represent each character. Feature matching involves the actual procedure to identify the unknown character by comparing extracted features from the voice inputs of a set of known characters. In this system, Vector Quantization (VQ) approach using Linde, Buzo and Gray (LBG) clustering algorithm, which reduces the amount of data and complexity, is applied for feature matching. To implement this system MATLAB programming language is used.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call