Abstract

This paper is aimed to implement the speech to text conversion system for Myanmar alphabet. The Myanmar alphabet consists of 33 characters from ?ka? to ?ah?. The proposed system is software architecture which allows the user to speak against the computer in Myanmar language and the corresponding character is printed on the screen in the Microsoft Office Word Document Format. The system is emphasized on Speaker Independent Isolated Word Recognition System. The proposed system directly acquires and converts speech to text. This system contains two main modules: feature extraction and feature matching. Mel Frequency Cepstrum Coefficients (MFCC) is applied for feature extraction which extracts a small amount of data from the voice signal that can later be used to represent each character. Feature matching involves the actual procedure to identify the unknown character by comparing extracted features from the voice inputs of a set of known characters. In this system, Vector Quantization (VQ) approach using Linde, Buzo and Gray (LBG) clustering algorithm, which reduces the amount of data and complexity, is applied for feature matching. To implement this system MATLAB programming language is used.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.