Abstract

Language Identification (LID) is a method by which the language is identified from an utterance communicated by an unidentified person. Present work develops an effective baseline method with the Gaussian Mixture Model (GMM) and Mel Frequency Cepstral Coefficients (MFCC) for language identification and the performance of the LID system is evaluated for an unknown speaker. The LID framework is designed utilizing a user-defined database in 4 different languages Tamil, Malayalam, Hindi, and English. This work is based on some optimization approaches such as Minimum Mean Square Error (MMSE) and Spectral Subtraction (SS) to improve the LID performance with background noise. Moreover, the LID system performance is also investigated by changing the size of data used to train the system, duration of test data as well as the amount of noise.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.