Turkish Speech Recognition Software with Adaptable Language Model

Osman Buyuk,Ali Haznedaroglu,Levent M Arslan

doi:10.1109/siu.2007.4298561

Abstract

Turkish speech recognition studies have been accelerated recently. With these efforts, not only available speech and text corpus which can be used in recognition experiments but also proposed new methods to improve accuracy has increased. Agglutinative nature of Turkish causes out of vocabulary (OOV) problem in Large Vocabulary Continuous Speech Recognition (LVCSR) tasks. In order to overcome OOV problem, usage of sub-word units has been proposed. In addition to LVCSR experiments, there have been some efforts to implement a speech recognizer in limited domains such as radiology. In this paper, we will present Turkish speech recognition software, which has been developed by utilizing recent studies. Both interface of software and recognition accuracies in two different test sets will be summarized. The performance of software has been evaluated using radiology and large vocabulary test sets. In order to solve OOV problem practically, we propose to adapt language models using frequent words or sentences. In recognition experiments, 90% and 44% word accuracies have been achieved in radiology and large vocabulary test sets respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Turkish Speech Recognition Software with Adaptable Language Model

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
Tara N Sainath ... David Nahamoo
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19
Tara N Sainath, et. al.Tara N Sainath ... David Nahamoo
01 Nov 2011
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19

Using ASR Methods for OCR
Ashish Arora ... Vimal Manohar
-
Ashish Arora, et. al.Ashish Arora ... Vimal Manohar
01 Sep 2019
01 Sep 2019

Issues in developing LVCSR System for Dravidian Languages: An Exhaustive Case Study for Tamil
Bharadwaja Kumarg ... Melvin Jose Johnson Premkumar
International Journal of Computer Applications | VOL. 70
Bharadwaja Kumarg, et. al.Bharadwaja Kumarg ... Melvin Jose Johnson Premkumar
31 May 2013
International Journal of Computer Applications | VOL. 70

Recognition of speech from live sports coverage using acoustic and language model adaptation
Yasuo Ariki ... Masakiyo Fujimoto
Systems and Computers in Japan | VOL. 36
Yasuo Ariki, et. al.Yasuo Ariki ... Masakiyo Fujimoto
01 Jan 2004
Systems and Computers in Japan | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Turkish Speech Recognition Software with Adaptable Language Model

Abstract

Talk to us

Similar Papers