Data Collection and Development of Bengali ASR and TTS for Conversational AI-based Automated Advisories in the Agriculture domain

Soma Khan,Rajib Roy,Madhab Pal,Joyanta Basu,Tulika Basu,Milton S Bepari

doi:10.1109/aist55798.2022.10065005

Abstract

This paper presents an indigenous work of text and speech data collection and organization in the agriculture domain and developing a structured Agriculture Knowledge Repository (AKR), Automatic Speech Recognition (ASR), and Text-To-Speech synthesis (TTS) systems in Bengali language, applicable to agriculture-related automated advisory systems. Authors have searched available data sources, interacted with real farmers and experts at local farming fields and agriculture institutes, collected feedback on actual advisory requirements, and designed an initial text corpus which is then used to prepare 10,000 numbers of unique query-answer pairs with 5000 agri-keywords in the final AKR. The DNN-HMM based ASR is trained by merging general domain speech data with 40.5 hours of agriculture data newly collected using a mobile app, web-based and Interactive Voice Response (IVR) based data collection setups. TTS is developed in one male and one female voice with 20 hours of studio speech data using DNN-based architecture. Both the ASR and TTS are tested with end users in real environments and are having encouraging results. Corpus collection and system development methodology is language invariant and developed sub-systems can be readily used for web or mobile chatbot-based and IVR-based automated advisory applications in the agriculture domain.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data Collection and Development of Bengali ASR and TTS for Conversational AI-based Automated Advisories in the Agriculture domain

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework
Johanes Effendi ... Satoshi Nakamura
-
Johanes Effendi, et. al.Johanes Effendi ... Satoshi Nakamura
25 Oct 2020
25 Oct 2020

Speech-Based Access of Agricultural Commodity Prices and Weather Information in the Kannada Language and Its Dialects
Thimmaraja Yadava Gopalappa ... Nagaraja Benageri Gnaneshwara
-
Thimmaraja Yadava Gopalappa, et. al.Thimmaraja Yadava Gopalappa ... Nagaraja Benageri Gnaneshwara
22 Feb 2023
22 Feb 2023

End-to-end Feedback Loss in Speech Chain Framework via Straight-through Estimator
Andros Tjandra ... Sakriani Sakti
-
Andros Tjandra, et. al.Andros Tjandra ... Sakriani Sakti
01 May 2019
01 May 2019

Marathi Interactive Voice Response System (IVRS) using MFCC and DTW
Bharti W ... S.C Mehrotra
International Journal of Computer Applications | VOL. 125
Bharti W, et. al.Bharti W ... S.C Mehrotra
17 Sep 2015
International Journal of Computer Applications | VOL. 125

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data Collection and Development of Bengali ASR and TTS for Conversational AI-based Automated Advisories in the Agriculture domain

Abstract

Talk to us

Similar Papers