Speech recognition model for solution of website element management tasks

Oksana Strokan,Yulia Lytvyn

doi:10.32919/uesit.2018.02.06

Abstract

The article deals with the method of speech recognition, namely, the possibility of using this technology for the language control of website elements. Due to the widespread introduction of such technologies into human lives, the task is to create a voice application that would improve the usability. The feature of the proposed model is the implementation of speech recognition not in the service, as it happens in most cases, but in the device itself, using only a microphone. In the existing design users can easily add any commands. Language recognition is implemented on the website page using the JavaScript programming language. The script work is based on the use of the SpeechRecognition Web Speech APIs and the SpeechGrammarList API for Web Speech. The current direction of the use of speech in the process of interaction between the technical system and the user is the application of such technology for creating comfortable living conditions for people who have a violation of the musculoskeletal system and who have lost the opportunity to use traditional means and methods of dialogue with the system. It analyzed the basic principles of the website and the ability to control it using voice control. To operate the proposed speech recognition model, two interfaces are used to solve the problem of managing elements of a website: SpeechRecognition Web Speech API and SpeechGrammarList API Web Speech. In order to manage the elements of the website, a model is proposed, the implementation of which is possible through the use of a microphone on the user's desktop only. The feature of the proposed model is that it is easy to add any commands to an already existing structure. Such application provides a great perspective for building new web interfaces in combination with artificial intelligence.

Highlights

Яскравим прикладом засобів, які реалізують голосовий пошук, є продукти від компанії Google та голосовий асистент Siri від компанії Apple, які підтверджують необхідність розроблення і впровадження нових мовленнєвих технологій у життя сучасної людини
Створити новий клас recognition можна так: var recognition = new SpeechRecognition(); Інтерфейс SpeechGrammarList API Web Speech має структуру списку об’єктів SpeechGrammar, які містять слова та фрази, які необхідно розпізнати
Розпізнавання мовлення почнеться за допомогою функції: recognition.start(); Для отримання даних після розпізнання мови необхідно творити подію наступним чином: recognition.onresult = function(event) { //дії з отриманими даними

Summary

РЕЗУЛЬТАТИ ДОСЛІДЖЕННЯ

Використання голосового інтерфейсу у повсякденному житті людини набуває широкого розповсюдження. Для розпізнання голосового мовлення у веб-браузері існує багато сервісів, серед яких: Google, Mozilla Web Speech API, Siri та багато інших. В основі актуальної версії голосового пошуку Google лежить покращений алгоритм для навчання нейронних мереж, створений спеціально для аналізу і розпізнавання акустичних моделей (Klass, 1962). Тобто звичайний алгоритм покращений за допомогою нейронних мереж, він дає більш точне розпізнання голосового мовлення, але при цьому втрачає швидкість розпізнання. З’являється повідомлення, яке запитує у користувача дозвіл на використання його пристроїв для роботи на цьому вебсайті. У методі, який пропонується, розпізнання голосового мовлення реалізовано на сторінці веб-сайту за допомогою мови програмування Javascript. Який при натисканні на іконку з мікрофоном на робочому столі користувача починає записувати мовну команду. Це дозволить підвищити точність та швидкість розпізнавання саме для конкретного набору слів та дає можливість використання алгоритму на будь-якому веб-сайті. Було вирішено використовувати відкриту технологію розпізнавання мовлення Mozilla Web Speech API (рис. 1)

Локальний мовний сервіс

ВИСНОВКИ ТА ПЕРСПЕКТИВИ ПОДАЛЬШИХ ДОСЛІДЖЕНЬ

СПИСОК ВИКОРИСТАНИХ ДЖЕРЕЛ

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech recognition model for solution of website element management tasks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Ukrainian Journal of Educational Studies and Information Technology

Lead the way for us

Journal: Ukrainian Journal of Educational Studies and Information Technology	Publication Date: Jun 30, 2018
License type: cc-by

Similar Papers

Study of Artificial Intelligence Flight Co-Pilot Speech Recognition Technology
Lin Wei ... Yelu Liu
-
Lin Wei, et. al.Lin Wei ... Yelu Liu
14 Oct 2020
14 Oct 2020

Adversarial Attack and Defense for Commercial Black-box Chinese-English Speech Recognition Systems
Xuejing Yuan ... Xinqi Ling
ACM Transactions on Privacy and Security | VOL. -
Xuejing Yuan, et. al.Xuejing Yuan ... Xinqi Ling
07 Nov 2024
ACM Transactions on Privacy and Security | VOL. -

OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
...
-
, et. al. ...
21 Oct 2021
21 Oct 2021

Research on dialect speech recognition based on DenseNet-CTC
-
Academic Journal of Computing & Information Science | VOL. 6
--
01 Jan 2023
Academic Journal of Computing & Information Science | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech recognition model for solution of website element management tasks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Ukrainian Journal of Educational Studies and Information Technology