Use Of Speech Recognition Research Articles

Background. Information technologies evolve rapidly and keep changing how we learn, work, and live. One significant aspect of this evolution is the automation of various processes, including those in education. Students and teachers face large volumes of information that must be processed, stored, and used in the educational process. Taking lecture notes remains an important task that requires considerable time and effort, so automating it is both relevant and necessary. Automated lecture note-taking based on video and audio materials gives students and teachers quick access to structured information. Using speech recognition and artificial intelligence to create notes from lecture materials can save time, improve the quality and accuracy of notes, and make them accessible to all participants in the educational process. Such systems can also structure the notes by highlighting key points. This lets students focus on understanding the material rather than writing it down, which improves the quality of learning and knowledge acquisition.

Objective. The purpose of the paper is to simplify note-taking and improve its quality by developing a system for automated lecture note-taking based on video and audio materials that creates structured notes quickly and efficiently.

Methods. Analysis of literature and contemporary studies: studying scientific articles, monographs, and dissertations on automated lecture note-taking, speech recognition, and artificial intelligence. System analysis: defining system requirements and analysing possible approaches and tools for implementation. Experimental method: developing, implementing, and testing the system. Comparative analysis: evaluating the effectiveness of different speech recognition tools and AI models for creating notes. Modelling and prototyping: creating a system prototype, testing it, and improving it based on the results obtained.

Results. An effective Telegram bot was created that uses the "whisper-1" and "gpt-4" models to provide high-quality speech recognition and to generate structured notes from video and audio materials.

Conclusions. The developed system for automated lecture note-taking based on video and audio materials significantly simplifies the preparation of materials for students and teachers. Implementing the system as a Telegram bot ensures cross-platform accessibility and ease of use, and avoids the need to build additional web or mobile applications for a wide range of users. OpenAI's "whisper-1" model demonstrated high speech recognition accuracy, which improved the quality of transcriptions compared with other tools such as Vosk or FasterWhisper.
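The abstract compares speech recognition tools on accuracy but does not say which metric was used. A common choice for such comparisons is word error rate (WER): the word-level edit distance between a reference transcript and a tool's output, divided by the number of reference words. The following is a minimal sketch under that assumption; the function name `word_error_rate` and the sample transcripts are illustrative and do not come from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are lowercased and split on whitespace only;
    punctuation is not stripped, so "mat." and "mat" count as different words.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Made-up transcripts for illustration only.
    reference = "the lecture covers speech recognition"
    outputs = {
        "tool_a": "the lecture covers speech recognition",
        "tool_b": "the lecture cover speech recognition",
    }
    for name, text in outputs.items():
        print(name, word_error_rate(reference, text))
```

A lower WER means fewer word-level mistakes. Because the score is normalized by the reference length, it can exceed 1.0 when the hypothesis contains many extra words.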

Importance. Speech recognition (SR) is increasingly used directly by clinicians for electronic health record (EHR) documentation. Its usability and effect on quality and efficiency versus other documentation methods remain unclear.

Objective. To study usability and quality of documentation with SR versus typing.

Design. In this controlled observational study, each subject participated in two of five simulated outpatient scenarios. Sessions were recorded with Morae® usability software. Two notes were documented into the EHR per encounter (one dictated, one typed) in randomized order. Participants were interviewed about each method’s perceived advantages and disadvantages. Demographics and documentation habits were collected via survey. Data collection occurred between January 8 and February 8, 2019, and data analysis was conducted from February through September of 2019.

Setting. Brigham and Women’s Hospital, Boston, Massachusetts, USA.

Participants. Ten physicians who had used SR for at least six months.

Main outcomes and measures. Documentation time, word count, vocabulary size, number of errors, number of corrections and quality (clarity, completeness, concision, information sufficiency and prioritization).

Results. Dictated notes were longer than typed notes (320.6 vs. 180.8 words; p = 0.004) with more unique words (170.9 vs. 120.4; p = 0.01). Documentation time was similar between methods, with dictated notes taking slightly less time to complete than typed notes. Typed notes had more uncorrected errors per note than dictated notes (2.9 vs. 1.5), although most were minor misspellings. Dictated notes had a higher mean quality score (7.7 vs. 6.6; p = 0.04), were more complete and included more sufficient information.

Conclusions and relevance. Participants felt that SR saves them time, increases their efficiency and allows them to quickly document more relevant details. Quality analysis supports the perception that SR allows for more detailed notes, but whether dictation is objectively faster than typing remains unclear, and participants described some scenarios where typing is still preferred. Dictation can be effective for creating comprehensive documentation, especially when physicians like and feel comfortable using SR. Research is needed to further improve integration of SR with EHR systems and assess its impact on clinical practice, workflows, provider and patient experience, and costs.
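Two of the outcome measures above, word count and vocabulary size (the number of unique words), are simple to compute for a single note. The sketch below shows one way to do it. The abstract does not describe the study's counting procedure, so the tokenization rule (lowercased runs of ASCII letters, digits, and straight apostrophes) and the function name `note_stats` are assumptions made for illustration.

```python
import re


def note_stats(note: str) -> tuple[int, int]:
    """Return (word_count, vocabulary_size) for one clinical note.

    Assumption: a word is a run of ASCII letters, digits, or straight
    apostrophes, compared case-insensitively.
    """
    words = re.findall(r"[a-z0-9']+", note.lower())
    return len(words), len(set(words))


if __name__ == "__main__":
    # Made-up sentence for illustration only.
    count, vocabulary = note_stats("Patient reports pain. Pain is mild.")
    print(count, vocabulary)  # prints "6 5"
```

Averaging these two values over all dictated notes and all typed notes would give per-method means comparable in kind to those reported in the results.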

Articles published on Use Of Speech Recognition

AUTOMATED LECTURE NOTE-TAKING SYSTEM

Assessing pronunciation using dictation tools

Virtual Notes Assistant Using ML

Patient Satisfaction With Speech Recognition in the Exam Room: Exploratory Survey.

Novel Speech Recognition Systems Applied to Forensics within Child Exploitation: Wav2vec2.0 vs. Whisper.

Fine Motor Therapy Using Avatar-based AI for Autistic Children (Virtual Reality)

Towards Speech Recognition and Training Utilization in the Nuclear Power Main Control Room

Automatic Communication Error Detection Using Speech Recognition and Linguistic Analysis for Proactive Control of Loss of Separation

Speech Recognition Implementation Using MFCC and DTW Algorithm for Home Automation

Effect of Speech Recognition on Problem Solving and Recall in Consumer Digital Health Tasks: Controlled Laboratory Experiment.

Physician use of speech recognition versus typing in clinical documentation: A controlled observational study

Improving the Translation Environment for Professional Translators

Speech recognition for clinical documentation from 1990 to 2018: a systematic review.

Acoustic domain classification and recognition through ensemble based multilevel classification

Speech Recognition for Medical Dictation: Overview in Quebec and Systematic Review.

PP109 Use Of Speech Recognition In Medical Reports: A Systematic Review

An unsupervised deep domain adaptation approach for robust speech recognition

Naming practice app on tablets and communication robots for people with aphasia

From Translation Studies and audiovisual translation to media accessibility

HyperCode: Voice aided programming
