Human Speech Recognition Research Articles

This work reports the development of highly sensitive Piezoresistive flexible strain sensor for human motion detection and speech recognition. Initially, a conductive polymer composite (CPC) solution comprising of thermoplastic polyurethane and Carbon Nanoparticles was prepared using Dimethylformamide as solvent and Chloroform as dispersant with the composition of 50% v/v. The solution was heated to a temperature of 60°C for evaporation of the solvent until it contained 13.5% w/v solvent for steady electrospinning. In this way, the CPC solution was used to develop Electrospun Nanofibrous Yarns (ENFYs) by applying a potential difference of 40 KV between the electrospinning needle and Aluminum collector. A cotton fabric was wrapped on the Aluminum collector to allow twisting of the deposited electrospun nanofibers. This novel collector configuration resulted in the formation of nanofibrous yarns due to the whirling action of the advancing jet of CPC solution. The cotton fabric on the collector facilitated twisting of fibers by allowing them to roll over the fabric. The fabricated ENFY sensors showed remarkable stretchability up to 102% strain while achieving a gauge factor of 70 at 100% strain. Long-term usage necessitates repeatability, which was demonstrated by cyclic loading at a crosshead speed of 40 mm/min for up to 1000 cycles using a custom-developed linear actuator, with no signs of fracture. ENFY strain sensor was attached to different parts of human body such as finger, fist, elbow, knee and ankle and was found capable of measuring and observing angle, position and frequency of motion. Owing to its ultrasensitive behavior, the developed sensor was able to measure heart rate as well. When the developed sensor was attached to Adam’s apple for speech recognition it showed remarkable response towards different utterances and breathing and gulping actions with clearly distinguishable signals. These results demonstrate that the developed novel ENFY flexible strain sensor can be employed for proprioceptive sensing and speech recognition for human-machine interaction, soft robotics and wearable devices etc.

Read full abstract

Interlingual Live Subtitling (ILS) is an innovative translation and accessibility method where a written text in one language is produced live from an oral source in another language. ILS can be provided through different methods, some of which involve the participation of one or more humans, whereas others are fully automatic. Speech-to-text interpreting (STTI) is a form of human-mediated ILS that is situated at the crossroads of audiovisual translation, media accessibility and simultaneous interpreting, as well as between human-mediated translation and automatic language processing systems. One of the most promising forms of STTI is interlingual respeaking. It builds upon intralingual respeaking (the most common form of speech-to-text captioning, which does not include language transfer) and involves the participation of a human interpreter plus speech recognition software. Although interlingual respeaking is in great demand, there are other approaches to STTI -with different degrees of human intervention- which are currently being used by broadcasters and conference organizers. The purpose of this research is to test the efficiency of five of those methods, namely, (1) interlingual respeaking, (2) simultaneous interpreting plus intralingual respeaking, (3) simultaneous interpreting plus automatic speech recognition, (4) intralingual respeaking plus machine translation and (5) automatic speech recognition plus machine translation. The results provide a useful insight into the current efficiency of five different ILS methods and strengthen the idea that efficiency is not restricted to accuracy, but includes factors such as delay and the type of resources (either human or machine) required. It is hoped that this research may help provide the industry with tools to make informed choices between different forms of ILS (at least for the language combination English-Spanish) while offering employment opportunities for simultaneous interpreters and respeakers in the digital era.

Read full abstract

Human Speech Recognition Research Articles

Related Topics

Articles published on Human Speech Recognition

Modelling Human Word Learning and Recognition Using Visually Grounded Speech

Improving College Assistance with the Help of Richer Human Computer Interaction and Speech Recognition

Electrospun nanofibrous yarn based piezoresistive flexible strain sensor for human motion detection and speech recognition

An empirical analysis on the efficiency of five interlingual live subtitling workflows

A Project Based Learning Approach for Improving Students' Computational Thinking Skills.

A model of speech recognition for hearing-impaired listeners based on deep learning

Review on human speech Recognition Techniques

Arabic speech recognition by end-to-end, modular systems and human

Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture

Psycho-acoustics inspired automatic speech recognition

Make Patient Consultation Warmer: A Clinical Application for Speech Emotion Recognition

Thoughts on the potential to compensate a hearing loss in noise

Thoughts on the potential to compensate a hearing loss in noise

Learning Waveform-Based Acoustic Models Using Deep Variational Convolutional Neural Networks

Evidence for automated scoring and shorter passages of CBM-R in early elementary school.

A NOVEL SPEECH RECOGNITION SYSTEM USING FUZZY NEURAL NETWORK

Speech Recognition Using Elman Artificial Neural Network and Linear Predictive Coding

The Recognition of Persian Phonemes Using PPNet.

EARSHOT: A Minimal Neural Network Model of Incremental Human Speech Recognition.

Optimisation of phonetic aware speech recognition through multi-objective evolutionary algorithms

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Human Speech Recognition Research Articles

Related Topics

Articles published on Human Speech Recognition

Modelling Human Word Learning and Recognition Using Visually Grounded Speech

Improving College Assistance with the Help of Richer Human Computer Interaction and Speech Recognition

Electrospun nanofibrous yarn based piezoresistive flexible strain sensor for human motion detection and speech recognition

An empirical analysis on the efficiency of five interlingual live subtitling workflows

A Project Based Learning Approach for Improving Students' Computational Thinking Skills.

A model of speech recognition for hearing-impaired listeners based on deep learning

Review on human speech Recognition Techniques

Arabic speech recognition by end-to-end, modular systems and human

Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture

Psycho-acoustics inspired automatic speech recognition

Make Patient Consultation Warmer: A Clinical Application for Speech Emotion Recognition

Thoughts on the potential to compensate a hearing loss in noise

Thoughts on the potential to compensate a hearing loss in noise

Learning Waveform-Based Acoustic Models Using Deep Variational Convolutional Neural Networks

Evidence for automated scoring and shorter passages of CBM-R in early elementary school.

A NOVEL SPEECH RECOGNITION SYSTEM USING FUZZY NEURAL NETWORK

Speech Recognition Using Elman Artificial Neural Network and Linear Predictive Coding

The Recognition of Persian Phonemes Using PPNet.

EARSHOT: A Minimal Neural Network Model of Incremental Human Speech Recognition.

Optimisation of phonetic aware speech recognition through multi-objective evolutionary algorithms