Voice Activity Detection Method Research Articles

Remote and rural telephone users are generally considered to generate less traffic than urban users. This makes rural areas less attractive for investment by telecommunications companies and service providers, and the cost of wiring a vast area for low telephone traffic leads most providers to ignore those regions. Still, telecommunications are known to be essential to the economic development of a region, and traffic increases rapidly as soon as service becomes available. A satellite-based telephone network can, in most cases, provide efficient long-distance telephone service to remote rural communities at a lower cost than land-based wired networks. Mobile satellite systems already provide this service, but they are limited in capacity and charge high per-minute fees for the satellite link. Small earth stations and GEO satellites can provide the service more efficiently and at lower cost. In addition, bandwidth-efficient multiplexing with compressed speech, Voice Activity Detection (VAD), and packet discarding methods can further reduce the cost of service for rural users. In this paper, a Statistical Time Division Multiplexer (STDM) architecture is simulated. Two packet discarding methods, random packet discarding and cyclic packet discarding, are used along with VAD to maximize bandwidth utilization. Results indicate that, for monologue speech sources with 80% activity at 6.4 kbps each on a 64 kbps channel, 12 users can be multiplexed instead of 9, giving a Digital Speech Interpolation (DSI) advantage of 1.33 with 3% packet loss. Furthermore, the cyclic packet discarding technique is observed to perform better than random packet discarding in terms of subjective quality.
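The DSI arithmetic and the two discarding policies in the abstract above can be illustrated with a small sketch. It rests on several assumptions that are not in the abstract: a bufferless, frame-level model in which each user is independently active with a fixed probability, a channel that carries at most `capacity` packets per frame, and a "cyclic" policy read as a rotating pointer over users. All function names are hypothetical. The model is far simpler than a full STDM simulation, so it is not expected to reproduce the reported 3% loss figure.

```python
import random
from math import comb


def dsi_advantage(multiplexed_users, baseline_users):
    """Ratio of users carried with interpolation to users carried without it."""
    return multiplexed_users / baseline_users


def expected_loss_fraction(n_users, activity, capacity):
    """Expected share of offered packets dropped per frame (binomial, no buffering)."""
    offered = n_users * activity
    excess = sum(
        (k - capacity) * comb(n_users, k) * activity**k * (1 - activity) ** (n_users - k)
        for k in range(capacity + 1, n_users + 1)
    )
    return excess / offered


def simulate(n_users, activity, capacity, frames, policy, seed=0):
    """Return per-user (offered, dropped) packet counts for one discarding policy."""
    activity_rng = random.Random(seed)    # drives who talks in each frame
    victim_rng = random.Random(seed + 1)  # used only by the random policy
    offered = [0] * n_users
    dropped = [0] * n_users
    pointer = 0  # next user to be penalised under the cyclic policy (assumption)
    for _ in range(frames):
        active = [u for u in range(n_users) if activity_rng.random() < activity]
        for u in active:
            offered[u] += 1
        excess = len(active) - capacity
        if excess <= 0:
            continue  # every active packet fits in this frame
        if policy == "random":
            victims = victim_rng.sample(active, excess)
        elif policy == "cyclic":
            # Visit active users in rotating order, starting at the pointer.
            order = sorted(active, key=lambda u: (u - pointer) % n_users)
            victims = order[:excess]
            pointer = (victims[-1] + 1) % n_users
        else:
            raise ValueError("policy must be 'random' or 'cyclic'")
        for u in victims:
            dropped[u] += 1
    return offered, dropped


if __name__ == "__main__":
    print(round(dsi_advantage(12, 9), 2))  # 12 multiplexed users versus 9
    for policy in ("random", "cyclic"):
        offered, dropped = simulate(12, 0.8, 9, 20000, policy)
        print(policy, sum(dropped) / sum(offered), max(dropped) - min(dropped))
```

In this toy model both policies discard the same total number of packets, because the overflow in each frame is fixed by the activity pattern. They differ only in how the loss is spread across users, which is one plausible reason a listener might rate them differently.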

This paper presents an efficient implementation of a speech-to-text converter for mobile applications. The main aim of the work is to formulate a system that gives optimum performance in terms of complexity, accuracy, delay, and memory requirements in a mobile environment. The speech-to-text converter consists of two stages: front-end analysis and pattern recognition. The proposed method uses effective techniques for voice activity detection in preprocessing, for feature extraction, and for the recognizer. The energy of the high-frequency part is considered separately, as the zero-crossing rate, to differentiate noise from speech. The RASTA-PLP feature extraction method is used, in which the RASTA filter suppresses spectral components that change more slowly or more quickly than the typical rate of change of speech, thus keeping unnecessary information out of the extracted features. A Generalized Regression Neural Network is used as the recognizer, with recognition performed at the syllable level, which reduces the memory requirement and complexity for mobile applications. Thus a small database containing all possible syllable pronunciations of the user is sufficient to give recognition accuracy close to 100%. A 50% reduction in delay and memory requirement is reported for the proposed system. The proposed technique therefore enables real-time speaker-dependent applications on devices such as mobile phones and PDAs.
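As a rough illustration of how energy and zero-crossing rate can be combined to separate speech from noise, here is a minimal frame-based sketch. It is a generic textbook-style detector, not the method of the paper above: the frame sizes, both thresholds, and all function names are hypothetical values chosen for the example.

```python
import math


def frame_signal(samples, frame_len, hop):
    """Split a sample list into overlapping frames of equal length."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def simple_vad(samples, frame_len=160, hop=80,
               energy_thresh=0.01, zcr_thresh=0.25):
    """Mark a frame as speech when it is energetic and not dominated by
    high-frequency content (thresholds are illustrative assumptions)."""
    decisions = []
    for frame in frame_signal(samples, frame_len, hop):
        energetic = short_time_energy(frame) > energy_thresh
        low_zcr = zero_crossing_rate(frame) < zcr_thresh
        decisions.append(energetic and low_zcr)
    return decisions


if __name__ == "__main__":
    rate = 8000
    # A 200 Hz tone stands in for voiced speech in this toy example.
    voiced = [0.5 * math.sin(2 * math.pi * 200 * n / rate) for n in range(800)]
    silence = [0.0] * 800
    # Sign flips on every sample: energetic but very high zero-crossing rate.
    hiss = [0.3 if n % 2 == 0 else -0.3 for n in range(800)]
    print(all(simple_vad(voiced)), any(simple_vad(silence)), any(simple_vad(hiss)))
```

In practice the thresholds would be adapted to the noise floor of the recording rather than fixed, and a hangover period is often added so that brief pauses inside a word are not cut.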

Articles published on Voice Activity Detection Method

Frame Selection for Robust Speaker Identification: A Hybrid Approach

Enhanced Speech Based Jointly Statistical Probability Distribution Function for Voice Activity Detection

Automatic classification of Furnariidae species from the Paranaense Littoral region using speech-related features and machine learning

Speaker diarization system using HXLPS and deep neural network

Applying the Bi-level HMM for Robust Voice-activity Detection

The Application of Extreme Learning Machine and Support Vector Machine in Speech Endpoint Detection

Kernel Method for Voice Activity Detection in the Presence of Transients

Voice Activity Detection Using Fuzzy Entropy and Support Vector Machine

Optimally weighted maximum a posteriori probabilities based on minimum classification error for dual-microphone voice activity detection

Maximising Bandwidth Efficiency of Statistical Multiplexer Architecture using Frame Dropping Methods

Voice activity detection based on facial movement

Dual-Microphone Voice Activity Detection Technique Based on Two-Step Power Level Difference Ratio

Sparse Representation with Optimized Learned Dictionary for Robust Voice Activity Detection

Acoustic Environment Identification and Its Applications to Audio Forensics

Multi Channel Voice Active Detection Using Instance Filed Auto-Interrelation Function

A Novel Speech to Text Converter System for Mobile Applications

Enhanced voice activity detection in kernel subspace domain

Signal Subspace-Based Voice Activity Detection Using a Generalized Gaussian Distribution

Statistical voice activity detection in kernel space

Voice activity detection based on conditional MAP criterion incorporating the spectral gradient
