MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement

Rui Cheng,Zihao Cui,Changchun Bao

doi:10.3390/app10041484

Rui Cheng, Zihao Cui + Show 1 more

Open Access

https://doi.org/10.3390/app10041484

Copy DOI

Journal: Applied Sciences	Publication Date: Feb 21, 2020
Citations: 23	License type: CC BY 4.0

Affiliation: Beijing University of Technology

Abstract

Multi-channel speech coding and enhancement is an indispensable technology in speech communication. In order to verify the effectiveness of multi-channel speech coding and enhancement methods in the research and development, a microphone array speech simulator (MASS) used in room acoustic environment is proposed. The proposed MASS is the improvement and extension of the existing multi-channel speech simulator. It aims to simulate clean speech, noisy speech, clean speech with reverberation, noisy speech with reverberation, and noise signals by microphone array used for multi-channel coding and enhancement of speech signal in room acoustic environment. The experimental results of the multi-channel speech coding and enhancement prove that the MASS could well simulate the signals used in real room acoustic environment and can be applied to the research of the related fields.

Highlights

With the rapid development of signal processing and deep learning technology, more and more speech signal processing algorithms are proposed, which greatly promotes the progress of the speech processing, especially for multi-channel speech signal processing technology, such as multi-channel speech coding and multi-channel speech enhancement, because it can use the spatial information of speech signal, so as to obtain better processing effect than single-channel methods [1]
Microphone Array Speech Simulator is an improvement and extension of the existing pyroomacoustics, which exploits object-oriented of Python to createofathe clean and pyroomacoustics, intuitive application
The noisy speech microphone array with reverberation simulated bysimulated the three methods is used for training a convolutional dataset of microphone array with reverberation by the three methods is used for training neural network (CNN)-based multi-channel speech enhancement method

Summary

Introduction

With the rapid development of signal processing and deep learning technology, more and more speech signal processing algorithms are proposed, which greatly promotes the progress of the speech processing, especially for multi-channel speech signal processing technology, such as multi-channel speech coding and multi-channel speech enhancement, because it can use the spatial information of speech signal, so as to obtain better processing effect than single-channel methods [1]. Zhang et al built an articulatory dataset specifying in Chinese Mandarin [9] and investigated its efficacy in speech animation, and the dataset was created by Carstens EMA AG501 device This real multi-channel speech data can be only used for specific structures but not for the simulation in any acoustic environment. Spatialized Multi-Speaker Wall Street Journal (SMS-WSJ) [13] proposed by Drude et al is a multi-channel dataset of overlapping speech for training, evaluation, and the detailed analysis of source separation and extraction It has a high degree of randomness w.r.t. room size, array center, and rotation, as well as speaker position. Through the analysis of isotropic spherically diffuse noise and set the corresponding signal-to-noise ratio (SNR) to better of the simulated speech room and the experiments on the multi-channel speech coding simulate real lifemicrophone scenes, sucharray as meeting acoustic environment.

Microphone

Simulation of Microphone Array Speech in Room Acoustic Environment

The meeting room acoustic

Analysis of the Simulated

Methods

Simulation of Microphone Array Speech Signals

Application in Multi-Channel Speech Coding

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Speech enhancement based on a modified spectral subtraction method
Md. T. Islam ... S.A. Fattah
-
Md. T. Islam, et. al.Md. T. Islam ... S.A. Fattah
01 Aug 2014
01 Aug 2014

Multichannel Speech Enhancement by Raw Waveform-Mapping Using Fully Convolutional Networks
Chang-Le Liu ... Jen-Wei Huang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
Chang-Le Liu, et. al.Chang-Le Liu ... Jen-Wei Huang
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28

Bayesian Multichannel Speech Enhancement with a Deep Speech Prior
Kouhei Sekiguchi ... Kazuyoshi Yoshii
-
Kouhei Sekiguchi, et. al.Kouhei Sekiguchi ... Kazuyoshi Yoshii
01 Nov 2018
01 Nov 2018

Supervised Single Channel Speech Enhancement Based on Dual-Tree Complex Wavelet Transforms and Nonnegative Matrix Factorization Using the Joint Learning Process and Subband Smooth Ratio Mask
Md Shohidul Islam ... Tarek Hasan Al Mahmud
Electronics | VOL. 8
Md Shohidul Islam, et. al.Md Shohidul Islam ... Tarek Hasan Al Mahmud
22 Mar 2019
Electronics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences