Speech separation is crucial for effective speech processing in multi-talker conditions, especially in real-time, low-latency applications. In this study, the Time-Domain Audio Separation Network (TasNet) and the Dual-Path Recurrent Neural Network (DPRNN) are used for time-domain multi-speaker speech separation. Conventional recurrent neural networks (RNNs) cannot accurately model long sequences, and one-dimensional convolutional neural networks (1-D CNNs) cannot perform utterance-level sequence modeling when the sequence length exceeds their receptive field. DPRNN splits the long sequential input into smaller chunks and iteratively applies intra-chunk and inter-chunk operations, so that the input length of each operation is proportional to the square root of the original sequence length. The resulting model is more efficient than earlier systems and improves performance on the LibriMix dataset. The experiments show that DPRNN with sample-level, time-domain separation can replace existing methods, and that EEND-SS and other separation algorithms perform worse than DPRNN. The proposed model achieves an SI-SDR of 12.376, a STOI (short-time objective intelligibility) of 0.969, an SDR of 12.363, a DER of 9.363, and an SCA of 97.193.
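The dual-path chunking described above can be illustrated with a short PyTorch sketch. This is a minimal, illustrative example rather than the configuration used in this work: the names (`DualPathBlock`, `segment`), the feature and hidden dimensions, and the chunk-size heuristic are assumptions chosen for clarity.

```python
# Minimal sketch (not the paper's implementation) of DPRNN-style dual-path
# processing: split a long sequence into ~sqrt-length chunks, then alternate
# an intra-chunk RNN (within each chunk) and an inter-chunk RNN (across chunks).
import math
import torch
import torch.nn as nn


class DualPathBlock(nn.Module):
    """One dual-path block: intra-chunk BLSTM followed by inter-chunk BLSTM."""

    def __init__(self, feature_dim: int, hidden_dim: int):
        super().__init__()
        self.intra_rnn = nn.LSTM(feature_dim, hidden_dim, batch_first=True,
                                 bidirectional=True)
        self.intra_proj = nn.Linear(2 * hidden_dim, feature_dim)
        self.inter_rnn = nn.LSTM(feature_dim, hidden_dim, batch_first=True,
                                 bidirectional=True)
        self.inter_proj = nn.Linear(2 * hidden_dim, feature_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_chunks, chunk_len, feature_dim)
        b, s, k, d = x.shape

        # Intra-chunk pass: each chunk is processed as an independent short sequence.
        intra = x.reshape(b * s, k, d)
        intra, _ = self.intra_rnn(intra)
        x = x + self.intra_proj(intra).reshape(b, s, k, d)  # residual connection

        # Inter-chunk pass: sequences are formed across chunks at each position.
        inter = x.permute(0, 2, 1, 3).reshape(b * k, s, d)
        inter, _ = self.inter_rnn(inter)
        inter = self.inter_proj(inter).reshape(b, k, s, d).permute(0, 2, 1, 3)
        return x + inter  # residual connection


def segment(features: torch.Tensor, chunk_len: int) -> torch.Tensor:
    """Split (batch, length, feature_dim) into 50%-overlapping chunks."""
    b, length, d = features.shape
    hop = chunk_len // 2
    pad = (hop - length % hop) % hop + hop
    features = torch.nn.functional.pad(features, (0, 0, hop, pad))
    chunks = features.unfold(1, chunk_len, hop)      # (b, num_chunks, d, chunk_len)
    return chunks.permute(0, 1, 3, 2).contiguous()   # (b, num_chunks, chunk_len, d)


if __name__ == "__main__":
    torch.manual_seed(0)
    encoded = torch.randn(2, 4000, 64)               # (batch, frames, feature_dim)
    chunk_len = int(math.sqrt(2 * encoded.shape[1])) # chunk length ~ sqrt of sequence length
    chunk_len += chunk_len % 2                       # keep even for 50% overlap
    chunks = segment(encoded, chunk_len)
    block = DualPathBlock(feature_dim=64, hidden_dim=128)
    out = block(chunks)
    print(chunks.shape, out.shape)
```

Because each intra-chunk sequence has length ~sqrt(L) and the inter-chunk sequence has a comparable number of chunks, both RNN passes operate on inputs far shorter than the original sequence, which is the efficiency gain the abstract refers to.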