Abstract
In intelligent manufacturing, machines with speech separation capability can markedly improve the efficiency of human-computer interaction, which benefits the rapid development of the intelligent manufacturing industry. In deep-learning-based single-channel speech separation, time-domain features outperform frequency-domain features. However, current time-domain methods are not robust in real noise environments, and time-domain features alone limit the performance of the separation model. We therefore propose a Time-and-Frequency fusion model based on multi-scale convolution (Tff-MscNet), which integrates time-domain and frequency-domain features to enrich the multidimensional information of the data. To further improve the performance of the separation network, we introduce a multi-scale convolution block that strengthens the network's feature extraction ability. We compare against the Conv-TasNet baseline and the latest time-frequency fusion speech separation baseline on the GRID speech dataset. Experiments show that the proposed method substantially improves both separation performance and robustness in environments with real noise.
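The abstract does not specify the internals of the multi-scale convolution block, but the general idea can be illustrated with a minimal NumPy sketch: the same waveform is convolved in parallel with kernels of several sizes (the kernel sizes, random weights, and function names below are illustrative assumptions, not the paper's actual architecture), and the branch outputs are stacked as feature channels.

```python
import numpy as np

def conv1d_same(signal, kernel):
    """'Same'-padded 1-D correlation of a waveform with a kernel."""
    k = len(kernel)
    pad_left = k // 2
    padded = np.pad(signal, (pad_left, k - 1 - pad_left))
    return np.array([np.dot(padded[i:i + k], kernel)
                     for i in range(len(signal))])

def multiscale_block(signal, kernel_sizes=(3, 5, 9), seed=0):
    """Illustrative multi-scale convolution: parallel branches with
    different receptive fields, stacked along a channel axis.
    Kernel weights are random placeholders for learned filters."""
    rng = np.random.default_rng(seed)
    branches = [conv1d_same(signal, rng.standard_normal(k))
                for k in kernel_sizes]
    return np.stack(branches)  # shape: (num_scales, len(signal))

# Toy waveform standing in for a speech frame.
x = np.sin(np.linspace(0, 4 * np.pi, 64))
feats = multiscale_block(x)
print(feats.shape)  # (3, 64)
```

The differing kernel sizes give each branch a different receptive field, so short transients and longer spectral patterns are captured simultaneously; in a trained network the random kernels would be learned filters.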