Environmental Sound Extraction Using Onomatopoeic Words

Yuki Okamoto,Masaaki Yamamoto,Keisuke Imoto,Shota Horiguchi,Yohei Kawaguchi

doi:10.1109/icassp43922.2022.9747835

Environmental Sound Extraction Using Onomatopoeic Words

Yuki Okamoto, Masaaki Yamamoto + Show 3 more

Open Access

https://doi.org/10.1109/icassp43922.2022.9747835

Copy DOI

Publication Date: May 23, 2022

Citations: 3

Affiliation: Ritsumeikan University, Hitachi (Japan), Doshisha University

#Onomatopoeic Word #U-Net Architecture + Show 5 more

Abstract
Full-Text PDF
Similar Papers

Abstract

An onomatopoeic word, which is a character sequence that phonetically imitates a sound, is effective in expressing characteristics of sound such as duration, pitch, and timbre. We propose an environmental-sound-extraction method using onomatopoeic words to specify the target sound to be extracted. By this method, we estimate a time-frequency mask from an input mixture spectrogram and an onomatopoeic word using a U-Net architecture, then extract the corresponding target sound by masking the spectrogram. Experimental results indicate that the proposed method can extract only the target sound corresponding to the onomatopoeic word and performs better than conventional methods that use sound-event classes to specify the target sound.

Full Text