SemanticAC: Semantics-Assisted Framework for Audio Classification

Yicheng Xiao,Xiu Li,Ran Liao,Hantao Zhou,Shuyan Li,Yue Ma

doi:10.1109/icassp49357.2023.10096319

Yicheng Xiao, Xiu Li + Show 4 more

Open Access

PDF Available

https://doi.org/10.1109/icassp49357.2023.10096319

Copy DOI

Export

Save

Cite

Publication Date: Jun 4, 2023

Citations: 5

Affiliation: Tsinghua University

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

In this paper, we propose SemanticAC, a semantics-assisted framework for Audio Classification to better leverage the semantic information. Unlike conventional audio classification methods that treat class labels as discrete vectors, we employ a language model to extract abundant semantics from labels and optimize the semantic consistency between audio signals and their labels. We verify that simple textual information from labels and advanced pretraining models enable more abundant semantic supervision for better performance. Specifically, we design a text encoder to capture the semantic information from the text extension of labels. Then we map the audio signals to align with the semantics of corresponding class labels via an audio encoder and a similarity calculation module so as to enforce the semantic consistency. Extensive experiments on two audio datasets, ESC-50 and US8K demonstrate that our proposed method consistently outperforms the compared audio classification methods.

Full Text