Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration

Yanxiong Li, Yuhan Zhang, Wucheng Wang, Mingle Liu, Qianhua He

doi:10.1109/tmm.2019.2947199

Abstract

Acoustic scene classification has recently received considerable attention in the audio signal processing community. In contrast, few studies have been conducted on acoustic scene clustering, which is a newly emerging problem. Acoustic scene clustering aims at merging the audio recordings of the same class of acoustic scene into a single cluster without using prior information or training classifiers. In this study, we propose a method for acoustic scene clustering that jointly optimizes the procedures of feature learning and clustering iteration. In the proposed method, the learned feature is a deep embedding extracted from a deep convolutional neural network (CNN), and the clustering algorithm is agglomerative hierarchical clustering (AHC). We formulate a unified loss function for integrating and optimizing these two procedures. Various features and methods are compared. The experimental results demonstrate that the proposed method outperforms other unsupervised methods in terms of normalized mutual information and clustering accuracy. In addition, the deep embedding outperforms many state-of-the-art features.
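To make the clustering step and the two evaluation metrics concrete, the following is a minimal sketch, not the authors' implementation. It does not reproduce the CNN or the unified loss. It runs a plain average-linkage AHC on toy 2-D points that stand in for deep embeddings, then scores the result with normalized mutual information and clustering accuracy. The function names (`ahc`, `nmi`, `clustering_accuracy`), the toy data, the average-linkage choice, and the geometric-mean normalization of NMI are all assumptions made for illustration.

```python
import math
from collections import Counter
from itertools import permutations


def ahc(points, n_clusters):
    """Average-linkage agglomerative clustering; returns one label per point."""
    clusters = [[i] for i in range(len(points))]  # start with singletons

    def linkage(a, b):
        # Mean Euclidean distance over all cross-cluster pairs.
        total = sum(math.dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters (i < j) and merge j into i.
        _, i, j = min(
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        merged = clusters.pop(j)
        clusters[i].extend(merged)

    labels = [0] * len(points)
    for k, members in enumerate(clusters):
        for idx in members:
            labels[idx] = k
    return labels


def entropy(labels):
    n = len(labels)
    return -sum(c / n * math.log(c / n) for c in Counter(labels).values())


def nmi(truth, pred):
    """Mutual information divided by sqrt(H(truth) * H(pred)) (assumed variant)."""
    n = len(truth)
    ct, cp = Counter(truth), Counter(pred)
    joint = Counter(zip(truth, pred))
    mi = sum(
        c / n * math.log(c * n / (ct[t] * cp[p])) for (t, p), c in joint.items()
    )
    denom = math.sqrt(entropy(truth) * entropy(pred))
    # Convention chosen here: return 0.0 when either labeling has zero entropy.
    return mi / denom if denom > 0 else 0.0


def clustering_accuracy(truth, pred):
    """Best one-to-one cluster-to-class mapping, found by brute force.

    Only practical for a handful of clusters, and assumes there are no more
    clusters than classes; a Hungarian solver would be used at real scale.
    """
    classes, clusters = sorted(set(truth)), sorted(set(pred))
    best = 0
    for perm in permutations(classes, len(clusters)):
        mapping = dict(zip(clusters, perm))
        best = max(best, sum(mapping[p] == t for t, p in zip(truth, pred)))
    return best / len(truth)


# Toy stand-ins for embeddings of six recordings from two scenes.
points = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2), (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
truth = ["park", "park", "park", "bus", "bus", "bus"]
pred = ahc(points, n_clusters=2)
print(pred, nmi(truth, pred), clustering_accuracy(truth, pred))
```

Both metrics are invariant to how the clusters are numbered, which is why they suit an unsupervised setting where cluster indices carry no meaning.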


Journal: IEEE Transactions on Multimedia
Publication Date: Oct 25, 2019
Citations: 44

Similar Papers

Acoustic Scene Classification Using Aggregation of Two-Scale Deep Embeddings
Ho Ka Chon ... Qisheng Huang
13 Oct 2021

Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning
Noriyuki Tonami ... Keisuke Imoto
IEICE Transactions on Information and Systems | VOL. E104.D
16 Oct 2020

A hybrid approach with multi-channel i-vectors and convolutional neural networks for acoustic scene classification
Hamid Eghbal-Zadeh ... Bernhard Lehner
01 Aug 2017

Domestic Activities Clustering From Audio Recordings Using Convolutional Capsule Autoencoder Network
Ziheng Lin ... Yanxiong Li
06 Jun 2021
