Abstract

An acoustic scene is inferred by detecting properties that combine diverse sounds and acoustic environments. This study aims to discover these properties effectively using multiple-instance learning (MIL). MIL, a weakly supervised learning approach, is a strategy that extracts instance vectors from the audio chunks composing an audio clip and uses these unlabeled instances to infer the scene corresponding to the input data. However, many studies have pointed out an underestimation problem in MIL. In this study, we propose an enhanced MIL framework better suited to acoustic scene classification (ASC) systems by defining instance-level labels and an instance-level loss to extract and cluster instances effectively. Furthermore, we design a lightweight convolutional neural network named FUSE, comprising frequency-side and temporal-side depthwise convolutional filters together with pointwise convolutional filters. Experimental results show that the confidence and proportion of positive instances increase significantly compared with vanilla MIL, overcoming the underestimation problem and raising classification accuracy above that of supervised learning. The proposed system achieves accuracies of 81.1%, 72.3%, and 58.3% on the TAU Urban Acoustic Scenes 2019, 2020 Mobile, and 2022 Mobile datasets, respectively, with 139 K parameters. In particular, it achieves the highest accuracy among systems with fewer than 1 M parameters on the TAU Urban Acoustic Scenes 2019 dataset.

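As a loose illustration of the MIL setup the abstract describes, the sketch below shows how instance-level scene probabilities extracted from audio chunks can be pooled into a clip-level prediction, and how instance-level pseudo-labels and a loss over confident positive instances could be attached. This is not the authors' exact formulation: the mean pooling, the 0.5 confidence threshold, and all shapes are assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# Hypothetical shapes: a clip is split into T chunks, each embedded and
# classified into C scene classes by some backbone (e.g., a FUSE-like CNN).
T, C = 8, 10
rng = np.random.default_rng(0)
instance_logits = rng.normal(size=(T, C))   # per-chunk (instance) logits

# Bag-level (clip-level) prediction: average the instance probabilities.
instance_probs = softmax(instance_logits)   # (T, C)
clip_probs = instance_probs.mean(axis=0)    # (C,)
clip_label = int(clip_probs.argmax())

# Instance-level pseudo-labels (assumed scheme): mark as positive the
# instances whose confidence in the clip's class clears a threshold, then
# apply an auxiliary cross-entropy term only to those positives, pulling
# them toward the clip label instead of letting them be underestimated.
positive = instance_probs[:, clip_label] > 0.5
if positive.any():
    instance_loss = -np.log(instance_probs[positive, clip_label] + 1e-9).mean()
else:
    instance_loss = 0.0
```

Under vanilla MIL only the bag-level prediction receives supervision, so individual instances can remain low-confidence; attaching an instance-level term as sketched above is one way to raise the confidence and proportion of positive instances, which is the underestimation problem the paper targets.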