Audio Set Classification with Attention Model: A Probabilistic Perspective

Qiuqiang Kong,Wenwu Wang,Yong Xu,Mark D Plumbley

doi:10.1109/icassp.2018.8461392

Abstract

This paper investigates the Audio Set classification. Audio Set is a large scale weakly labelled dataset (WLD) of audio clips. In WLD only the presence of a label is known, without knowing the happening time of the labels. We propose an attention model to solve this WLD problem and explain the attention model from a novel probabilistic perspective. Each audio clip in Audio Set consists of a collection of features. We call each feature as an instance and the collection as a bag following the terminology in multiple instance learning. In the attention model, each instance in the bag has a trainable probability measure for each class. The classification of the bag is the expectation of the classification output of the instances in the bag with respect to the learned probability measure. Experiments show that the proposed attention model achieves a mAP of 0.327 on Audio Set, outperforming the Google's baseline of 0.314.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Audio Set Classification with Attention Model: A Probabilistic Perspective

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Visual sentiment analysis via deep multiple clustered instance learning
Wenjing Gao ... Wenjun Zhang
Journal of Intelligent & Fuzzy Systems | VOL. 39
Wenjing Gao, et. al.Wenjing Gao ... Wenjun Zhang
01 Jan 2020
Journal of Intelligent & Fuzzy Systems | VOL. 39

Weakly supervised histopathology cancer image segmentation and classification
Yan Xu ... Eric I-Chao Chang
Medical Image Analysis | VOL. 18
Yan Xu, et. al.Yan Xu ... Eric I-Chao Chang
22 Feb 2014
Medical Image Analysis | VOL. 18

Design and Analysis of Techniques for Multiple-Instance Learning in the Presence of Balanced and Skewed Class Distributions

-

01 Jan 2015
01 Jan 2015

Multiple Instance bagging based ensemble classification of hyperspectral images
Ugur Ergul ... Gokhan Bilgin
-
Ugur Ergul, et. al.Ugur Ergul ... Gokhan Bilgin
01 May 2016
01 May 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Audio Set Classification with Attention Model: A Probabilistic Perspective

Abstract

Talk to us

Similar Papers