Disassembling object representations without labels

Zunlei Feng,Yongming He,Yike Yuan,Li Sun,Huiqiong Wang,Mingli Song

doi:10.1016/j.neucom.2021.07.004

Abstract

In this paper, we study a new representation-learning task, which we termed as disassembling object representations. Given an image featuring multiple objects, the goal of disassembling is to acquire a latent representation, of which each part corresponds to one category of objects. Disassembling thus finds its application in a wide domain such as image editing and few- or zero-shot learning, as it enables category-specific modularity in the learned representations. To this end, we propose an unsupervised approach to achieving disassembling, named Unsupervised Disassembling Object Representation (UDOR). UDOR follows a double auto-encoder architecture, in which a fuzzy classification and an object-removing operation are imposed. The fuzzy classification constrains each part of the latent representation to encode features of up to one object category, while the object-removing, combined with a generative adversarial network, enforces the modularity of the representations and integrity of the reconstructed image. Furthermore, we devise two metrics to respectively measure the modularity of disassembled representations and the visual integrity of reconstructed images. Experimental results demonstrate that the proposed UDOR, despite unsupervised, achieves truly encouraging results on par with those of supervised methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Disassembling object representations without labels

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Similar Papers

Understanding and Learning of the Knowledge of the Different Categories of Objects
Zbigniew Les ... Magdalena Les
-
Zbigniew Les, et. al.Zbigniew Les ... Magdalena Les
01 Jan 2013
01 Jan 2013

Modality independent adversarial network for generalized zero shot image classification
Haofeng Zhang ... Ling Shao
Neural Networks | VOL. 134
Haofeng Zhang, et. al.Haofeng Zhang ... Ling Shao
21 Nov 2020
Neural Networks | VOL. 134

Information Generative Bayesian Adversarial Networks: A Representation Learning Model for Transmission Gear Parameters
Jie Li ... Haibo He
IEEE/ASME Transactions on Mechatronics | VOL. 24
Jie Li, et. al.Jie Li ... Haibo He
01 Oct 2019
IEEE/ASME Transactions on Mechatronics | VOL. 24

Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning
Jingjing Li ... Yang Yang
-
Jingjing Li, et. al.Jingjing Li ... Yang Yang
12 Oct 2020
12 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Disassembling object representations without labels

Abstract

Talk to us

Similar Papers

More From: Neurocomputing