Compositional Scene Representation Learning via Reconstruction: A Survey.

Jinyang Yuan,Xiangyang Xue,Tonglin Chen,Bin Li

doi:10.1109/tpami.2023.3286184

Abstract

Visual scenes are composed of visual concepts and have the property of combinatorial explosion. An important reason for humans to efficiently learn from diverse visual scenes is the ability of compositional perception, and it is desirable for artificial intelligence to have similar abilities. Compositional scene representation learning is a task that enables such abilities. In recent years, various methods have been proposed to apply deep neural networks, which have been proven to be advantageous in representation learning, to learn compositional scene representations via reconstruction, advancing this research direction into the deep learning era. Learning via reconstruction is advantageous because it may utilize massive unlabeled data and avoid costly and laborious data annotation. In this survey, we first outline the current progress on reconstruction-based compositional scene representation learning with deep neural networks, including development history and categorizations of existing methods from the perspectives of the modeling of visual scenes and the inference of scene representations; then provide benchmarks, including an open source toolbox to reproduce the benchmark experiments, of representative methods that consider the most extensively studied problem setting and form the foundation for other methods; and finally discuss the limitations of existing methods and future directions of this research topic.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Compositional Scene Representation Learning via Reconstruction: A Survey.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Oct 1, 2023
Citations: 10

Similar Papers

A Survey on ensemble learning under the era of deep learning
Yongquan Yang ... Haijun Lv
Artificial Intelligence Review | VOL. 56
Yongquan Yang, et. al.Yongquan Yang ... Haijun Lv
02 Nov 2022
Artificial Intelligence Review | VOL. 56

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints
Jinyang Yuan ... Xiangyang Xue
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Jinyang Yuan, et. al.Jinyang Yuan ... Xiangyang Xue
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Robust and Verifiable Information Embedding Attacks to Deep Neural Networks via Error-Correcting Codes
Jinyuan Jia ... Neil Zhenqiang Gong
-
Jinyuan Jia, et. al.Jinyuan Jia ... Neil Zhenqiang Gong
24 May 2021
24 May 2021

Reconstruction of natural visual scenes from neural spikes with deep neural networks
Yichen Zhang ... Jian K Liu
Neural Networks | VOL. 125
Yichen Zhang, et. al.Yichen Zhang ... Jian K Liu
08 Feb 2020
Neural Networks | VOL. 125

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Compositional Scene Representation Learning via Reconstruction: A Survey.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence