Abstract

Various perceptual domains have underlying compositional semantics that are rarely captured in current models. We suspect this is because these models have been unable to learn the compositional structure directly. Yet the compositional structure of a given domain can be grounded in a separate domain, thereby simplifying its learning. To that end, we propose a new approach to modeling bimodal percepts that explicitly relates distinct projections across each modality and then jointly learns a bimodal sparse representation. The resulting model enables compositionality across these distinct projections and hence can generalize to unobserved percepts spanned by this compositional basis. For example, our model can be trained on 'red triangles' and 'blue squares'; yet it will implicitly also have learned 'red squares' and 'blue triangles'. The structure of the projections, and hence the compositional basis, is learned automatically for a given language model. To test our model, we have acquired a new bimodal dataset comprising images and spoken utterances of colored shapes in a tabletop setup. Our experiments demonstrate the benefits of explicitly leveraging compositionality in both quantitative and human evaluation studies.
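As a rough illustration (not taken from the paper itself), a jointly learned bimodal sparse representation of this kind can be sketched as a coupled sparse coding objective over paired visual and speech features $x^{(v)}$ and $x^{(s)}$, with modality-specific dictionaries $D^{(v)}$ and $D^{(s)}$ and a single shared sparse code $\alpha$; the notation and the $\ell_1$ penalty weight $\lambda$ below are illustrative assumptions, not the authors' formulation:

\[
\min_{D^{(v)},\, D^{(s)},\, \alpha} \;\; \big\| x^{(v)} - D^{(v)}\alpha \big\|_2^2 \;+\; \big\| x^{(s)} - D^{(s)}\alpha \big\|_2^2 \;+\; \lambda \, \| \alpha \|_1
\]

Because both modalities share the code $\alpha$, factors observed only in separate training pairs (e.g., a color seen with one shape and a shape seen with another color) can in principle be recombined at inference time, which is the kind of compositional generalization described above.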
