Abstract

An embodied, autonomous agent able to set its own goals must possess geometrical reasoning abilities for judging whether its goals have been achieved: it should be able to identify and discriminate classes of configurations of objects, irrespective of its point of view on the scene. However, this problem has received little attention so far in the deep learning literature. In this paper we make two key contributions. First, we propose SpatialSim (Spatial Similarity), a novel geometrical reasoning diagnostic dataset, and argue that progress on this benchmark would allow for diagnosing more principled approaches to this problem. The benchmark is composed of two tasks, “Identification” and “Discrimination,” each instantiated in increasing levels of difficulty. Second, we validate that relational inductive biases, as exhibited by fully-connected message-passing Graph Neural Networks (MPGNNs), are instrumental in solving these tasks, and show their advantages over less relational baselines such as Deep Sets and unstructured models such as Multi-Layer Perceptrons. We additionally showcase the failure of high-capacity CNNs on the hard Discrimination task. Finally, we highlight the current limits of GNNs on both tasks.

Highlights

  • Building autonomous agents that can explore their environment and build an open-ended repertoire of skills is a long-standing goal of developmental robotics and artificial intelligence in general

  • We find that GNNs that operate on fully-connected underlying graphs of the objects outperform a less-connected counterpart we call Recurrent Deep Set (RDS), regular Deep Sets, and unstructured Multi-Layer Perceptrons (MLPs), suggesting that relational computation between objects is instrumental in solving the SpatialSim tasks

  • Since the DS and RDS layers can be seen as stripped-down versions of the message-passing Graph Neural Network (MPGNN) layer, if d and h stay constant the number of parameters drops for the RDS layer, and drops even further for the DS layer

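The parameter comparison in the last highlight can be made concrete with a back-of-the-envelope sketch. The layer shapes below are assumptions for illustration, not the paper's exact architectures: each layer uses single linear maps for its message and update functions, with node-feature dimension d and hidden size h. The MPGNN message acts on concatenated node pairs, the RDS variant drops the pairwise concatenation, and the DS layer drops the update function entirely.

```python
def n_params(layers):
    """Total parameters (weights + biases) for a stack of linear maps."""
    return sum(fan_in * fan_out + fan_out for fan_in, fan_out in layers)

d, h = 16, 32  # hypothetical node-feature and hidden dimensions

# MPGNN layer: message on concatenated node pairs [2d -> h],
# update on [node, aggregated message] [d + h -> d]
mpgnn = n_params([(2 * d, h), (d + h, d)])

# RDS layer (assumed form): per-node "message" [d -> h],
# same update [d + h -> d], no pairwise concatenation
rds = n_params([(d, h), (d + h, d)])

# DS layer: per-node map [d -> h] followed by parameter-free pooling
ds = n_params([(d, h)])

print(mpgnn, rds, ds)  # -> 1840 1328 544
assert mpgnn > rds > ds  # parameters drop from MPGNN to RDS to DS
```

With d and h held constant, the ordering holds for any positive dimensions, which matches the highlight's claim.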
Introduction

Building autonomous agents that can explore their environment and build an open-ended repertoire of skills is a long-standing goal of developmental robotics and artificial intelligence in general. To this end, one option that has been explored in the literature is autotelic agents: agents that can set their own goals and learn to reach them. Crucial to the development of such agents, especially from a developmental perspective, is the learning of a goal-achievement function (or reward function) that measures how close the agent is to reaching its goal (Bahdanau et al., 2019; Colas et al., 2020). Since this representation is crucial to the performance and robustness of the networks, a principled approach would be to represent these states and goals in a cognitively plausible way.
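To make the notion concrete, here is a minimal, hand-crafted sketch of the kind of viewpoint-invariant check a goal-achievement function must perform on SpatialSim-style configurations. This is an illustration only, not the learned models studied in the paper: object positions are assumed to be 2-D coordinates listed in a fixed, known order, and invariance to rotation and translation is obtained by comparing pairwise-distance matrices.

```python
import numpy as np

def pairwise_dists(positions):
    # positions: (n_objects, 2) array of 2-D coordinates
    diff = positions[:, None, :] - positions[None, :, :]
    return np.linalg.norm(diff, axis=-1)

def goal_achieved(scene, goal, tol=1e-5):
    """Hypothetical hand-crafted goal-achievement function: the scene
    matches the goal configuration irrespective of viewpoint (rotation
    and translation) iff their pairwise-distance matrices agree."""
    return np.allclose(pairwise_dists(scene), pairwise_dists(goal), atol=tol)

goal = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
theta = np.pi / 3  # rotate the whole configuration by 60 degrees
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
scene = goal @ R.T + np.array([5.0, -3.0])  # same layout, new viewpoint
print(goal_achieved(scene, goal))  # -> True
```

A learned model must acquire this invariance from data rather than having it hard-coded, which is precisely what the SpatialSim tasks probe.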
