Abstract

To assist humans with their daily tasks, mobile robots are expected to navigate complex and dynamic environments, presenting unpredictable combinations of known and unknown objects. Most state-of-the-art object recognition methods are unsuitable for this scenario because they require that: (i) all target object classes are known beforehand, and (ii) a vast number of training examples are provided for each class. This calls for novel methods that can handle unknown object classes, for which fewer images are initially available (few-shot recognition). One way of tackling the problem is learning how to match novel objects to their most similar supporting example. Here, we compare different (shallow and deep) approaches to few-shot image matching on a novel data set consisting of 2D views of common object types drawn from a combination of ShapeNet and Google. First, we assess whether the similarity of objects learned from a combination of ShapeNet and Google can scale up to new object classes, i.e., categories unseen at training time. Furthermore, we show how normalising the learned embeddings can impact the generalisation abilities of the tested methods, in the context of two novel configurations: (i) where the weights of a Convolutional two-branch Network are imprinted and (ii) where the embeddings of a Convolutional Siamese Network are L2-normalised.
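To make the matching step described above concrete, the sketch below shows few-shot recognition by nearest-neighbour matching of L2-normalised embeddings: a query image is embedded by one convolutional branch, the embedding is projected onto the unit hypersphere, and the query receives the label of its most similar support example. This is a minimal, hypothetical PyTorch sketch, not the networks evaluated in the paper; the encoder architecture, image size, and class names are illustrative assumptions.

```python
# Minimal sketch (not the authors' implementation): few-shot matching by
# comparing L2-normalised CNN embeddings of a query image against one
# support example per class. Encoder, image size, and labels are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SmallEncoder(nn.Module):
    """Toy convolutional branch standing in for one branch of a two-branch (Siamese) network."""

    def __init__(self, embedding_dim=64):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(32, embedding_dim)

    def forward(self, x):
        z = self.fc(self.features(x).flatten(1))
        # L2-normalisation places embeddings on the unit hypersphere,
        # so matching reduces to cosine similarity.
        return F.normalize(z, p=2, dim=1)


def match_to_support(encoder, query, support_images, support_labels):
    """Assign the query the label of its most similar support example."""
    with torch.no_grad():
        q = encoder(query)            # shape (1, d)
        s = encoder(support_images)   # shape (n_support, d)
    similarities = q @ s.t()          # dot product == cosine similarity for unit-norm vectors
    return support_labels[similarities.argmax(dim=1).item()]


if __name__ == "__main__":
    encoder = SmallEncoder()
    support = torch.randn(3, 3, 64, 64)   # one (random, placeholder) example per class
    labels = ["mug", "chair", "laptop"]   # hypothetical class names
    query = torch.randn(1, 3, 64, 64)
    print(match_to_support(encoder, query, support, labels))
```

Because both query and support embeddings are unit-norm, the dot product equals cosine similarity, which is why the normalisation step studied in the paper directly affects how matches are ranked.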

Highlights

  • As the fields of Artificial Intelligence (AI) and Robotics mature and evolve, an increasing number of hardware and software solutions have become available, reducing the costs and technical barriers of developing novel robotic platforms

  • We further extend this investigation to assess whether the deep representations learned by similarity matching on ShapeNet can: (i) outperform the previously explored shallow representations, and (ii) generalise to new object classes

  • The results obtained in [19], which are summarised in Table 3, although providing an improvement over random label assignment in all configurations, were not sufficient to discriminate between different object classes


Summary

Introduction

As the fields of Artificial Intelligence (AI) and Robotics mature and evolve, an increasing number of hardware and software solutions have become available, reducing the costs and technical barriers of developing novel robotic platforms. Real-time object recognition has reached satisfactory solutions [8,9,10,11] only in experimental scenarios where a very large amount of human-annotated data is available and all object classes are assumed to be predetermined, known as the closed-world assumption [20]. Problems such as the paucity of training data or the adaptability to new learning environments are pervasive across all sub-fields of AI, rather than specific to the object recognition area.

