Abstract
ABSTRACTIndoor object recognition is a key task for indoor navigation by mobile robots. Although previous work has produced impressive results in recognizing known and familiar objects, the research of indoor object recognition for robot is still insufficient. In order to improve the detection precision, our study proposed a prior knowledge-based deep learning method aimed to enable the robot to recognize indoor objects on sight. First, we integrate the public Indoor dataset and the private frames of videos (FoVs) dataset to train a convolutional neural network (CNN). Second, mean images, which are used as a type of colour knowledge, are generated for all the classes in the Indoor dataset. The distance between every mean image and the input image produces the class weight vector. Scene knowledge, which consists of frequencies of occurrence of objects in the scene, is then employed as another prior knowledge to determine the scene weight. Finally, when a detection request is launched, the two vectors together with a vector of classification probability instigated by the deep model are multiplied to produce a decision vector for classification. Experiments show that detection precision can be improved by employing the prior colour and scene knowledge. In addition, we applied the method to object recognition in a video. The results showed potential application of the method for robot vision.
Highlights
The detection and recognition of indoor objects is an essential task in robot vision
The experimental results suggest that our proposed method may be applied to indoor object detection, and the use of prior knowledge is helpful in enhancing a robot vision for indoor object recognition
We proposed a prior knowledge-based deep model for indoor object recognition
Summary
The detection and recognition of indoor objects is an essential task in robot vision. Classical studies on indoor object recognition mainly relied on machine learning techniques (Ding et al, 2016, 2017; Mei, Yang, & Yin, 2017; Nan, Xie, & Sharf, 2012; Serre, Wolf, Bileschi, Riesenhuber, & Poggio, 2007; Uijlings, van de Sande, Gevers, & Smeulders, 2013). These methods involve a complex pipeline design and cannot learn deep features to generalize their extension.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have