An End-to-End Trainable Multi-Column CNN for Scene Recognition in Extremely Changing Environment.

Zhenyu Li,Aiguo Zhou,Yong Shen

doi:10.3390/s20061556

Abstract

Scene recognition is an essential part in the vision-based robot navigation domain. The successful application of deep learning technology has triggered more extensive preliminary studies on scene recognition, which all use extracted features from networks that are trained for recognition tasks. In the paper, we interpret scene recognition as a region-based image retrieval problem and present a novel approach for scene recognition with an end-to-end trainable Multi-column convolutional neural network (MCNN) architecture. The proposed MCNN utilizes filters with receptive fields of different sizes to have Multi-level and Multi-layer image perception, and consists of three components: front-end, middle-end and back-end. The first seven layers VGG16 are taken as front-end for two-dimensional feature extraction, Inception-A is taken as the middle-end for deeper learning feature representation, and Large-Margin Softmax Loss (L-Softmax) is taken as the back-end for enhancing intra-class compactness and inter-class-separability. Extensive experiments have been conducted to evaluate the performance according to compare our proposed network to existing state-of-the-art methods. Experimental results on three popular datasets demonstrate the robustness and accuracy of our approach. To the best of our knowledge, the presented approach has not been applied for the scene recognition in literature.

Highlights

With the rapid development of machine learning and artificial intelligence, the application of visual robots has attracted wide attention [1,2]
Visual robots are applied in the field of autonomous navigation, that is motivated by promising application in future autonomous driving
We proposed a novel approach for scene recognition with an end-to-end trainable multiproposed a novel approach for scene recognition with an end-to-end trainable multi-column Convolutional Neural Networks (CNNs)

Summary

Introduction

With the rapid development of machine learning and artificial intelligence, the application of visual robots has attracted wide attention [1,2]. Many algorithms have been applied to scene recognition [4,5,6,7], and one of the most popular are Convolutional Neural Networks (CNNs). CNNs are a deep learning method specially designed for image classification and image recognition based on multi-layer neural network. Due to the limitation of spatial structure and computational consumption, the traditional multi-layer neural network cannot meet the basic needs of robot navigation, the emergence of CNNs effectively solves these problems. Traditional neural networks, object detection and scene recognition. The architecture as shown in in Figure inin a Figure 2 This network network utilizes utilizes filter filterof ofdifferent differentsize sizetotodeal dealwith withthe thescale scaleand andviewpoint viewpointchange change complex environment, as well as has a strong ability of multi-level and multi-layer scene perception. Loss becomes more discriminating, which is helpful to distinguish the different scenes information

The of Inception-A

Handcrafted Feature Method

CNN-Based Method

Proposed Approach

Image Retrieval

Recognized

Experimental Results and Analysis

Performance Measurements

Dataset Used in the Experiment

The Nordland Dataset

The KTH-IDOL2 Dataset

The KITTI Dataset

Scene Recognition with Viewpoint Change

Robustness

Ablation Study

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Mar 11, 2020
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An End-to-End Trainable Multi-Column CNN for Scene Recognition in Extremely Changing Environment.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

Single-Image Crowd Counting via Multi-Column Convolutional Neural Network
Yingying Zhang ... Desen Zhou
-
Yingying Zhang, et. al.Yingying Zhang ... Desen Zhou
01 Jun 2016
01 Jun 2016

Image Steganalysis via Multi-Column Convolutional Neural Network
Qi Ke ... Zhang Daxing
-
Qi Ke, et. al.Qi Ke ... Zhang Daxing
01 Aug 2018
01 Aug 2018

Image Object and Scene Recognition Based on Improved Convolutional Neural Network
Guoyan Li ... Fei Wang
The International Arab Journal of Information Technology | VOL. 21
Guoyan Li, et. al.Guoyan Li ... Fei Wang
01 Jan 2024
The International Arab Journal of Information Technology | VOL. 21

Fast Regression Convolutional Neural Network for Visual Crowd Counting
Shen Khang Teoh ... Vooi Voon Yap
-
Shen Khang Teoh, et. al.Shen Khang Teoh ... Vooi Voon Yap
13 Jul 2021
13 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An End-to-End Trainable Multi-Column CNN for Scene Recognition in Extremely Changing Environment.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors