DBGC: Dimension-Based Generic Convolution Block for Object Recognition.

Chirag Patel, Muhammad Ahmed Khan, Mohd Zuhair, Radhika Patel, Syed Aziz Shah, Urvi Bhatt, Urvashi Sharma, Shubhankar Majumdar, Kirit Modi, Dulari Bhatt, Nagaraj Cholli, Khushi Patel, Hemant Ghayvat, Akash Patel, Sharnil Pandya

doi:10.3390/s22051780

Abstract

Object recognition is widely used as a result of increasing CCTV surveillance and the need for automatic object or activity detection from images or video. The growing use of sensor networks has also raised the need for lightweight processing frameworks. Much research has been carried out in this area, but the scope remains large because it deals with open-ended problems such as achieving high accuracy in little time using lightweight processing frameworks. Convolutional Neural Networks (CNNs) and their variants are widely used in computer vision tasks, but most CNN architectures are application-specific, so there is a continuing need for generic architectures with better performance. This paper introduces the Dimension-Based Generic Convolution Block (DBGC), which can be used with any CNN to make the architecture generic and provide a dimension-wise selection of various height, width, and depth kernels. This single unit, which uses the separable convolution concept, provides multiple combinations using various dimension-based kernels. It can be used for the height, width, or depth dimension alone; for height and width, width and depth, or depth and height together; or for all three dimensions combined. The main novelty of DBGC lies in the dimension selector block included in the proposed architecture. The proposed unoptimized kernel dimensions reduce FLOPs by around one third but also reduce accuracy by around one half; semi-optimized kernel dimensions yield almost the same or higher accuracy with half the FLOPs of the original architecture; and optimized kernel dimensions provide 5 to 6% higher accuracy with a reduction of around 10 M FLOPs.
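The dimension combinations listed in the abstract (each dimension alone, each pair, and all three together) can be enumerated with a few lines of Python. This is only an illustrative sketch: the names `DIMENSIONS` and `dimension_selections` are hypothetical and are not taken from the paper, and the code does not reproduce the dimension selector block itself.

```python
from itertools import combinations

# Dimension names taken from the abstract; the tuple itself is an
# illustrative assumption, not an identifier from the paper.
DIMENSIONS = ("height", "width", "depth")


def dimension_selections(dims=DIMENSIONS):
    """Return every non-empty subset of dims, smallest subsets first."""
    selections = []
    for size in range(1, len(dims) + 1):
        selections.extend(combinations(dims, size))
    return selections


if __name__ == "__main__":
    for selection in dimension_selections():
        print(" + ".join(selection))
```

With three dimensions this gives seven selections: three single dimensions, three pairs, and one triple, which matches the cases the abstract lists.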

Highlights

The Convolutional Neural Network is a widely used deep learning architecture for computer vision tasks such as object detection, object segmentation, and object recognition [1].
We evaluated the generic nature of the Dimension-Based Generic Convolution Block (DBGC) unit on the PASCAL VOC dataset.
DBGC was used with the ESPNetv2 and ShuffleNetv2 architectures.

Summary

Introduction

The Convolutional Neural Network is a widely used deep learning architecture for computer vision tasks such as object detection, object segmentation, and object recognition [1]. The DBGC module can be added to any architecture to reduce the number of FLOPs without affecting accuracy. The two main contributions of the research are the semi-optimized kernel and optimized kernel methods, which reduce the number of FLOPs while providing equal or greater accuracy. An important task for any computer vision application is to extract the correct features [20]. As noted in [21], fusion methods for feature extraction can improve performance. The following sections explain the basic architecture of each of these networks and state their merits and demerits.
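As background for the FLOP reductions mentioned above, the sketch below compares the multiply-accumulate (MAC) count of a standard convolution with that of a depth-wise separable convolution, using the widely known textbook formulas. It is not the DBGC computation from the paper. The function names and the layer sizes are hypothetical, and biases and activation costs are ignored.

```python
def standard_conv_macs(h_out, w_out, c_in, c_out, k):
    """MACs for a standard k x k convolution producing h_out x w_out x c_out."""
    return h_out * w_out * c_in * c_out * k * k


def depthwise_separable_macs(h_out, w_out, c_in, c_out, k):
    """MACs for a k x k depth-wise convolution followed by a 1 x 1 point-wise one."""
    depthwise = h_out * w_out * c_in * k * k   # one k x k filter per input channel
    pointwise = h_out * w_out * c_in * c_out   # 1 x 1 convolution mixing channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical layer sizes chosen only for illustration.
    h_out, w_out, c_in, c_out, k = 56, 56, 64, 128, 3
    standard = standard_conv_macs(h_out, w_out, c_in, c_out, k)
    separable = depthwise_separable_macs(h_out, w_out, c_in, c_out, k)
    print(f"standard:  {standard:,} MACs")
    print(f"separable: {separable:,} MACs")
    print(f"ratio:     {separable / standard:.4f}")
```

The ratio simplifies to 1/c_out + 1/k², so the saving grows with the number of output channels and the kernel size.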

ShuffleNetv2

ESPNetv2

DiCENet

MobileNetv2

Materials and Methods

Introduction to Separable Convolution

Simple

Depth-Wise Separable Convolution

Introduction to Convolution Kernels

DBGC: Dimension-Based Generic Convolution

Implementation

Experimental Setup

Dataset Details

Results Analysis

Unoptimized Kernel Dimensions

Conclusions

Future Work

Journal: Sensors (Basel, Switzerland)	Publication Date: Feb 24, 2022
Citations: 35	License type: CC BY 4.0
