Abstract

Automatic image annotation is an active field of research in which a set of annotations is automatically assigned to an image based on its content. In the literature, some works opt for handcrafted features and manual approaches to linking concepts to images, whereas others treat convolutional neural networks (CNNs) as black boxes that solve the problem without external interference. In this work, we introduce a hybrid approach that combines the advantages of both CNNs and conventional concept-to-image assignment approaches. J-image segmentation (JSEG) is first used to segment the image into a set of homogeneous regions; a CNN then produces a rich feature descriptor per region, and the vector of locally aggregated descriptors (VLAD) encoding is applied to the extracted features to generate compact, unified descriptors. Thereafter, the not-too-deep (N2D) clustering algorithm is performed to define the local manifolds constituting the feature space, and finally, semantic relatedness is calculated for both image–concept and concept–concept pairs using KNN regression to better capture the meaning of concepts and how they relate. In a comprehensive experimental evaluation, our method outperforms a wide range of recent related works, yielding F1 scores of 58.89% and 80.24% on the Corel 5k and MSRC v2 datasets, respectively. It also demonstrates a relatively high capacity for learning more concepts with higher accuracy, resulting in N+ values of 212 and 22 on Corel 5k and MSRC v2, respectively.
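To make the data flow concrete, the following is a minimal Python sketch of the pipeline described above, not the authors' implementation: SLIC stands in for JSEG segmentation, a random vector stands in for the CNN descriptor of each region, a KMeans codebook is used for the VLAD encoding, and the N2D clustering stage is omitted. All function names and parameters are illustrative assumptions; only the region → descriptor → VLAD → KNN-regression flow is shown.

    # Hypothetical sketch of the annotation pipeline described in the abstract.
    # JSEG and N2D are not standard library components, so SLIC and KMeans
    # stand in for them here purely to illustrate the data flow.
    import numpy as np
    from skimage.segmentation import slic
    from sklearn.cluster import KMeans
    from sklearn.neighbors import KNeighborsRegressor

    def segment_regions(image, n_segments=20):
        """Stand-in for JSEG: split the image into homogeneous regions."""
        labels = slic(image, n_segments=n_segments, start_label=0)
        return [image * (labels == r)[..., None] for r in np.unique(labels)]

    def cnn_descriptor(region):
        """Stand-in for the CNN feature extractor (e.g., a pretrained backbone)."""
        return np.random.rand(64)  # placeholder 64-D descriptor

    def vlad_encode(descriptors, codebook):
        """Aggregate per-region descriptors into one compact VLAD vector."""
        assignments = codebook.predict(descriptors)
        vlad = np.zeros_like(codebook.cluster_centers_)
        for d, k in zip(descriptors, assignments):
            vlad[k] += d - codebook.cluster_centers_[k]   # accumulate residuals
        vlad = vlad.flatten()
        return vlad / (np.linalg.norm(vlad) + 1e-12)       # L2-normalize

    # Toy usage: encode a few synthetic images and regress concept relevance scores.
    rng = np.random.default_rng(0)
    images = [rng.random((64, 64, 3)) for _ in range(8)]
    codebook = KMeans(n_clusters=4, n_init=10, random_state=0).fit(rng.random((100, 64)))
    X = np.stack([
        vlad_encode(np.stack([cnn_descriptor(r) for r in segment_regions(img)]), codebook)
        for img in images
    ])
    y = rng.random((8, 5))                       # per-image relevance of 5 concepts
    knn = KNeighborsRegressor(n_neighbors=3).fit(X, y)
    print(knn.predict(X[:1]).round(2))           # predicted concept relevance scores

In the actual method, the regression step would operate on the manifolds found by N2D clustering rather than on raw VLAD vectors, and the predicted relevance scores would be thresholded or ranked to select the final annotations.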

Highlights

  • With technological advancement, it is becoming increasingly simple for people to capture photographs of various locations and activities

  • The image labeling procedure entails assigning to a picture one or more labels that describe its content. This procedure may be used for a variety of tasks, including automatic photo labeling on social media [1], automatic photo description for visually impaired persons [2], and automatic text production from photographs [3]

  • Automatic image annotation (AIA) methods can be roughly grouped into two categories: global-based and local-based methods


Summary

Introduction

It is becoming increasingly simple for people to capture photographs of various locations and activities. The image labeling procedure (image annotation) entails assigning to a picture one or more labels (tags) that describe its content. This procedure may be used for a variety of tasks, including automatic photo labeling on social media [1], automatic photo description for visually impaired persons [2], and automatic text production from photographs [3]. Since it takes a lot of time and effort, manual image labeling (tagging) is inconvenient for small collections and infeasible for huge collections.

Related Work
Our Proposal
Region Representation
Calculating Blob–Label Co-Occurrences
Annotating New Images
Experiments and Result Analysis
Experiment Setup
Scenario 1
Method
Findings
Conclusions