Zero-Shot Image Classification Based on a Learnable Deep Metric.

Jingyi Liu,Dongjing Tu,Caijuan Shi,Yazhi Liu,Ze Shi

doi:10.3390/s21093241

Jingyi Liu, Dongjing Tu + Show 3 more

Open Access

https://doi.org/10.3390/s21093241

Copy DOI

Abstract

The supervised model based on deep learning has made great achievements in the field of image classification after training with a large number of labeled samples. However, there are many categories without or only with a few labeled training samples in practice, and some categories even have no training samples at all. The proposed zero-shot learning greatly reduces the dependence on labeled training samples for image classification models. Nevertheless, there are limitations in learning the similarity of visual features and semantic features with a predefined fixed metric (e.g., as Euclidean distance), as well as the problem of semantic gap in the mapping process. To address these problems, a new zero-shot image classification method based on an end-to-end learnable deep metric is proposed in this paper. First, the common space embedding is adopted to map the visual features and semantic features into a common space. Second, an end-to-end learnable deep metric, that is, the relation network is utilized to learn the similarity of visual features and semantic features. Finally, the invisible images are classified, according to the similarity score. Extensive experiments are carried out on four datasets and the results indicate the effectiveness of the proposed method.

Highlights

Thanks to the development of deep learning models, image classification and image recognition have made continuous progress
Inspired by the relation network model, we propose a new Zero-shot learning (ZSL) method based on the learnable deep metric in this paper
The zero-shot image classification method based on learnable deep metric

Summary

Introduction

Thanks to the development of deep learning models, image classification and image recognition have made continuous progress. Sandouk et al [13] have used the Euclidean distance between embedded concepts in the concept embedding space to reflect the semantic similarity; while the simple metric has the limitation of unlearnable and being predefined in advance To overcome these limitations, Sung et al [14] have proposed the relation network model (RN) to learn a learnable end-to-end deep metric for comparing the relation between visual features and semantic features with the relationship scores. ZIC-LDM, can learn the correlation between visual features and semantic features in the common space with the learnable deep metric, and it adjusts the correlation end-to-end in a data-driven way This can greatly alleviate the semantic gap problem caused by the inconsistency between the manifold of visual features and semantic features. Experiments are conducted on widely used datasets and the experimental results indicate that ZIC-LDM has the ability to achieve better zero-shot image classification performance compared with other methods

Zero-Shot Learning

Meta Learning

Semantic Features

Similarity Measure for Zero-Shot Image Classificaiton

Task Define n

Relation Module

Common Space Embedding Module

Objective Function

Model Implementation

Zero-Shot Image Classification

Generalized Zero-Shot Image Classification

Dataset and Settings

Traditional Zero-Shot Image Classification

Generalized

Loss Convergence Analysis

Distance Metric Study

Findings

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors (Basel, Switzerland)	Publication Date: May 7, 2021
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Zero-Shot Image Classification Based on a Learnable Deep Metric.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

Embedded Zero-Shot Image Classification Based on Bidirectional Feature Mapping
Huadong Sun ... Pengyi Zhang
Applied sciences | VOL. 14
Huadong Sun, et. al.Huadong Sun ... Pengyi Zhang
17 Jun 2024
Applied sciences | VOL. 14

Class knowledge overlay to visual feature learning for zero-shot image classification
Cheng Xie ... Qing Liu
Computer Vision and Image Understanding | VOL. 207
Cheng Xie, et. al.Cheng Xie ... Qing Liu
31 Mar 2021
Computer Vision and Image Understanding | VOL. 207

Zero-shot image classification based on factor space
Shijie Guan ... Anqi Yin
International Journal of Web Engineering and Technology | VOL. 16
Shijie Guan, et. al.Shijie Guan ... Anqi Yin
01 Jan 2020
International Journal of Web Engineering and Technology | VOL. 16

Method for improving zero‐shot image classification
Xiangfeng Chen ... Hu Han
The Journal of Engineering | VOL. 2018
Xiangfeng Chen, et. al.Xiangfeng Chen ... Hu Han
18 Oct 2018
The Journal of Engineering | VOL. 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Zero-Shot Image Classification Based on a Learnable Deep Metric.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)