Zero-Shot Classification Based on Word Vector Enhancement and Distance Metric Learning

Ji Zhang,Yongjie Zhai,Yu Chen

doi:10.1109/access.2020.2998495

Ji Zhang, Yongjie Zhai + Show 1 more

Open Access

https://doi.org/10.1109/access.2020.2998495

Copy DOI

Abstract

The zero-shot classification algorithm has been widely concerned in recent years, in which the labeling of samples of a new category is unnecessary and the cost of annotations can be reduced in applications. This paper presents a zero-shot method for image classification based on word vectors enhancement and distance metric learning. Specifically, the convolutional neural network (CNN) is employed to extract image feature vectors which have the same dimension as semantic feature vectors. Then, an unsupervised learning method is applied on Wikipedia corpus for extracting word vectors and the skip-gram is used to obtain word vectors. The model of analysis dictionary learning is improved by reducing redundant information in word vectors. The obtained sparse vectors are used as semantic features and a distance metric learning method is employed to measure the distance between image features and semantic features. Finally, the classification is implemented by a nearest neighbor based classifier. The effectiveness of the proposed algorithm is validated on the AwA and CUB data sets. Experimental results demonstrate that the proposed method has good performance in terms of both accuracy and robustness.

Highlights

Most of the existing object classification methods are within the scope of supervised learning
3 that the performance of the analysis dictionary learning (ADL)-distance metric learning (DML) method has been significantly improved compared with the Euc method
Based on word vector enhancement and distance metric learning, this paper proposes a zero-shot image classification method, enhancing the accuracy of classification and overcoming the limitation of attribute learning, not necessarily labeling a large amount of data

Summary

INTRODUCTION

Most of the existing object classification methods are within the scope of supervised learning. The core idea of the embedding model method is to simultaneously map all visual features and category labels to a certain space, and perform zero-shot classification based on the similarity measure [16], [17]. VOLUME 8, 2020 configured in equal importance at this time, the relationship between samples cannot be effectively described In this case, this paper uses distance metric learning (DML) to measure the distance between the image feature vector and the semantic feature vector, and uses the nearest neighbor classifier to classify depending upon the distance. 3) A zero-shot image classification method based on word vector enhancement and distance metric learning is proposed, which acquires better performance in accuracy and robustness than several mainstream ZSL methods

RELATED WORK

DISTANCE METRIC LEARNING

LARGE MARGIN NEAREST NEIGHBOR ALGORITHM

Findings

DISCUSSION AND CONCLUSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE access : practical innovations, open solutions	Publication Date: Jan 1, 2020
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Zero-Shot Classification Based on Word Vector Enhancement and Distance Metric Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions

Lead the way for us

Similar Papers

Word Embeddings for Natural Language Processing

-

01 Jan 2015
01 Jan 2015

An effective recommendation model based on deep representation learning
Juan Ni ... Shangce Gao
Information sciences | VOL. 542
Juan Ni, et. al.Juan Ni ... Shangce Gao
19 Jul 2020
Information sciences | VOL. 542

Robust Semantic Template Matching Using A Superpixel Region Binary Descriptor.
Hua Yang ... Zhouping Yin
IEEE Transactions on Image Processing | VOL. 28
Hua Yang, et. al.Hua Yang ... Zhouping Yin
17 Jan 2019
IEEE Transactions on Image Processing | VOL. 28

Intelligent Recognition and Teaching of English Fuzzy Texts Based on Fuzzy Computing and Big Data
Ling Liu ... Yuanpeng Zhang
Wireless Communications and Mobile Computing | VOL. 2021
Ling Liu, et. al.Ling Liu ... Yuanpeng Zhang
10 Jul 2021
Wireless Communications and Mobile Computing | VOL. 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Zero-Shot Classification Based on Word Vector Enhancement and Distance Metric Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions