Abstract

Deep Metric Learning (DML) is highly effective for many computer vision applications such as image retrieval and cross-modal matching. The common paradigm for DML is to learn metric spaces in which semantically similar objects are embedded close together while dissimilar ones are pushed far apart. To make features more discriminative, mainstream methods typically design specialized loss functions and rely on hard negatives, obtained either through complex hard-mining strategies or by synthesizing them with additional networks. Despite their effectiveness, these approaches ignore the impact of low-level image information on performance, which may degrade the discriminative ability of the learned embeddings. To alleviate these problems, we introduce a simple yet effective augmentation method that generates more hard negatives by swapping the low-frequency spectra of negative instances with those of anchors in the Fourier domain. Unlike previous methods, our proposed approach does not involve any complex design strategies; it enriches hard negatives by manipulating the low-level variability of images using only simple Fourier transforms. In addition, our method serves as a universal plug-in that can be incorporated into different models to improve performance. Finally, we conduct extensive experiments on the widely used CUB-200-2011, CARS-196, and Stanford Online Products datasets. Our quantitative results demonstrate that the proposed plug-in consistently and significantly outperforms previous approaches across different datasets and evaluation metrics.
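To make the core idea concrete, the sketch below illustrates one plausible form of the low-frequency spectrum swap described in the abstract: the central low-frequency band of a negative image's 2D Fourier spectrum is replaced by the anchor's, so the negative inherits the anchor's low-level appearance while keeping its own high-frequency content. The function name, the `ratio` parameter controlling the size of the swapped band, and the choice of swapping the full complex spectrum (rather than, say, amplitude only) are illustrative assumptions, not the paper's exact implementation.

```python
import numpy as np

def swap_low_frequency(anchor: np.ndarray, negative: np.ndarray, ratio: float = 0.1) -> np.ndarray:
    """Return a hard-negative candidate: the negative image whose central
    low-frequency spectrum is replaced by the anchor's.

    Both inputs are float arrays of shape (H, W, C) with matching shapes.
    `ratio` sets the fraction of the spectrum treated as "low frequency".
    """
    assert anchor.shape == negative.shape
    h, w = anchor.shape[:2]

    # 2D FFT per channel, shifted so low frequencies sit at the center.
    fft_anchor = np.fft.fftshift(np.fft.fft2(anchor, axes=(0, 1)), axes=(0, 1))
    fft_negative = np.fft.fftshift(np.fft.fft2(negative, axes=(0, 1)), axes=(0, 1))

    # Bounds of the central low-frequency block to swap.
    bh, bw = max(int(h * ratio), 1), max(int(w * ratio), 1)
    cy, cx = h // 2, w // 2
    y0, y1 = cy - bh // 2, cy + bh // 2 + 1
    x0, x1 = cx - bw // 2, cx + bw // 2 + 1

    # Replace the negative's low-frequency band with the anchor's.
    fft_mixed = fft_negative.copy()
    fft_mixed[y0:y1, x0:x1] = fft_anchor[y0:y1, x0:x1]

    # Back to the spatial domain; discard the small imaginary residue.
    mixed = np.fft.ifft2(np.fft.ifftshift(fft_mixed, axes=(0, 1)), axes=(0, 1))
    return np.real(mixed)
```

Because such a transform touches only image pixels before they enter the embedding network, it can in principle be dropped into an existing DML training pipeline as an extra augmentation on sampled negatives, which is consistent with the plug-in usage the abstract describes.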
