DAKRS: Domain Adaptive Knowledge-Based Retrieval System for Natural Language-Based Vehicle Retrieval

Synh Viet-Uyen Ha,Huy Dinh-Anh Le,Quang Qui-Vinh Nguyen,Nhat Minh Chung

doi:10.1109/access.2023.3260149

Synh Viet-Uyen Ha, Huy Dinh-Anh Le + Show 2 more

Open Access

https://doi.org/10.1109/access.2023.3260149

Copy DOI

Abstract

Given Natural Language (NL) text descriptions, NL-based vehicle retrieval aims to extract target vehicles from a multi-view multi-camera traffic video pool. Due to inherent distinctions between textual and visual data, this is a challenging multi-modal retrieval task that requires robust feature extractors (e.g. neural network) to well-align the abstract representations of texts and images in the same domain. However, solutions to the problem have been challenged by the high data complexities of not only the multi-view, multi-camera attributes of visual data and the diverse range of textual descriptions but also a lack of high-volume datasets in this relatively new field, alongside a prominently large domain gap between training and test sets. Many existing approaches have developed computationally expensive models to separately extract the subspaces of language and vision before blending into the same shared representation space while only focusing on single-modal information and ignoring much of the multi-modal information to deal with the aforementioned issues. Hence, we propose a Domain Adaptive Knowledge-based Retrieval System (DAKRS) to effectively and efficiently align multi-modal knowledge in a setting of limited labels. Our contributions are threefold: (i) An efficient extension of Contrastive Language-Image Pre-training (CLIP)’s transfer learning into a baseline text-to-image multi-modular vehicle retrieval framework; (ii) A data enhancement module to create pseudo-vehicle tracks from the traffic video pool by leveraging the robustness of baseline retrieval model combine with background subtraction; and (iii) A SSDA (SSDA) scheme to engineer pseudo-labels for adapting model parameters to the target domain distribution. Experimental results are benchmarked on the Cityflow-NL dataset, illustrating our competitiveness against state-of-the-art performances in terms of effectiveness and efficiency without needing further post-processing or ensembling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DAKRS: Domain Adaptive Knowledge-Based Retrieval System for Natural Language-Based Vehicle Retrieval

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Journal: IEEE Access	Publication Date: Jan 1, 2023
License type: CC BY-NC-ND 4.0

Similar Papers

Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle Retrieval
Chuyang Zhao ... Sipeng Zhang
-
Chuyang Zhao, et. al.Chuyang Zhao ... Sipeng Zhang
01 Jun 2022
01 Jun 2022

FindVehicle and VehicleFinder: a NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval system
Runwei Guan ... Yutao Yue
Multimedia Tools and Applications | VOL. 83
Runwei Guan, et. al.Runwei Guan ... Yutao Yue
14 Aug 2023
Multimedia Tools and Applications | VOL. 83

Ontology-Based Natural Language Texts Generation from Knowledge Base
Longwei Qian ... Wenzu Li
-
Longwei Qian, et. al.Longwei Qian ... Wenzu Li
01 Jan 2021
01 Jan 2021

Transfer Learning with Convolutional Neural Networks for Classification of Abdominal Ultrasound Images.
Phillip M Cheng ... Harshawn S Malhi
Journal of Digital Imaging | VOL. 30
Phillip M Cheng, et. al.Phillip M Cheng ... Harshawn S Malhi
28 Nov 2016
Journal of Digital Imaging | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DAKRS: Domain Adaptive Knowledge-Based Retrieval System for Natural Language-Based Vehicle Retrieval

Abstract

Talk to us

Similar Papers

More From: IEEE Access