TDCMR: Triplet-Based Deep Cross-Modal Retrieval for Geo-Multimedia Data

Jiagang Song,Yunwu Lin,Jiayu Song,Leyuan Zhang,Weiren Yu

doi:10.3390/app112210803

Abstract

Mass multimedia data with geographical information (geo-multimedia) are collected and stored on the Internet due to the wide application of location-based services (LBS). How to find the high-level semantic relationship between geo-multimedia data and construct efficient index is crucial for large-scale geo-multimedia retrieval. To combat this challenge, the paper proposes a deep cross-modal hashing framework for geo-multimedia retrieval, termed as Triplet-based Deep Cross-Modal Retrieval (TDCMR), which utilizes deep neural network and an enhanced triplet constraint to capture high-level semantics. Besides, a novel hybrid index, called TH-Quadtree, is developed by combining cross-modal binary hash codes and quadtree to support high-performance search. Extensive experiments are conducted on three common used benchmarks, and the results show the superior performance of the proposed method.

Highlights

With the rapid development of mobile internet, social networks, and Location-Based Service (LBS), large numbers of multimedia data [1] with geographical information (a.k.a geo-multimedia) [2], such as text, image [3,4], and video [5,6,7,8], are collected and stored on the internet
We propose a triplet-based deep cross-modal hashing framework, named Tripletbased Deep Cross-Modal Retrieval (TDCMR), which aims to extract deep sample features to alleviate the semantic gap through a triplet deep neural network unified feature learning and hash learning process
To solve the problem of low representation ability and slow query speed in geomultimedia data representation and query, this paper aims to narrow the cognitive gap between human and computer in multimedia data semantic understanding through a deep neural network, construct the deep cross-modal hash (Triplet-based Deep Cross-Modal Retrieval, TDCMR) network model based on triples, and encode geo-multimedia data semantically by a trained network model

Summary

Introduction

With the rapid development of mobile internet, social networks, and Location-Based Service (LBS), large numbers of multimedia data [1] with geographical information (a.k.a geo-multimedia) [2], such as text, image [3,4], and video [5,6,7,8], are collected and stored on the internet. Nearest neighbor spatial keyword query (NNSKQ) is a very important retrieval technique in LBS applications, which only focuses on location information and keyword information to find spatial objects. The traditional multi-modal retrieval techniques ignore the geographic location information To solve this dilemma, many researchers have tried to integrate multi-modal information into the query and proposed an effective nearest neighbor query method for geo-multimedia data [15]

Objectives

Methods

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Nov 16, 2021
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

TDCMR: Triplet-Based Deep Cross-Modal Retrieval for Geo-Multimedia Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Position and Velocity Tracking in Cellular Networks Using the Kalman Filter
Mohammed Olama ... Teja Kuruganti
-
Mohammed Olama, et. al.Mohammed Olama ... Teja Kuruganti
01 Apr 2009
01 Apr 2009

A framework to develop location based services applications using OGC map services
N Fernando ... D Dias
-
N Fernando, et. al.N Fernando ... D Dias
01 Dec 2010
01 Dec 2010

Efficient Bulk Loading to Accelerate Spatial Keyword Queries
...
-
, et. al. ...
15 Dec 2013
15 Dec 2013

Optimizing Geofencing for Location-Based Services: A New Application of Spatial Marketing
Odile J Streed ... Albert Kagan
-
Odile J Streed, et. al.Odile J Streed ... Albert Kagan
13 Oct 2014
13 Oct 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TDCMR: Triplet-Based Deep Cross-Modal Retrieval for Geo-Multimedia Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences