VMLH: Efficient Video Moment Location via Hashing

Zhifang Tan,Fei Dong,Xinfang Liu,Chenglong Li,Xiushan Nie

doi:10.3390/electronics12020420

Zhifang Tan, Fei Dong + Show 3 more

Open Access

PDF Available

https://doi.org/10.3390/electronics12020420

Copy DOI

Export

Save

Cite

Journal: Electronics	Publication Date: Jan 13, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Shandong University, Shandong Jianzhu University

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Video-moment location by query is a hot topic in video understanding. However, most of the existing methods ignore the importance of location efficiency in practical application scenarios; video and query sentences have to be fed into the network at the same time during the retrieval, which leads to low efficiency. To address this issue, in this study, we propose an efficient video moment location via hashing (VMLH). In the proposed method, query sentences and video clips are, respectively, converted into hash codes and hash code sets, in which the semantic similarity between query sentences and video clips is preserved. The location prediction network is designed to predict the corresponding timestamp according to the similarity among hash codes, and the videos do not need to be fed into the network during the process of retrieval and location. Furthermore, different from the existing methods, which require complex interactions and fusion between video and query sentences, the proposed VMLH method only needs a simple XOR operation among codes to locate the video moment with high efficiency. This paper lays the foundation for fast video clip positioning and makes it possible to apply large-scale video clip positioning in practice. The experimental results on two public datasets demonstrate the effectiveness of the method.

Full Text