On the tag localization of web video

Haojie Li, Bin Liu, Yue Guan, Lei Yi, Zhong-Xuan Luo

doi:10.1007/s00530-014-0404-y

Abstract

Social videos are now pervasive on the web. They are characterized by rich accompanying contextual information that describes the video content and thus greatly facilitates video search and browsing. Generally, contextual data such as tags are provided at the whole-video level, with no temporal indication of when they actually appear in the video, let alone spatial annotation of object-related tags in the video frames. However, many tags describe only parts of the video content. Tag localization, the process of assigning tags to the relevant video segments, frames, or even regions within frames, is therefore attracting increasing research interest, and a benchmark dataset for the fair evaluation of tag localization algorithms is highly desirable. In this paper, we describe and release a dataset called DUT-WEBV, which contains about 4,000 videos collected from YouTube by issuing 50 concepts as queries. These concepts cover a wide range of semantic aspects, including scenes like "mountain", events like "flood", objects like "cows", sites like "gas station", and activities like "handshaking", posing great challenges to the tag (i.e., concept) localization task. For each video of a tag, we carefully annotate the time durations during which the tag appears in the video and, for object-related tags, also label the spatial location of the object with a mask in the frames. Besides the videos themselves, contextual information such as thumbnail images, titles, and YouTube categories is also provided. Together with this benchmark dataset, we present a baseline for tag localization using a multiple instance learning approach. Finally, we discuss some open research issues for tag localization in web videos.

Journal: Multimedia Systems
Publication Date: Aug 7, 2014
Citations: 32

