SketchQL: Video Moment Querying with a Visual Query Interface

Renzhi Wu,Xu Chu,Ali Payani,Pramod Chunduri,Joy Arulraj,Kexin Rong

doi:10.1145/3677140

Abstract

Localizing video moments based on the movement patterns of objects is an important task in video analytics. Existing video analytics systems offer two types of querying interfaces based on natural language and SQL, respectively. However, both types of interfaces have major limitations. SQL-based systems require high query specification time, whereas natural language-based systems require large training datasets to achieve satisfactory retrieval accuracy. To address these limitations, we present SketchQL, a video database management system (VDBMS) for offline, exploratory video moment retrieval that is both easy to use and generalizes well across multiple video moment datasets. To improve ease-of-use, SketchQL features a visual query interface that enables users to sketch complex visual queries through intuitive drag-and-drop actions. To improve generalizability, SketchQL operates on object-tracking primitives that are reliably extracted across various datasets using pre-trained models. We present a learned similarity search algorithm for retrieving video moments closely matching the user's visual query based on object trajectories. SketchQL trains the model on a diverse dataset generated with a novel simulator, that enhances its accuracy across a wide array of datasets and queries. We evaluate SketchQL on four real-world datasets with nine queries, demonstrating its superior usability and retrieval accuracy over state-of-the-art VDBMSs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SketchQL: Video Moment Querying with a Visual Query Interface

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Management of Data

Lead the way for us

Similar Papers

SketchQL Demonstration: Zero-Shot Video Moment Querying with Sketches
Renzhi Wu ... Kexin Rong
Proceedings of the VLDB Endowment | VOL. 17
Renzhi Wu, et. al.Renzhi Wu ... Kexin Rong
01 Aug 2024
Proceedings of the VLDB Endowment | VOL. 17

Natural Language Aided Visual Query Building for Complex Data Access
Shimei Pan ... Michelle Zhou
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 24
Shimei Pan, et. al.Shimei Pan ... Michelle Zhou
11 Jul 2010
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 24

An extensible database architecture for nationwide power quality monitoring
Dilek Küçük ... Muammer Ermiş
International Journal of Electrical Power & Energy Systems | VOL. 32
Dilek Küçük, et. al.Dilek Küçük ... Muammer Ermiş
16 Dec 2009
International Journal of Electrical Power & Energy Systems | VOL. 32

Localizing Moments in Video with Temporal Language
Lisa Anne Hendricks ... Trevor Darrell
-
Lisa Anne Hendricks, et. al.Lisa Anne Hendricks ... Trevor Darrell
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SketchQL: Video Moment Querying with a Visual Query Interface

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Management of Data