SketchQL: Video Moment Querying with a Visual Query Interface

Wu, Renzhi; Chunduri, Pramod; Payani, Ali; Chu, Xu; Arulraj, Joy; Rong, Kexin

doi:10.1145/3677140

Citation Details

SketchQL: Video Moment Querying with a Visual Query Interface

Localizing video moments based on the movement patterns of objects is an important task in video analytics. Existing video analytics systems offer two types of querying interfaces based on natural language and SQL, respectively. However, both types of interfaces have major limitations. SQL-based systems require high query specification time, whereas natural language-based systems require large training datasets to achieve satisfactory retrieval accuracy. To address these limitations, we present SketchQL, a video database management system (VDBMS) for offline, exploratory video moment retrieval that is both easy to use and generalizes well across multiple video moment datasets. To improve ease-of-use, SketchQL features avisual query interfacethat enables users to sketch complex visual queries through intuitive drag-and-drop actions. To improve generalizability, SketchQL operates on object-tracking primitives that are reliably extracted across various datasets using pre-trained models. We present a learned similarity search algorithm for retrieving video moments closely matching the user's visual query based on object trajectories. SketchQL trains the model on a diverse dataset generated with a novel simulator, that enhances its accuracy across a wide array of datasets and queries. We evaluate SketchQL on four real-world datasets with nine queries, demonstrating its superior usability and retrieval accuracy over state-of-the-art VDBMSs. more »

Award ID(s):: 2335881

PAR ID:: 10574924

Author(s) / Creator(s):: Wu, Renzhi; Chunduri, Pramod; Payani, Ali; Chu, Xu; Arulraj, Joy; Rong, Kexin

Publisher / Repository:: Association for Computing Machinery

Date Published:: 2024-10-01

Journal Name:: Proceedings of the ACM on Management of Data

Volume:: 2

Issue:: 4

ISSN:: 2836-6573

Page Range / eLocation ID:: 1 to 27

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3677140

More Like this