Sketch-Based Video Object

Sketch-based video object manipulation is an emerging field that uses hand-drawn sketches as input for interacting with and understanding video content. Current research focuses on robust models, often built on transformer architectures and graph convolutional networks, that accurately segment, localize, and even detect objects in video directly from a sketch, bridging the semantic gap between abstract drawings and video data. Sketches offer a more intuitive and efficient way to interact with video data than traditional methods, with potential applications in video editing, content retrieval, and human-computer interaction. Building large-scale annotated datasets is another key focus, since such data improves model performance and generalization.

Papers