Panoptic Scene Graph

Panoptic Scene Graph Generation (PSG) aims to create a comprehensive visual understanding of images by simultaneously segmenting objects (including both "things" and "stuff") and identifying their relationships. Current research focuses on improving the accuracy and robustness of PSG models, particularly addressing challenges like long-tailed relation distributions and biased annotations, often employing transformer-based architectures and incorporating language models for enhanced relation prediction. Advances in PSG are significant for broader scene understanding tasks and have implications for applications such as robotics, autonomous driving, and image captioning, enabling more nuanced and complete interpretations of visual data.

Papers