Referring Expression
Referring expression (RE) research focuses on how language is used to identify specific objects or entities within a context, whether visual (images, videos) or auditory (speech). Current research emphasizes developing models, often leveraging large language models (LLMs) and multimodal architectures, that generate accurate and contextually appropriate REs, and accurately comprehend and locate the referred entities. This work is significant for advancing natural language understanding, improving human-computer interaction, and enabling applications such as improved speech-based disease detection and more effective assistive technologies.
Papers
May 16, 2022
April 13, 2022
March 22, 2022
February 25, 2022