Perceptual Information

Perceptual information research focuses on understanding how sensory inputs are processed and interpreted to form our experience of the world, exploring both biological and artificial systems. Current research emphasizes integrating perceptual knowledge from various modalities (vision, audio, language) using techniques like multimodal learning with vision transformers, diffusion models, and large language models to improve tasks such as object recognition, image classification, and cross-modal retrieval. These advancements have significant implications for improving AI systems' ability to understand and interact with the world in a more human-like way, as well as for gaining deeper insights into the neural mechanisms underlying human perception.

Papers