Image Narrative Generation

Image narrative generation focuses on automatically creating coherent stories from images or sequences of images, aiming to bridge the gap between visual and textual information. Current research emphasizes developing models that maintain character consistency across multiple frames, improve story coherence and visual fidelity, and offer control over narrative style and emotional arc, often leveraging large pre-trained language and vision models and incorporating techniques like bidirectional generation and attention mechanisms. This field is significant for its potential applications in entertainment, education, and creative content generation, as well as for advancing our understanding of multimodal representation learning and narrative structure.

Papers