HowToCaption: Prompting LLMs to Transform Video Annotations at Scale [2310.04900]