YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls [2412.09168]