Learning Manipulation Skills through Robot Chain-of-Thought with Sparse Failure Guidance [2405.13573]