Reinforcement Learning of CPG-regulated Locomotion Controller for a Soft Snake Robot [2207.04899]