Leader Reward for POMO-Based Neural Combinatorial Optimization [2405.13947]