Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed [2412.10381]