Semi Supervised Reward

Semi-supervised reward learning aims to reduce the reliance on expensive, human-labeled data in training reward models for reinforcement learning. Current research focuses on leveraging unlabeled data through techniques like iterative self-training, pseudo-labeling, and data augmentation, often integrated with various model architectures including deep neural networks and differentiable decision trees. These methods improve the efficiency and scalability of reward learning, impacting diverse applications such as robotics, automated driving, and large language model alignment by enabling the development of more effective and human-aligned AI systems with less human intervention. Active learning strategies are also being explored to minimize the number of human queries needed for effective reward model training.

Papers