Return-Conditioned Supervised Learning
Return-conditioned supervised learning (RCSL) trains models to predict actions or outputs conditioned on a desired future outcome, such as a target return or other goal value. Current research explores various model architectures, including diffusion models and Decision Transformers, and investigates techniques to improve robustness and efficiency, addressing challenges such as hyperparameter sensitivity and the pitfalls of environmental stochasticity. The approach holds significant promise for offline reinforcement learning, for making conditional generation more efficient across domains, and for building more controllable and aligned AI systems.
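The core idea above reduces policy learning to supervised learning: relabel each state in an offline trajectory with its return-to-go, then fit a model that maps (state, desired return) to the action taken. The sketch below is a minimal, hypothetical illustration of that data preparation step (the trajectory dict layout and function names are assumptions, not from any specific paper or library):

```python
def returns_to_go(rewards, gamma=1.0):
    # Suffix-sum the rewards: g_t = r_t + gamma * g_{t+1}.
    g, running = [0.0] * len(rewards), 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        g[t] = running
    return g

def make_rcsl_dataset(trajectories, gamma=1.0):
    # Flatten trajectories into (state + return-to-go, action) pairs,
    # ready for ordinary supervised training of a conditional policy.
    inputs, targets = [], []
    for traj in trajectories:
        rtg = returns_to_go(traj["rewards"], gamma)
        for state, action, g in zip(traj["states"], traj["actions"], rtg):
            inputs.append(list(state) + [g])  # condition on desired return
            targets.append(action)
    return inputs, targets
```

At evaluation time, the same conditioning input is set to a high target return rather than the dataset's observed return, which is what makes the learned model act as a return-conditioned policy.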