Value Function
Value functions, central to reinforcement learning and optimal control, estimate the expected cumulative reward from a given state or state-action pair, guiding agents towards optimal behavior. Current research focuses on improving value function approximation accuracy and stability, particularly using neural networks (including shallow ReLU networks and transformers), and developing algorithms that address challenges like offline learning, multi-task optimization, and robustness to noise and uncertainty. These advancements are crucial for enhancing the efficiency and reliability of reinforcement learning agents in diverse applications, from robotics and autonomous systems to personalized recommendations and safe AI.
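To make the core idea concrete, the following is a minimal sketch of tabular TD(0) value estimation, the classic update underlying most value-function methods. It is an illustrative example only, not drawn from any of the papers below; the function name, episode format, and constants are assumptions.

```python
def td0_value_estimation(episodes, alpha=0.1, gamma=0.99):
    """Tabular TD(0): V(s) <- V(s) + alpha * (r + gamma * V(s') - V(s)).

    `episodes` is a list of trajectories, each a list of
    (state, reward, next_state) transitions; next_state is None
    at a terminal transition.
    """
    V = {}  # state -> estimated expected discounted return
    for episode in episodes:
        for s, r, s_next in episode:
            # Terminal states contribute zero future value.
            v_next = 0.0 if s_next is None else V.get(s_next, 0.0)
            td_error = r + gamma * v_next - V.get(s, 0.0)
            V[s] = V.get(s, 0.0) + alpha * td_error
    return V
```

On a deterministic two-state chain A → B → terminal with a reward of 1 on the final transition, repeated updates drive V(B) toward 1 and V(A) toward gamma * V(B); function approximators such as the shallow ReLU networks and transformers mentioned above replace the table `V` with a parameterized model trained on the same TD error.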
Papers
Response Time Improves Choice Prediction and Function Estimation for Gaussian Process Models of Perception and Preferences
Michael Shvartsman, Benjamin Letham, Stephen Keeley
Value function estimation using conditional diffusion models for control
Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind