Sub Linear Regret

Sub-linear regret focuses on designing algorithms that minimize the cumulative difference between an online learning algorithm's performance and that of an optimal strategy with perfect foresight. Current research emphasizes various online learning settings, including contextual bandits, multi-armed bandits, and online convex optimization, often employing techniques like Thompson sampling, upper confidence bounds (UCB), and Follow-the-Perturbed-Leader (FPL). These advancements are crucial for improving the efficiency and robustness of algorithms in dynamic environments, with applications ranging from personalized recommendations and resource allocation to adaptive control and federated learning. The pursuit of sub-linear regret drives the development of more efficient and adaptable algorithms across numerous machine learning domains.

Papers

June 12, 2023

Efficiently Learning the Graph for Semi-supervised Learning
Dravyansh Sharma, Maxwell Jones
Graph Drawing Semi Supervised Learning Sub Linear Regret Dense Graph Graph Regularization Learning Complexity

May 29, 2023

Contextual Bandits with Budgeted Information Reveal
Kyra Gan, Esmaeil Keyvanshokooh, Xueqing Liu, Susan Murphy
Contextual Bandit Treatment Effect Sub Linear Regret Contextual Bandit Algorithm Personalized Treatment

May 24, 2023

Regret Matching+: (In)Stability and Fast Convergence in Games
Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo
Video Game Sub Linear Regret Extensive Form Game Normal Form Game Exponential Convergence Rate Stability Threshold Regret Matching Large Scale Game

May 15, 2023

Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension
Yue Wu, Jiafan He, Quanquan Gu
Sample Complexity Sub Linear Regret \Ell_p$ Norm

February 2, 2023

Learning with Exposure Constraints in Recommendation Systems
Omer Ben-Porat, Rotem Torkan
LeArning Abstract Recommendation System Sub Linear Regret Contextual Multi Armed Bandit Ethnic Medium Equal Exposure Corridor Content Creator

January 31, 2023

Online Learning in Dynamically Changing Environments
Changlong Wu, Ananth Grama, Wojciech Szpankowski
Online Learning Environment Feature Regret Minimization Regret Analysis Convex Loss Sub Linear Regret Novel Stealth Loss

January 24, 2023

Gossiped and Quantized Online Multi-Kernel Learning
Tomas Ortega, Hamid Jafarkhani
Sub Linear Regret Centralized Learning Gossip Algorithm Kernel Learning Online Kernel

January 22, 2023

Doubly Adversarial Federated Bandits
Jialin Yi, Milan Vojnović
Multi Armed Bandit Bandit Feedback Bandit Algorithm Sub Linear Regret Oblivious Adversary

December 13, 2022

Interactive Learning with Pricing for Optimal and Stable Allocations in Markets
Yigit Efe Erginbas, Soham Phade, Kannan Ramchandran
Large Scale Resource Allocation Collaborative Filtering Curious Price Sub Linear Regret Interactive Learning Market Analysis Combinatorial Bandit Stable Payoff Allocation

October 24, 2022

Private Online Prediction from Experts: Separations and Faster Rates
Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar
Expert Knowledge Regret Bound Separation Performance Sub Linear Regret Fast Rate Oblivious Adversary Private Prediction Approximate Differential Privacy

October 21, 2022

Competing Bandits in Time Varying Matching Markets
Deepan Muthirayan, Chinmay Maheshwari, Pramod P. Khargonekar, Shankar Sastry
Sub Linear Regret Single Agent Learning Matching Market

September 8, 2022

Online Low Rank Matrix Completion
Prateek Jain, Soumyabrata Pal
Matrix Completion Sub Linear Regret User Item Low Rank Structure Low Rank Matrix Completion

September 6, 2022

A Zeroth-Order Momentum Method for Risk-Averse Online Convex Games
Zifan Wang, Yi Shen, Zachary I. Bell, Scott Nivison, Michael M. Zavlanos, Karl H. Johansson
Bandit Feedback Sub Linear Regret Risk Averse Reinforcement Learning Cournot Game Convex Game

August 21, 2022

Energy-aware Scheduling of Virtualized Base Stations in O-RAN with Online Learning
Michail Kalntis, George Iosifidis
Online Learning Online Learning Algorithm Sub Linear Regret Performance Guarantee Open Radio Access Network Base Station Optimality Gap Energy Efficient Scheduling

July 23, 2022

Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee, Avishek Ghosh, Sayak Ray Chowdhury, Aditya Gopalan
Scientific Inference Environment Exploration Linear Bandit Future Implication Sub Linear Regret Smallest Eigenvalue

July 16, 2022

Online Prediction in Sub-linear Space
Binghui Peng, Fred Zhang
Sub Linear Regret Adaptive Adversary Online Prediction

July 8, 2022

Interactive Recommendations for Optimal Allocations in Markets with Constraints
Yigit Efe Erginbas, Soham Phade, Kannan Ramchandran
Multi Armed Bandit Resource Allocation Recommendation System Participation Constraint Personalized Recommendation Sub Linear Regret Market Analysis Combinatorial Bandit Optimal Allocation

May 10, 2022

Universal Caching
Ativ Joshi, Abhishek Sinha
Regret Bound Regret Minimization Sub Linear Regret Service Caching Static Regret

April 22, 2022

MNL-Bandits under Inventory and Limited Switches Constraints
Hongbin Zhang, Yu Yang, Feng Wu, Qixin Zhang
Sub Linear Regret Choice Model Assortment Optimization Multinomial Logit Logistic Bandit

April 20, 2022

Online Caching with no Regret: Optimistic Learning via Recommendations
Naram Mhaisen, George Iosifidis, Douglas Leith
Movie Recommendation Simple Regret Sub Linear Regret Follow the Regularized Leader Service Caching Optimistic Learning Bipartite Network