Best Arm

"Best arm" identification, a core problem in multi-armed bandit research, focuses on efficiently identifying the optimal option (arm) from a set with unknown reward distributions. Current research emphasizes developing algorithms, such as those based on confidence intervals and successive elimination, that minimize the number of trials needed to identify the best arm, particularly in non-stationary environments or with resource constraints like limited memory or communication bandwidth. This field is crucial for optimizing resource allocation in various applications, including robotics (e.g., controlling robotic arms), clinical trials, and online advertising, where efficient decision-making under uncertainty is paramount.

33papers

Papers

May 17, 2025

Variance-Optimal Arm Selection: Regret Minimization and Best Arm Identification
Best Arm Person Identification Hybrid Reward Arm Selection Sub Optimality Gap Sub Gaussian Regret Minimization Norm Bounded

May 11, 2025

UniDiffGrasp: A Unified Framework Integrating VLM Reasoning and VLM-Guided Part Diffusion for Open-Vocabulary Constrained Grasping with Dual Arms
Task Oriented Grasping Best Arm Open Vocabulary Composite Diffusion LLM Reasoning

April 11, 2025

Influential Bandits: Pulling an Arm May Change the Environment
Mutual Influence Restless Bandit Multi Armed Bandit Bandit Identification Best Arm Bandit Algorithm Environment Feature

March 26, 2025

ARMO: Autoregressive Rigging for Multi-Category Objects
3D Shape Generation Large Scale Generative Model Best Arm Autoregressive Neural Network Univariate Symbolic Skeleton

March 11, 2025

Concept-Driven Deep Learning for Enhanced Protein-Specific Molecular Generation
Best Arm Concept Learner Molecule Generation Molecular Generation Faithful Generation

March 6, 2025

GeoFIK: A Fast and Reliable Geometric Solver for the IK of the Franka Arm based on Screw Theory Enabling Multiple Redundancy Parameters
Inverse Kinematics Geometry Problem Solver Best Arm

March 4, 2025

Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits
Streaming Algorithm Stochastic Way Multi Armed Bandit Stochastic Multi Armed Bandit Lower Bound Gap Dependent Memory Regret Best Arm

February 18, 2025

D3-ARM: High-Dynamic, Dexterous and Fully Decoupled Cable-driven Robotic Arm
Robotic Arm Optical Network Best Arm Coupling Mechanism Cable Driven Real Power Cable Manipulation

January 30, 2025

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method
Retrieval Performance Open Domain Question Complex Query Best Arm Retrieval Technique

January 28, 2025

One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning
Low Rank Adaptation LeArning Abstract Best Arm Single CLIP Shot Learning

November 16, 2024

ARM: Appearance Reconstruction Model for Relightable 3D Generation
3D Mesh Single Image 3D Reconstruction Best Arm Appearance Modeling Sparse View Image

November 4, 2024

Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification
High Probability Bound Best Arm Identification Person Identification Braking Control Tail Bound Polynomial System Best Arm Time Matter

October 31, 2024

A Fast and Model Based Approach for Evaluating Task-Competence of Antagonistic Continuum Arms
Soft Robotic Arm Model Based Cable Driven Continuum Best Arm

October 10, 2024

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Regularized Reinforcement Learning Reward Model Test Time Best Arm Autoregressive Text Generation

September 3, 2024

Three-dimensional geometric resolution of the inverse kinematics of a 7 degree of freedom articulated arm
Robotic Arm Inverse Kinematics Freedom Matter Spatial Resolution Best Arm Geometric Modeling Different Degree

September 1, 2024

Kinematics & Dynamics Library for Baxter Arm
Non Humanoid Robot Baxter Robot Redundant Manipulator Open Source Library Kinematic Control Best Arm Kinematic Theory

August 26, 2024

Representative Arm Identification: A fixed confidence approach to identify cluster representatives
Disentangling Confidence Score Distribution Best Arm Identification Sample Complexity Best Arm Multi Armed Bandit

August 22, 2024

Identifying the Best Arm in the Presence of Global Environment Shifts
Adversarial Bandit Non Stationary Selection Policy Semi Bandit Speech Presence Environmental Change Best Arm

July 29, 2024

Design and Control of a Novel Six-Degree-of-Freedom Hybrid Robotic Arm
External Control Parallel Robot Hybrid Robot Fruit Harvesting Best Arm Agricultural Robot Robotic Arm Product Design Long Form Novel

July 5, 2024

On the Low-Rank Parametrization of Reward Models for Controlled Language Generation
High Efficiency Task Specific Reward Accurate Decoding Language Model Best Arm

Best Arm

Papers

Variance-Optimal Arm Selection: Regret Minimization and Best Arm Identification

UniDiffGrasp: A Unified Framework Integrating VLM Reasoning and VLM-Guided Part Diffusion for Open-Vocabulary Constrained Grasping with Dual Arms

Influential Bandits: Pulling an Arm May Change the Environment

ARMO: Autoregressive Rigging for Multi-Category Objects

Concept-Driven Deep Learning for Enhanced Protein-Specific Molecular Generation

GeoFIK: A Fast and Reliable Geometric Solver for the IK of the Franka Arm based on Screw Theory Enabling Multiple Redundancy Parameters

Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits

D3-ARM: High-Dynamic, Dexterous and Fully Decoupled Cable-driven Robotic Arm

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning

ARM: Appearance Reconstruction Model for Relightable 3D Generation

Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification

A Fast and Model Based Approach for Evaluating Task-Competence of Antagonistic Continuum Arms

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Three-dimensional geometric resolution of the inverse kinematics of a 7 degree of freedom articulated arm

Kinematics & Dynamics Library for Baxter Arm

Representative Arm Identification: A fixed confidence approach to identify cluster representatives

Identifying the Best Arm in the Presence of Global Environment Shifts

Design and Control of a Novel Six-Degree-of-Freedom Hybrid Robotic Arm

On the Low-Rank Parametrization of Reward Models for Controlled Language Generation