Gradient Manipulation

Gradient manipulation is a growing research area that studies how altering gradients during model training or inference can improve performance, enhance robustness, or expose model vulnerabilities. Current work focuses on applications such as improving the efficiency and safety of reinforcement learning, increasing the transferability of adversarial attacks, and building more effective detectors for backdoors and out-of-distribution samples. These techniques affect machine learning domains ranging from speech and image recognition to safe AI development by addressing core challenges in model training, optimization, and security, and the resulting advances contribute to more robust, efficient, and trustworthy AI systems.
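
As a rough illustration of what "altering gradients" means in practice, the sketch below (PyTorch, with a toy linear model, dummy data, and hypothetical hyperparameters such as max_norm, noise_std, and epsilon that are not taken from any specific paper) shows the two settings mentioned above: editing parameter gradients between loss.backward() and optimizer.step() during training, and using input gradients at inference time for an FGSM-style perturbation.

```python
# Illustrative sketch only: a generic PyTorch training step in which the raw
# gradients are modified before the optimizer update, plus an FGSM-style use
# of input gradients at inference time. Model, data, and hyperparameters
# (max_norm, noise_std, epsilon) are placeholders, not taken from any paper.
import torch
import torch.nn as nn

model = nn.Linear(10, 2)                              # toy stand-in network
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(32, 10)                               # dummy batch
y = torch.randint(0, 2, (32,))

# --- Training-time gradient manipulation --------------------------------
optimizer.zero_grad()
loss_fn(model(x), y).backward()                       # populates p.grad

max_norm = 1.0                                        # hypothetical clip threshold
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm)

noise_std = 1e-3                                      # hypothetical noise scale
with torch.no_grad():
    for p in model.parameters():
        if p.grad is not None:
            p.grad.add_(noise_std * torch.randn_like(p.grad))  # perturb gradients

optimizer.step()                                      # update uses the modified grads

# --- Inference-time gradient use (FGSM-style perturbation) --------------
epsilon = 0.03                                        # hypothetical attack budget
x_adv = x.clone().requires_grad_(True)
adv_loss = loss_fn(model(x_adv), y)
input_grad, = torch.autograd.grad(adv_loss, x_adv)    # gradient w.r.t. the input
x_adv = (x_adv + epsilon * input_grad.sign()).detach()
```

Specific methods differ in what they substitute for these placeholder operations, for example projecting, rescaling, or applying sign-based transforms to the gradients before they are used.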

Papers