Input Gradient
Input gradients, the rate of change of a model's output with respect to its input, are central to understanding and improving deep learning models. Current research focuses on leveraging input gradients to enhance model robustness against adversarial attacks, to generate more faithful and reliable explanations of model decisions (e.g., using Grad-CAM and its variants), and to improve the interpretability of complex models such as vision transformers and large language models. These efforts matter because they address critical challenges in deploying deep learning models responsibly and reliably across applications, from medical diagnosis to AI safety.
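As a minimal sketch of the core idea, the input gradient can be computed directly with automatic differentiation: differentiate a scalar model output (here, the top-class logit) with respect to the input tensor. The tiny classifier and input shapes below are illustrative assumptions, not taken from any specific paper.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical small classifier used purely for illustration.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 3))
model.eval()

# Mark the input as requiring a gradient so autograd tracks it.
x = torch.randn(1, 8, requires_grad=True)
logits = model(x)
target_class = logits.argmax(dim=1)

# Differentiate the target-class score with respect to the input.
score = logits[0, target_class]
score.backward()

# The absolute input gradient serves as a simple saliency/attribution map.
saliency = x.grad.abs()
print(saliency.shape)  # torch.Size([1, 8])
```

The same gradient signal underpins both uses mentioned above: its magnitude highlights influential input features (explanation), while its sign direction is what adversarial attacks such as FGSM perturb along (robustness).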
15 papers