Orthogonal Training

Orthogonal training improves the stability, robustness, and generalization of deep learning models by imposing orthogonality constraints on their weight matrices. Current research applies the method to a range of architectures, including vision-language models and convolutional neural networks, often using techniques such as Cayley parameterization and polar decomposition-based initialization to achieve and maintain orthogonality. By mitigating vanishing/exploding gradients and overfitting, this approach shows promise across diverse applications, such as image classification, adversarial robustness, and the detection of synthetically generated images.
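To illustrate one of the techniques mentioned above, here is a minimal NumPy sketch of the Cayley parameterization: any skew-symmetric matrix A maps to an orthogonal matrix via Q = (I - A)(I + A)^{-1}, so a model can optimize A freely while the resulting weight matrix stays exactly orthogonal. The function and variable names here are illustrative, not from any specific paper.

```python
import numpy as np

def cayley(A: np.ndarray) -> np.ndarray:
    """Map a skew-symmetric matrix A to an orthogonal matrix
    via the Cayley transform Q = (I - A) @ inv(I + A)."""
    n = A.shape[0]
    I = np.eye(n)
    # I + A is always invertible for skew-symmetric A,
    # since its eigenvalues are 1 + (purely imaginary) != 0.
    return (I - A) @ np.linalg.inv(I + A)

rng = np.random.default_rng(0)
M = rng.standard_normal((4, 4))
A = (M - M.T) / 2.0          # skew-symmetric: A.T == -A
Q = cayley(A)

# Q is orthogonal up to floating-point error: Q @ Q.T == I
print(np.allclose(Q @ Q.T, np.eye(4)))  # True
```

In a training loop, gradients would flow through the unconstrained parameter A (e.g. via an autodiff framework), which keeps the effective weight matrix on the orthogonal group without any explicit projection step.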

Papers