Weight Matrix
Weight matrices, the core components of neural networks, are the subject of intense research focused on improving efficiency, generalization, and interpretability. Current efforts explore low-rank approximations, structured matrices (e.g., Monarch, Block Tensor-Train), regularization methods such as weight decay, and parameter-efficient fine-tuning (PEFT) techniques such as LoRA, all aimed at optimizing weight structure and reducing computational cost. These advances are crucial for scaling deep learning models to larger datasets and more complex tasks, and for deepening our understanding of how such models learn and generalize.
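Since the summary cites LoRA as a representative PEFT technique, a minimal sketch of the underlying low-rank idea may help: the pretrained weight W is frozen, and only a rank-r update BA is trained, adding r·(d_in + d_out) parameters instead of d_in·d_out. The sizes, initialization scale, and function names below are illustrative assumptions, not taken from any specific paper.

```python
import numpy as np

# Illustrative sizes; real models use far larger matrices and r << min(d_out, d_in).
d_out, d_in, r = 512, 512, 8

rng = np.random.default_rng(0)
W = rng.standard_normal((d_out, d_in))     # frozen pretrained weight (not updated)
B = np.zeros((d_out, r))                   # low-rank factor, zero-initialized so BA = 0 at start
A = rng.standard_normal((r, d_in)) * 0.01  # low-rank factor, small random init

def forward(x):
    # Effective weight is W + B @ A, but it is never materialized:
    # only A and B (r * (d_in + d_out) parameters) would be trained.
    return W @ x + B @ (A @ x)

x = rng.standard_normal(d_in)
y = forward(x)
print(y.shape)  # (512,)
```

Zero-initializing B is a common choice because it makes the model's output identical to the frozen pretrained model at the start of fine-tuning.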