Low Rank
Low-rank techniques reduce the computational cost and memory requirements of large-scale machine learning models by representing high-dimensional data or model parameters with lower-dimensional structures. Current research focuses on applying low-rank methods to improve the efficiency of large language models (LLMs) and other deep learning architectures, often through low-rank adaptation (LoRA) and its variants as well as matrix and tensor factorization. These advances are significant because they allow larger and more capable models to be trained and deployed on resource-constrained hardware, benefiting applications such as natural language processing, computer vision, and recommendation systems. Theoretical work is also exploring the inherent low-rank structure of trained models to better understand and optimize these methods.
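As a concrete sketch of the core idea, the snippet below adds a trainable low-rank update BA to a frozen linear layer, in the style of LoRA. This is a minimal illustration, not the method of any paper listed here; the layer sizes, rank, and scaling factor are arbitrary example values, and PyTorch is assumed.

```python
# Minimal LoRA-style low-rank adapter (illustrative sketch; rank and alpha are example values).
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen dense layer W plus a trainable low-rank update B @ A."""

    def __init__(self, in_features: int, out_features: int, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # pretrained weights stay frozen
        self.base.bias.requires_grad_(False)
        # Full-rank update would need out_features * in_features parameters;
        # the low-rank factors need only rank * (in_features + out_features).
        self.A = nn.Parameter(torch.randn(rank, in_features) * 0.01)  # rank x d_in
        self.B = nn.Parameter(torch.zeros(out_features, rank))        # d_out x rank, zero init
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen base output plus scaled low-rank correction.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(768, 768, rank=8)
y = layer(torch.randn(4, 768))  # only A and B receive gradients during fine-tuning
```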
Papers
Low-Rank Adaption on Transformer-based Oriented Object Detector for Satellite Onboard Processing of Remote Sensing Images
Xinyang Pu, Feng Xu
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
Andi Han, Jiaxiang Li, Wei Huang, Mingyi Hong, Akiko Takeda, Pratik Jawanpuria, Bamdev Mishra
SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs
Mohammad Mozaffari, Amir Yazdanbakhsh, Zhao Zhang, Maryam Mehri Dehnavi
LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters
Xinyu Zhou, Boris Knyazev, Alexia Jolicoeur-Martineau, Jie Fu