Sparse Operation

Sparse operation research focuses on optimizing computations over sparse matrices, which are prevalent in machine learning models such as graph neural networks and Mixture-of-Experts, to improve training speed and efficiency. Current work emphasizes specialized libraries and algorithms, including those that exploit block-sparse structures and randomized computation, targeting a range of hardware architectures (GPUs, IPUs). These advances are crucial for training and deploying larger, more complex models while mitigating the computational and memory costs of dense matrix operations, with impact on fields such as natural language processing and recommendation systems.
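
As a minimal sketch of why block-sparse structure helps (not tied to any specific library from the papers below), the example assumes SciPy's Block Sparse Row (BSR) format: a matrix whose 16x16 blocks are mostly zero is stored block-by-block, so a matrix-vector or matrix-matrix product only touches the stored blocks, while the dense equivalent multiplies every entry.

```python
import numpy as np
from scipy.sparse import bsr_matrix

rng = np.random.default_rng(0)

n, block = 1024, 16
nblocks = n // block

# Keep roughly 10% of the 16x16 blocks; zero out the rest to get a block-sparse pattern.
block_mask = rng.random((nblocks, nblocks)) < 0.10
dense = rng.standard_normal((n, n))
dense *= np.kron(block_mask, np.ones((block, block)))

# Block Sparse Row storage keeps only the nonzero blocks.
sparse = bsr_matrix(dense, blocksize=(block, block))

x = rng.standard_normal((n, 64))

# The sparse product visits only stored blocks; the dense product
# multiplies every entry, including the roughly 90% that are zero.
y_sparse = sparse @ x
y_dense = dense @ x

assert np.allclose(y_sparse, y_dense)
print(f"stored blocks: {sparse.nnz // (block * block)} of {nblocks * nblocks}")
```

The same idea underlies GPU and IPU block-sparse kernels: operating on contiguous blocks rather than scattered individual nonzeros keeps memory access regular enough for the hardware to run efficiently while still skipping most of the computation.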

Papers