Weight Sharing

Weight sharing, a technique in which multiple parts of a neural network reuse the same parameters, aims to improve efficiency and performance by reducing model size and computational cost. Current research focuses on applying weight sharing to a range of architectures, including transformers, recurrent neural networks (RNNs), and convolutional neural networks (CNNs), often in the context of federated learning, continual learning, and neural architecture search. The approach offers significant advantages in resource-constrained environments and large-scale applications, improving both training efficiency and the deployment of deep learning models.
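
To make the idea concrete, the following is a minimal sketch (in PyTorch, chosen here only for illustration; the class and parameter names are hypothetical) of two common forms of weight sharing: reusing a single block at every depth so parameter count does not grow with the number of layers, and tying the output projection to the input embedding matrix.

```python
import torch
import torch.nn as nn

class SharedLayerLM(nn.Module):
    """Toy model illustrating weight sharing (assumed example, not from the papers above)."""

    def __init__(self, vocab_size=1000, d_model=64, num_layers=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # One block whose parameters are reused at every depth,
        # so the parameter count is independent of num_layers.
        self.shared_block = nn.Sequential(
            nn.Linear(d_model, d_model),
            nn.ReLU(),
        )
        self.num_layers = num_layers
        self.out = nn.Linear(d_model, vocab_size, bias=False)
        # Tie the output projection to the embedding weights: both modules
        # now reference the same tensor, halving the embedding-related parameters.
        self.out.weight = self.embed.weight

    def forward(self, token_ids):
        h = self.embed(token_ids)
        for _ in range(self.num_layers):  # same weights applied at each depth
            h = self.shared_block(h)
        return self.out(h)

model = SharedLayerLM()
logits = model(torch.randint(0, 1000, (2, 8)))
print(logits.shape)  # torch.Size([2, 8, 1000])
```

Because the reused block and the tied embedding each appear only once in the parameter list, gradients from every point of use accumulate on the same tensors during training, which is what yields the memory and compute savings described above.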

Papers