Layer-Wise
Layer-wise training methods in deep learning optimize neural networks by training or adapting individual layers, or blocks of layers, sequentially or in parallel rather than optimizing the entire network end-to-end. Current research explores layer-wise approaches within architectures such as transformers and convolutional neural networks to improve efficiency, address resource constraints (especially in federated learning and edge computing), enhance model interpretability, and mitigate issues such as catastrophic forgetting and overfitting. These techniques can advance both the theoretical understanding of deep learning and its practical applications, for example by enabling the training of larger models on resource-limited devices and by improving model robustness and generalization.
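For concreteness, the sketch below illustrates one common variant of this idea: greedy layer-wise training in PyTorch, where each block is fitted together with a temporary auxiliary head while earlier blocks stay frozen, so no end-to-end backpropagation through the full stack is required. The architecture, dimensions, and toy data here are illustrative assumptions, not drawn from the papers listed below.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy data (placeholder for a real dataset): 10-dim inputs, 3 classes.
X = torch.randn(256, 10)
y = torch.randint(0, 3, (256,))

# A stack of blocks to be trained one at a time.
widths = [10, 32, 32]
blocks = nn.ModuleList(
    nn.Sequential(nn.Linear(widths[i], widths[i + 1]), nn.ReLU())
    for i in range(len(widths) - 1)
)
loss_fn = nn.CrossEntropyLoss()

features = X
for i, block in enumerate(blocks):
    # Greedy step: optimize only this block plus a throwaway linear head.
    # Earlier blocks are already frozen, so each problem stays shallow.
    head = nn.Linear(widths[i + 1], 3)
    opt = torch.optim.Adam(
        list(block.parameters()) + list(head.parameters()), lr=1e-2
    )
    for _ in range(100):
        opt.zero_grad()
        loss = loss_fn(head(block(features)), y)
        loss.backward()
        opt.step()
    # Freeze the trained block: detach its outputs and feed the next stage.
    features = block(features).detach()
    print(f"block {i}: final auxiliary loss {loss.item():.3f}")
```

Variants of this scheme differ mainly in the per-block objective (a supervised auxiliary head as above, a reconstruction loss, or a regularizer on the block's activations) and in whether blocks are trained strictly sequentially or updated in parallel.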
Papers
MV6D: Multi-View 6D Pose Estimation on RGB-D Frames Using a Deep Point-wise Voting Network
Fabian Duffhauss, Tobias Demmler, Gerhard Neumann
Improving the Trainability of Deep Neural Networks through Layerwise Batch-Entropy Regularization
David Peer, Bart Keulen, Sebastian Stabinger, Justus Piater, Antonio Rodríguez-Sánchez