Knowledge Distillation Loss

A knowledge distillation loss is a training objective used to transfer knowledge from a large, complex "teacher" model to a smaller, more efficient "student" model, typically by encouraging the student to match the teacher's softened output distribution alongside the ground-truth labels; the result is a student that approaches the teacher's performance at a fraction of the computational cost. Current research focuses on optimizing the distillation process itself, for example through adaptive weighting of the loss terms, attention-based transfer, and gradient reweighting, to address challenges such as catastrophic forgetting and imbalanced data. The approach has proven valuable across diverse applications, including image recognition, speech processing, and natural language processing, because it enables the deployment of high-performing models on resource-constrained devices and supports continual learning scenarios.
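As a concrete illustration of the objective described above, the sketch below combines a temperature-softened KL-divergence term between teacher and student outputs with the usual cross-entropy on hard labels, following the classic formulation. It is a minimal PyTorch sketch: the function name and the fixed temperature and alpha values are illustrative choices, not taken from any particular paper (the adaptive-weighting methods mentioned above learn or schedule this balance rather than fixing it).

```python
# Minimal sketch of a standard knowledge distillation loss:
# weighted sum of (a) KL divergence between temperature-softened teacher and
# student distributions and (b) cross-entropy on the ground-truth labels.
# Function and argument names are illustrative.
import torch
import torch.nn.functional as F


def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 4.0,
                      alpha: float = 0.5) -> torch.Tensor:
    """alpha balances the soft (teacher) and hard (label) terms."""
    # Soften both distributions with the temperature.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)

    # KL term, scaled by T^2 so gradient magnitudes stay comparable
    # across temperatures (standard practice).
    kd_term = F.kl_div(soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2

    # Hard-label cross-entropy on the student's raw logits.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term


# Example usage with random tensors standing in for model outputs.
if __name__ == "__main__":
    student_logits = torch.randn(8, 10, requires_grad=True)
    teacher_logits = torch.randn(8, 10)
    labels = torch.randint(0, 10, (8,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()
    print(loss.item())
```

In practice the teacher is run in evaluation mode with gradients disabled, and only the KD term depends on its outputs; variants differ mainly in what is matched (logits, intermediate features, attention maps) and in how the weighting between the terms is set.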

Papers