Knowledge Distillation
Knowledge distillation is a machine learning technique that transfers knowledge from a large, complex "teacher" model to a smaller, more efficient "student" model, with the goal of retaining as much of the teacher's performance as possible while reducing computational cost. Current research focuses on improving distillation methods across model architectures, including convolutional neural networks, transformers, and large language models, often combining them with parameter-efficient fine-tuning, multi-task learning, and data augmentation to strengthen knowledge transfer. The approach matters because it enables high-performing models to be deployed on resource-constrained devices and addresses challenges around model size, training time, and privacy in applications such as image captioning, speech processing, and medical diagnosis.
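To make the teacher-to-student transfer concrete, the sketch below shows the classic soft-target distillation loss in the spirit of Hinton et al. (2015): the student is trained on a blend of ordinary cross-entropy against ground-truth labels and a KL-divergence term that matches the student's temperature-softened outputs to the teacher's. This is a minimal illustration, assuming PyTorch and a teacher/student pair that emit logits over the same classes; the `temperature` and `alpha` values are illustrative defaults, not prescriptions from any of the papers listed here.

```python
# Minimal sketch of soft-target knowledge distillation (assumes PyTorch).
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend hard-label cross-entropy with a KL term that pulls the
    student's softened distribution toward the teacher's."""
    # Soften both output distributions with the temperature T.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)

    # KL divergence between softened outputs; the T^2 factor keeps
    # gradient magnitudes comparable as the temperature changes.
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2

    # Standard supervised loss on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term

# Typical use in a training step (teacher frozen, student trainable):
# with torch.no_grad():
#     teacher_logits = teacher(inputs)
# loss = distillation_loss(student(inputs), teacher_logits, labels)
# loss.backward()
```

Many of the papers below replace or augment this logit-matching objective (e.g., cross-tokenizer transport or cross-modality features), but the teacher-signal-plus-supervised-loss structure is the common starting point.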
Papers
Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Self-Evolution Knowledge Distillation for LLM-based Machine Translation
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection
Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance