Vocabulary Trimming

Vocabulary trimming reduces the size of a model's vocabulary to improve efficiency and resource use without significantly sacrificing performance. Current research applies the technique to several model families, including transformer-based language models and latent Dirichlet allocation (LDA) topic models, often alongside other compression methods such as knowledge distillation. Findings on its effectiveness are mixed: some studies report substantial reductions in model size with minimal performance loss, while others observe performance degradation, so trimming strategies and evaluation metrics need to be chosen with care. The area matters for deploying large language models in resource-constrained environments and for making natural language processing tasks more efficient.
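
As a rough illustration of the idea, the sketch below trims a toy vocabulary by token frequency and shrinks the embedding table to match. It is not taken from any of the papers listed here: the function names `trim_vocabulary` and `remap`, the `min_count` threshold, the `<unk>` fallback token, and the toy data are all assumptions made for the example. The listed papers study other strategies as well, such as dynamic pruning at inference time and trimming combined with distillation.

```python
from collections import Counter


def trim_vocabulary(vocab, embeddings, corpus_ids, min_count=2, keep=("<unk>",)):
    """Drop tokens seen fewer than min_count times in corpus_ids.

    vocab:      dict mapping token string -> integer id (hypothetical format)
    embeddings: list of vectors, indexed by token id
    corpus_ids: list of sentences, each a list of token ids
    Returns the trimmed vocab, the trimmed embeddings, and an old-id -> new-id map.
    """
    counts = Counter(tok_id for sent in corpus_ids for tok_id in sent)
    id_to_token = {i: t for t, i in vocab.items()}

    # Keep protected tokens unconditionally, and any token that is frequent enough.
    kept_old_ids = [
        old_id
        for old_id in sorted(id_to_token)
        if id_to_token[old_id] in keep or counts[old_id] >= min_count
    ]

    # Renumber the surviving tokens contiguously from 0.
    old_to_new = {old: new for new, old in enumerate(kept_old_ids)}
    new_vocab = {id_to_token[old]: new for old, new in old_to_new.items()}

    # Keep only the embedding rows of surviving tokens, in the new id order.
    new_embeddings = [embeddings[old] for old in kept_old_ids]
    return new_vocab, new_embeddings, old_to_new


def remap(sentence_ids, old_to_new, unk_id):
    """Translate old token ids to new ones; trimmed tokens fall back to unk_id."""
    return [old_to_new.get(i, unk_id) for i in sentence_ids]


# Toy data (assumed for illustration only).
vocab = {"<unk>": 0, "the": 1, "cat": 2, "sat": 3, "zyzzyva": 4}
embeddings = [[0.0, 0.0], [0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8]]
corpus_ids = [[1, 2, 3], [1, 2], [1, 3, 4]]

new_vocab, new_embeddings, old_to_new = trim_vocabulary(vocab, embeddings, corpus_ids)
print(new_vocab)
print(remap([1, 4, 2], old_to_new, new_vocab["<unk>"]))
```

In this toy run the rare token "zyzzyva" is removed, the embedding table drops from five rows to four, and any later occurrence of the removed token maps to `<unk>`. That fallback is where the performance loss mentioned above can come from: the more tokens are trimmed, the more inputs collapse onto the unknown token or, in subword models, onto longer sequences of smaller pieces.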

Papers

October 24, 2024

Dynamic Vocabulary Pruning in Early-Exit LLMs
Jort Vincenti, Karim Abdel Sadek, Joan Velja, Matteo Nulli, Metod Jazbec
Tags: Large Language Model, LLM Inference, Vocabulary Size, Vocabulary Trimming

July 13, 2024

Minimizing PLM-Based Few-Shot Intent Detectors
Haode Zhang, Albert Y.S. Lam, Xiao-Ming Wu
Tags: Large Language Model, Intent Detection, Pre Trained Language, Vocabulary Trimming

March 30, 2024

An Analysis of BPE Vocabulary Trimming in Neural Machine Translation
Marco Cognetta, Tatsuya Hiraoka, Naoaki Okazaki, Rico Sennrich, Yuval Pinter
Tags: General Analysis, Neural Machine Translation, Rare Word, BPE Vocabulary, Vocabulary Trimming

November 24, 2023

Analysing the Impact of Removing Infrequent Words on Topic Quality in LDA Models
Victor Bystrov, Viktoriia Naboka-Krell, Anna Staszewska-Bystrova, Peter Winker
Tags: Global Impact, Topic Analysis, Latent Dirichlet Allocation, Text Processing, Linear Discriminant Analysis, Vocabulary Trimming

May 4, 2022

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary
Alina Kolesnikova, Yuri Kuratov, Vasily Konovalov, Mikhail Burtsev
Tags: Knowledge Distillation, Natural Language Processing Task, Transformer Language Model, Constructive Reduction, Enhanced Vocabulary, Vocabulary Trimming
