LLM Fine-Tuning
Fine-tuning large language models (LLMs) adapts pre-trained models to specific tasks using comparatively small datasets, yielding better task performance at a fraction of the cost of training from scratch. Current research emphasizes parameter-efficient methods such as LoRA, techniques to mitigate catastrophic forgetting and training-data imbalance, optimization algorithms such as DPO and SVRG, and diverse architectures including Mixture-of-Experts. This work is crucial for deploying LLMs in real-world applications: it enables customization for specific domains while addressing resource constraints and safety concerns.
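To make the parameter-efficient idea concrete, below is a minimal sketch of LoRA fine-tuning setup, assuming the Hugging Face transformers and peft libraries; the base model, rank, and other hyperparameters are illustrative choices, not taken from the papers listed here.

```python
# Minimal LoRA setup sketch (assumes `pip install transformers peft`).
# Model name and hyperparameters below are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, TaskType

model_name = "gpt2"  # hypothetical base model; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# LoRA freezes the pre-trained weights and learns low-rank update
# matrices, so only a small fraction of parameters is trained.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                # rank of the low-rank update matrices
    lora_alpha=16,      # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["c_attn"],  # GPT-2's fused attention projection
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # reports the small trainable fraction
```

From here, the wrapped model can be passed to any standard training loop or trainer; only the LoRA adapter weights receive gradients, which is what keeps memory and storage costs low.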
Papers
CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation
Yuting Liu, Jinghao Zhang, Yizhou Dang, Yuliang Liang, Qiang Liu, Guibing Guo, Jianzhe Zhao, Xingwei Wang
Minor SFT loss for LLM fine-tune to increase performance and reduce model deviation
Shiming Xie, Hong Chen, Fred Yu, Zeye Sun, Xiuyu Wu
PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning
Shiva Kumar Pentyala, Zhichao Wang, Bin Bi, Kiran Ramnath, Xiang-Bo Mao, Regunathan Radhakrishnan, Sitaram Asur, Na Cheng
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model
Feijie Wu, Zitao Li, Yaliang Li, Bolin Ding, Jing Gao