LLM Fine-Tuning
Fine-tuning adapts a pre-trained large language model (LLM) to a specific task using a comparatively small dataset, which typically costs far less compute and data than training from scratch. Current research emphasizes parameter-efficient methods such as LoRA, along with techniques to mitigate catastrophic forgetting and training data imbalance. It often draws on training objectives and optimizers such as Direct Preference Optimization (DPO) and stochastic variance-reduced gradient (SVRG), and explores diverse model architectures, including Mixture-of-Experts. This area is crucial for deploying LLMs in real-world applications, enabling customization for various domains while addressing resource constraints and safety concerns.
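As a rough illustration of why a parameter-efficient method like LoRA reduces fine-tuning cost, the sketch below keeps a weight matrix W frozen and learns only a low-rank update, so the layer applies W + (alpha / r) · B · A. The helper names (`matmul`, `lora_effective_weight`, `trainable_fraction`) and the toy dimensions are assumptions made for this example, not taken from any particular paper or library.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A for frozen W and trainable factors A, B."""
    r = len(a)  # rank of the update = number of rows of A
    delta = matmul(b, a)  # (d_out x r) @ (r x d_in) -> d_out x d_in
    scale = alpha / r
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def trainable_fraction(d_out, d_in, r):
    """Share of parameters trained by LoRA relative to the full matrix."""
    return r * (d_out + d_in) / (d_out * d_in)


random.seed(0)
d_out, d_in, r, alpha = 4, 6, 2, 4.0  # toy sizes, chosen only for illustration

w = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]  # frozen
a = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]   # trainable
b = [[0.0] * r for _ in range(d_out)]                                  # trainable, zero init

# With B initialized to zero, the adapted layer starts out identical to the
# pre-trained one; training then changes only A and B.
w_eff = lora_effective_weight(w, a, b, alpha)
```

For a square 4096 × 4096 projection with rank 8, `trainable_fraction(4096, 4096, 8)` evaluates to 1/256, so under these assumptions roughly 0.4% of that matrix's parameters would be updated.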
Papers
January 12, 2025
January 7, 2025
January 5, 2025
December 29, 2024
December 22, 2024
December 21, 2024
December 18, 2024
December 10, 2024
December 9, 2024
December 6, 2024
December 2, 2024
November 24, 2024
November 16, 2024
November 14, 2024
November 10, 2024
November 7, 2024
October 29, 2024
October 22, 2024