Instruction-Tuned Models
Instruction tuning refines large language models (LLMs) by fine-tuning them on datasets of instructions paired with desired responses, improving their ability to follow diverse instructions and produce helpful, accurate outputs. Current research focuses on building efficient instruction datasets (including programmatic generation), exploring model architectures and parameter-efficient fine-tuning techniques such as LoRA, and evaluating performance across diverse tasks and benchmarks, including reasoning, code generation, and multilingual capabilities. The field matters because it makes LLMs substantially more usable in practice, enabling deployment across a wider range of applications while offering insight into model behavior and alignment with human intentions.
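The instruction datasets mentioned above are usually stored as records pairing an instruction (and optional input) with a desired response, then rendered into a prompt template before fine-tuning. A minimal sketch, assuming Alpaca-style field names (`instruction`, `input`, `output`); real datasets and templates vary:

```python
# Hedged sketch of an instruction-tuning record and prompt template.
# Field names follow the common Alpaca-style convention; they are an
# illustrative assumption, not a fixed standard.
record = {
    "instruction": "Translate the sentence to French.",
    "input": "The weather is nice today.",
    "output": "Il fait beau aujourd'hui.",
}

TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

# Render the training prompt; during fine-tuning the loss is typically
# computed only on the tokens of record["output"] appended after this prompt.
prompt = TEMPLATE.format(**record)
target = record["output"]
```

Masking the loss to the response tokens is a common design choice: it teaches the model to generate answers rather than to reproduce the instruction text.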
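The parameter-efficient technique LoRA mentioned above freezes the pretrained weights and learns a low-rank update. A minimal NumPy sketch of the core idea for a single linear layer, assuming weight `W` with trainable factors `B` and `A` (all names here are illustrative, not a specific library's API):

```python
import numpy as np

# Minimal LoRA sketch: effective weight is W + scale * (B @ A),
# where only the low-rank factors A and B are trained.
rng = np.random.default_rng(0)
d_out, d_in, rank = 8, 8, 2

W = rng.normal(size=(d_out, d_in))        # frozen pretrained weight
A = rng.normal(size=(rank, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, rank))               # zero-init so the update starts at 0

def lora_forward(x, scale=1.0):
    # Apply the adapted layer; B @ A is the low-rank delta to W.
    return x @ (W + scale * (B @ A)).T

x = rng.normal(size=(1, d_in))
# With B initialized to zero, the adapted layer matches the frozen layer.
assert np.allclose(lora_forward(x), x @ W.T)

# Trainable parameters: rank * (d_in + d_out) instead of d_in * d_out.
lora_params = rank * (d_in + d_out)   # 32
full_params = d_in * d_out            # 64
```

The zero initialization of `B` is why LoRA training starts exactly from the pretrained model's behavior; at realistic dimensions (e.g. d = 4096, rank = 8) the parameter savings are far larger than in this toy example.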
Papers
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation
Haonan Li, Fajri Koto, Minghao Wu, Alham Fikri Aji, Timothy Baldwin
Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models
Jiashu Xu, Mingyu Derek Ma, Fei Wang, Chaowei Xiao, Muhao Chen