Instruction Tuned Model

Instruction tuning refines large language models (LLMs) by fine-tuning them on datasets of instructions and desired responses, aiming to improve their ability to follow diverse instructions and generate more helpful and accurate outputs. Current research focuses on developing efficient instruction datasets (including programmatic generation), exploring various model architectures and parameter-efficient fine-tuning techniques like LoRA, and evaluating model performance across diverse tasks and benchmarks, including those assessing reasoning, code generation, and multilingual capabilities. This field is significant because it enhances the practical usability of LLMs, enabling their deployment in a wider range of applications while also providing valuable insights into model behavior and alignment with human intentions.

Papers

February 16, 2024

Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models
Dheeraj Mekala, Alex Nguyen, Jingbo Shang
Language Model Training Data Instruction Tuning Larger Language Model Instruction Tuned Model Smaller Language Model

February 3, 2024

SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks
Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz
Instruction Tuned Model Computational Social Science Social Context Whispering Llama Linguistic Pragmatic Implicit Communication Social Computing Task

January 26, 2024

MaLLaM -- Malaysia Large Language Model
Husein Zolkepli, Aisyah Razak, Kamarul Adha, Ariff Nazhan
Large Language Model Language Understanding Instruction Tuned Model Language Based Representation

January 1, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff
Large Language Model Instruction Tuning Parameter Efficient Fine Tuning Instruction Tuned Model Effective Tuning

November 18, 2023

Orca 2: Teaching Small Language Models How to Reason
Arindam Mitra, Luciano Del Corro, Shweti Mahajan, Andres Codas, Clarisse Simoes, Sahaj Agarwal, Xuxi Chen, Anastasia Razdaibiedina, Erik Jones, Kriti Aggarwal, Hamid Palangi, Guoqing Zheng, Corby Rosset, Hamed Khanpour, Ahmed Awadallah
Language Model Reasoning Ability Reason Giving Instruction Tuned Model ORCa Behavior Training Signal

November 1, 2023

Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions
Taehyeon Kim, Joonkee Kim, Gihun Lee, Se-Young Yun
Zero Shot Instruction Tuned Model Diverse Instruction Instruction Tuned Language Model Based Refiner

October 30, 2023

October 23, 2023

CITB: A Benchmark for Continual Instruction Tuning
Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad
New Benchmark Continual LEArning Instruction Tuning Natural Language Instruction Instruction Tuned Model Continual Instruction Tuning

October 21, 2023

Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
Manuel Faysse, Gautier Viaud, Céline Hudelot, Pierre Colombo
Zero Shot Instruction Tuned Model Instruction Fine Tuning Industrial Application LLM Based Metric Task Specialization

October 20, 2023

Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li, Yiran Liu, Xingxing Zhang, Wei Lu, Furu Wei
Human Feedback Instruction Tuning Open Source Large Language Model Instruction Tuned Model Probabilistic Ranking

September 30, 2023

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
Xuansheng Wu, Wenlin Yao, Jianshu Chen, Xiaoman Pan, Xiaoyang Wang, Ninghao Liu, Dong Yu
Medical LLM Pre Trained Model Instruction Tuning Human Instruction Instruction Tuned Model

September 14, 2023

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou
Large Language Model Human Instruction Human SAFETY Critical Lesson Instruction Tuned Model Whispering Llama Safety Fine Tuning

August 25, 2023

The Poison of Alignment
Aibek Bekbayev, Sungbae Chun, Yerzat Dulat, James Yamazaki
Alignment Problem Poisoning Attack Reasoning Benchmark Fine Tuned Model Instruction Tuned Model Instruction Dataset

August 7, 2023

UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition
Wenxuan Zhou, Sheng Zhang, Yu Gu, Muhao Chen, Hoifung Poon
Large Language Model Entity Recognition Named Entity Recognition Mutual Distillation Instruction Tuned Model

July 20, 2023

Instruction-following Evaluation through Verbalizer Manipulation
Shiyang Li, Jun Yan, Hai Wang, Zheng Tang, Xiang Ren, Vijay Srinivasan, Hongxia Jin
Global Evaluation Natural Language Processing Task Instruction Following Instruction Tuned Model Verbalizer Manipulation

June 28, 2023

On the Exploitability of Instruction Tuning
Manli Shu, Jiongxiao Wang, Chen Zhu, Jonas Geiping, Chaowei Xiao, Tom Goldstein
Instruction Tuning Data Poisoning Attack Instruction Tuned Model

June 20, 2023

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models
Jiuding Sun, Chantal Shaib, Byron C. Wallace
Instruction Tuned Model Instruction Fine Tuning Instruction Tuned Language Model Zero Shot Adversarial Robustness Instruction Phrasing

June 7, 2023

Instruction Tuned Model

Papers

Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models

SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

MaLLaM -- Malaysia Large Language Model

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Orca 2: Teaching Small Language Models How to Reason

Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions

Automatic Evaluation of Generative Models with Instruction Tuning

Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace

CITB: A Benchmark for Continual Instruction Tuning

Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications

Tuna: Instruction Tuning using Feedback from Large Language Models

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

The Poison of Alignment

UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition

Instruction-following Evaluation through Verbalizer Manipulation

On the Exploitability of Instruction Tuning

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models

INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources