Human Instruction

Human instruction following in AI focuses on developing models capable of accurately and reliably executing complex tasks based on diverse instructions, encompassing text, images, and audio. Current research emphasizes improving model alignment through techniques like instruction tuning and response tuning, often utilizing large language models (LLMs) and diffusion transformers, and exploring novel evaluation metrics for multi-modal, multi-turn interactions. This field is crucial for advancing human-computer interaction, enabling more intuitive and effective collaboration between humans and AI systems across various domains, from robotics and manufacturing to healthcare and education.

Papers

October 5, 2023

HandMeThat: Human-Robot Communication in Physical and Social Environments
Yanming Wan, Jiayuan Mao, Joshua B. Tenenbaum
Human Robot Interaction Human Instruction Language Grounding Physical Information Human Robot Communication Social Environment

September 30, 2023

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
Xuansheng Wu, Wenlin Yao, Jianshu Chen, Xiaoman Pan, Xiaoyang Wang, Ninghao Liu, Dong Yu
Medical LLM Pre Trained Model Instruction Tuning Human Instruction Instruction Tuned Model

September 29, 2023

Guiding Instruction-based Image Editing via Multimodal Large Language Models
Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan
Multimodal Large Language Model Image Editing Human Instruction

September 14, 2023

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou
Large Language Model Human Instruction Human SAFETY Critical Lesson Instruction Tuned Model Whispering Llama Safety Fine Tuning

August 28, 2023

Evaluating the Robustness to Instructions of Large Language Models
Yuansheng Ni, Sichao Jiang, Xinyu wu, Hui Shen, Yuli Zhou
Native Robustness Human Instruction Instruction Fine Tuning Instruction Tuned Large Language Model Task Instruction

August 27, 2023

MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records
Scott L. Fleming, Alejandro Lozano, William J. Haberkorn, Jenelle A. Jindal, Eduardo P. Reis, Rahul Thapa, Louis Blankemeier, Julian Z. Genkins, Ethan Steinberg, Ashwin Nayak, Birju S. Patel, Chia-Chun Chiang, Alison Callahan, Zepeng Huo, Sergios Gatidis, Scott J. Adams, Oluseyi Fayanju, Shreya J. Shah, Thomas Savage, Ethan Goh, Akshay S. Chaudhari, Nima Aghaeepour, Christopher Sharp, Michael A. Pfeffer, Percy Liang, Jonathan H. Chen, Keith E. Morse, Emma P. Brunskill, Jason A. Fries, Nigam H. Shah
Large Language Model Electronic Health Record Human Instruction Natural Language Instruction Natural Language Generation Patient Data

August 25, 2023

LLM2KB: Constructing Knowledge Bases using instruction tuned context aware Large Language Models
Anmol Nayak, Hari Prasad Timmapathini
Large Language Model Natural Language Processing Context Information Human Instruction Knowledge Base Inference Task Dense Passage Retrieval KG Completion

August 24, 2023

Improving Translation Faithfulness of Large Language Models via Augmenting Instructions
Yijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou
Human Instruction Instruction Following Instruction Generation Instruction Paradigm Imperfect Translation

August 23, 2023

From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models
Jing Yao, Xiaoyuan Yi, Xiting Wang, Jindong Wang, Xing Xie
Timely Survey Large Model Human Instruction Value Alignment Human Value Alignment Objective

August 19, 2023

I3: Intent-Introspective Retrieval Conditioned on Instructions
Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang
Human Instruction Task Specific Retriever

August 17, 2023

Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski
Neural Radiance Field Human Instruction Sample STEP Text Guided Editing Relevance Map Scene Editing

August 14, 2023

Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents
Byeonghwi Kim, Jinyeon Kim, Yuyeong Kim, Cheolhong Min, Jonghyun Choi
Human Instruction Context Aware Visual Navigation Embodied Agent Active Object

August 12, 2023

Bio-SIEVE: Exploring Instruction Tuning Large Language Models for Systematic Review Automation
Ambrose Robinson, William Thorne, Ben P. Wu, Abdullah Pandor, Munira Essat, Mark Stevenson, Xingyi Song
Systematic Review Human Instruction Bio Sieve

July 7, 2023

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Yu Liu, Kai Chen, Ping Luo
Large Language Model Human Instruction Visual Instruction Region Text Pair

June 28, 2023

Inferring the Goals of Communicating Agents from Actions and Instructions
Lance Ying, Tan Zhi-Xuan, Vikash Mansinghka, Joshua B. Tenenbaum
Scientific Inference Human Instruction Past Action Pseudo Goal Agent Communication Cooperative Agent Goal Inference Inverse Planning

June 27, 2023

Simple Steps to Success: Axiomatics of Distance-Based Algorithmic Recourse
Jenny Hamer, Jake Valladares, Vignesh Viswanathan, Yair Zick
Causal Graph Human Instruction Financial Success Algorithmic Recourse Axiomatic Approach Path Based Explanation Model Structure

June 21, 2023

OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue
Weihao Gao, Zhuo Deng, Zhiyuan Niu, Fuju Rong, Chucheng Chen, Zheng Gong, Wenze Zhang, Daimin Xiao, Fang Li, Zhenjie Cao, Zhaoyi Ma, Wenbin Wei, Lan Ma
Multimodal Large Language Model Human Instruction Dialogue Utterance

June 19, 2023

June 16, 2023

Rewriting the Script: Adapting Text Instructions for Voice Interaction
Alyssa Hwang, Natasha Oza, Chris Callison-Burch, Andrew Head
Human Instruction Voice Assistant Voice Interaction Task Guidance

Human Instruction

Papers

HandMeThat: Human-Robot Communication in Physical and Social Environments

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Evaluating the Robustness to Instructions of Large Language Models

MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records

LLM2KB: Constructing Knowledge Bases using instruction tuned context aware Large Language Models

Improving Translation Faithfulness of Large Language Models via Augmenting Instructions

From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models

I3: Intent-Introspective Retrieval Conditioned on Instructions

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

Bio-SIEVE: Exploring Instruction Tuning Large Language Models for Systematic Review Automation

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Inferring the Goals of Communicating Agents from Actions and Instructions

Simple Steps to Success: Axiomatics of Distance-Based Algorithmic Recourse

OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with Instructions

Rewriting the Script: Adapting Text Instructions for Voice Interaction