Input Context

Input context, the information supplied to a language model before it performs a task, is crucial to model performance and faithfulness. Current research focuses on improving how models use this context, addressing challenges such as limited context windows, attention bias toward the beginning and end of the input, and the difficulty of processing long or complex information. Work in this area develops techniques such as attention steering, memory-augmented retrieval, and context compression to improve both efficiency and accuracy, with particular emphasis on mitigating the "lost-in-the-middle" effect, in which information placed in the middle of a long context is used less reliably than information near its edges, and on improving the faithfulness of generated outputs. These advances matter for the reliability and scalability of large language models across applications.
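To make the "lost-in-the-middle" effect concrete, the following is a minimal sketch of the kind of positional probe used to measure it: a gold document is swept through different depths of a multi-document context, and a model is scored on the same question at each depth. The document contents, the question, and the `build_probe` helper are illustrative assumptions, not taken from any specific benchmark.

```python
# A minimal sketch of a "lost-in-the-middle" positional probe.
# It places a gold document at varying depths among distractor
# passages and renders the prompts one would score a model on.
# All names and texts here are hypothetical.

def build_probe(question: str, gold_doc: str, distractors: list[str],
                gold_position: int) -> str:
    """Insert the gold document at `gold_position` among the
    distractors and render a single retrieval-style prompt."""
    docs = distractors[:gold_position] + [gold_doc] + distractors[gold_position:]
    numbered = "\n\n".join(f"Document {i + 1}: {d}" for i, d in enumerate(docs))
    return f"{numbered}\n\nQuestion: {question}\nAnswer:"

if __name__ == "__main__":
    distractors = [f"Filler passage {i} about an unrelated topic." for i in range(9)]
    gold = "The capital of Freedonia is Fredville."  # hypothetical fact
    # Sweep the gold document from the start to the end of the context.
    # Scoring a model's answers per position typically reveals the
    # U-shaped accuracy curve reported in lost-in-the-middle studies:
    # high at the edges, lower in the middle.
    for pos in (0, len(distractors) // 2, len(distractors)):
        prompt = build_probe("What is the capital of Freedonia?",
                             gold, distractors, pos)
        print(f"--- gold at position {pos} ---")
        print(prompt[:120], "...")
```

In practice the printed prompts would be sent to the model under evaluation, and per-position accuracy plotted against `gold_position`; context-compression and attention-steering methods aim to flatten that curve.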

Papers