Long Context Large Language Model
Long-context large language models (LLMs) aim to overcome the limitations of traditional LLMs by processing significantly longer input sequences, enabling more comprehensive understanding and generation of text. Current research focuses on improving efficiency through techniques like sparse attention mechanisms, optimized memory management (e.g., KV cache compression), and efficient training strategies, as well as developing robust evaluation benchmarks that assess performance on diverse, realistic long-context tasks. This field is crucial for advancing natural language processing capabilities in applications requiring deep understanding of extensive documents, such as multi-document summarization, question answering, and complex reasoning tasks across various domains.
Papers
NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens
Cunxiang Wang, Ruoxi Ning, Boqi Pan, Tonghui Wu, Qipeng Guo, Cheng Deng, Guangsheng Bao, Xiangkun Hu, Zheng Zhang, Qian Wang, Yue Zhang
Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models
Mingyang Song, Mao Zheng, Xuan Luo
SWAG: Storytelling With Action Guidance
Zeeshan Patel, Karim El-Refai, Jonathan Pei, Tianle Li
UniMem: Towards a Unified View of Long-Context Large Language Models
Junjie Fang, Likai Tang, Hongzhe Bi, Yujia Qin, Si Sun, Zhenyu Li, Haolun Li, Yongjian Li, Xin Cong, Yankai Lin, Yukun Yan, Xiaodong Shi, Sen Song, Zhiyuan Liu, Maosong Sun