Answer Token
Answer-token research studies how language models identify and use specific words or sub-word units when constructing answers in question-answering tasks. Current work examines how different architectures, including transformers and retrieval-augmented generation (RAG) systems, process and weight these tokens, often using attention analysis and saliency mapping to pinpoint each token's contribution to the final answer. The goal is to improve the accuracy, interpretability, and trustworthiness of language models by clarifying the role answer tokens play during generation, yielding more reliable and explainable AI systems across applications.
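The attention-weight idea behind many of these attribution methods can be illustrated with a minimal numpy sketch. This is a toy, hand-built example, not the method of any particular paper: the embeddings, the `attention_saliency` helper, and the token layout are all illustrative assumptions. It shows how scaled dot-product attention assigns each context token a weight that can be read as a rough measure of that token's contribution to the answer.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-d score vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_saliency(query, keys):
    # Scaled dot-product attention: one weight per context token,
    # summing to 1, used here as a toy saliency score.
    scores = keys @ query / np.sqrt(len(query))
    return softmax(scores)

# Hand-made 2-d "embeddings" for four context tokens; token 0 is
# aligned with the query, so it should get the largest weight.
keys = np.array([[1.0, 0.0],
                 [0.0, 1.0],
                 [0.5, 0.5],
                 [-1.0, 0.0]])
query = np.array([1.0, 0.0])

weights = attention_saliency(query, keys)
print(weights.argmax())  # → 0 (token 0 dominates)
```

Real saliency-mapping work typically backpropagates gradients through a full model rather than inspecting a single attention step, but the ranking-by-weight interpretation is the same.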
Papers
July 21, 2024
June 19, 2024
May 24, 2023
January 28, 2022