Knowledge Mechanism

Understanding how large language models (LLMs) acquire, store, and utilize knowledge is a crucial area of research. Current investigations focus on characterizing knowledge representation within LLMs, exploring whether knowledge is localized to specific "neurons" or distributed more broadly, and examining how this knowledge is accessed and modified, including techniques for mitigating harmful outputs. These efforts aim to improve the trustworthiness and safety of LLMs, ultimately impacting their deployment in various applications and furthering our understanding of artificial intelligence.

Papers