LLM Representation
Large language model (LLM) representations are the internal data structures that encode the information LLMs process; understanding them is a key research area aimed at explaining how these models function and at improving their performance. Current work focuses on enhancing these representations through techniques such as localized fine-tuning, knowledge base integration, and architectural adaptations (e.g., transformers with dynamic compression) that handle diverse input modalities and lengths. Understanding and manipulating LLM representations is crucial for improving model accuracy, trustworthiness, and interpretability, with implications for applications including text generation, question answering, and recommendation systems.
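To make "internal representations" concrete, below is a minimal sketch of extracting hidden states from a transformer LLM using the Hugging Face transformers library. The model choice (gpt2) and the mean-pooling step are illustrative assumptions, not methods prescribed by any particular paper on this topic.

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Illustrative model choice; any causal or masked LM works similarly.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_hidden_states=True)

inputs = tokenizer("LLM representations encode meaning.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# hidden_states is a tuple: (embedding layer, layer 1, ..., layer N),
# each of shape (batch, sequence_length, hidden_size).
hidden_states = outputs.hidden_states
print(f"{len(hidden_states) - 1} transformer layers, "
      f"hidden size {hidden_states[-1].shape[-1]}")

# One common sentence-level representation (an assumption here, not the
# only option): mean-pool the final layer over the token dimension.
sentence_repr = hidden_states[-1].mean(dim=1)  # shape (batch, hidden_size)
```

These per-layer tensors are the objects that interpretability and representation-editing work inspects and manipulates; which layer and pooling strategy to use depends on the task.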