Internal State
"Internal state" research encompasses diverse fields, focusing on representing and utilizing internal representations of systems to improve prediction, decision-making, and control. Current research emphasizes developing robust methods for estimating and tracking internal states across various domains, employing techniques like Bayesian methods, neural networks (including recurrent and transformer architectures), and Gaussian processes. These advancements have significant implications for diverse applications, including improved battery management, more efficient power grid control, enhanced AI safety, and more accurate predictions in areas like weather forecasting and legal speech transcription.
Papers
InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Mohammad Beigi, Ying Shen, Runing Yang, Zihao Lin, Qifan Wang, Ankith Mohan, Jianfeng He, Ming Jin, Chang-Tien Lu, Lifu Huang
How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment
Heyan Huang, Yinghao Li, Huashan Sun, Yu Bai, Yang Gao