Theoretical Understanding
Theoretical understanding in artificial intelligence currently focuses on rigorously analyzing the capabilities and limitations of various models, aiming to bridge the gap between empirical observations and formal guarantees. Research emphasizes developing theoretical frameworks for explaining model behavior, particularly in areas like large language models (LLMs), diffusion models, and graph neural networks, often employing techniques from information theory, optimization, and statistical learning theory to analyze model performance and generalization. These theoretical advancements are crucial for improving model design, enhancing reliability, and addressing concerns about robustness, fairness, and explainability, ultimately impacting the trustworthiness and responsible deployment of AI systems across diverse applications.
Papers
On the Disconnect Between Theory and Practice of Neural Networks: Limits of the NTK Perspective
Jonathan Wenger, Felix Dangel, Agustinus Kristiadi
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
Jiaxian Guo, Bo Yang, Paul Yoo, Bill Yuchen Lin, Yusuke Iwasawa, Yutaka Matsuo