Theoretical Understanding
Theoretical understanding in artificial intelligence currently focuses on rigorously analyzing the capabilities and limitations of various models, aiming to bridge the gap between empirical observations and formal guarantees. Research emphasizes developing theoretical frameworks for explaining model behavior, particularly in areas like large language models (LLMs), diffusion models, and graph neural networks, often employing techniques from information theory, optimization, and statistical learning theory to analyze model performance and generalization. These theoretical advancements are crucial for improving model design, enhancing reliability, and addressing concerns about robustness, fairness, and explainability, ultimately impacting the trustworthiness and responsible deployment of AI systems across diverse applications.
Papers
Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers
Yuchen Liang, Peizhong Ju, Yingbin Liang, Ness Shroff
Single-Timescale Multi-Sequence Stochastic Approximation Without Fixed Point Smoothness: Theories and Applications
Yue Huang, Zhaoxian Wu, Shiqian Ma, Qing Ling
Generation through the lens of learning theory
Vinod Raman, Ambuj Tewari