Abstraction Learning
Abstraction learning focuses on enabling artificial intelligence systems to learn and utilize higher-level representations of data, simplifying complex tasks and improving generalization. Current research emphasizes developing methods to discover these abstractions within various model architectures, including generative diffusion models, reinforcement learning agents, and large language models, often employing techniques like contrastive learning, Hopfield networks, and symbolic graph representations. This field is crucial for advancing AI capabilities in areas such as robotics, natural language processing, and program synthesis, by enabling more efficient learning, improved generalization to unseen data, and enhanced interpretability of complex models.
Papers
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden, Sahand Rezaei-Shoshtari, Rosie Zhao, David Meger, Doina Precup
ShapeCoder: Discovering Abstractions for Visual Programs from Unstructured Primitives
R. Kenny Jones, Paul Guerrero, Niloy J. Mitra, Daniel Ritchie