Structural Generalization
Structural generalization in machine learning focuses on developing models capable of extrapolating learned patterns to novel combinations of previously seen components, rather than simply memorizing training examples. Current research emphasizes evaluating and improving this ability in various tasks, particularly natural language processing (NLP), using architectures like transformers and graph neural networks, often incorporating techniques like pre-training and structural constraints. The ability to achieve robust structural generalization is crucial for building more reliable and adaptable AI systems across diverse applications, including autonomous systems, semantic parsing, and machine translation.
Papers
July 8, 2024
July 5, 2024
June 19, 2024
February 2, 2024
November 8, 2023
October 23, 2023
October 21, 2023
June 20, 2023
May 26, 2023
April 26, 2023
March 9, 2023
January 12, 2023
December 19, 2022
November 25, 2022
October 24, 2022