Neural Architecture
Neural architecture research focuses on designing and optimizing the structure of artificial neural networks to improve efficiency, accuracy, and interpretability. Current efforts concentrate on three areas: developing novel architectures such as Kolmogorov-Arnold Networks and transformers; using efficient search algorithms (e.g., evolutionary algorithms, generative flows) to explore vast design spaces; and analyzing the representational similarity and training efficiency of different models. These advances matter for deploying deep learning in resource-constrained environments such as edge devices and for understanding how neural networks learn and generalize, with applications ranging from computer vision and natural language processing to scientific computing.
Papers
HyperPPO: A scalable method for finding small policies for robotic control
Shashank Hegde, Zhehui Huang, Gaurav S. Sukhatme
Reusability report: Prostate cancer stratification with diverse biologically-informed neural architectures
Christian Pedersen, Tiberiu Tesileanu, Tinghui Wu, Siavash Golkar, Miles Cranmer, Zijun Zhang, Shirley Ho
Compositional Program Generation for Few-Shot Systematic Generalization
Tim Klinger, Luke Liu, Soham Dan, Maxwell Crouse, Parikshit Ram, Alexander Gray