Multi Layer
Multi-layer architectures are a central theme in contemporary machine learning, aiming to improve the efficiency and accuracy of various models by strategically organizing computational units into multiple layers. Current research focuses on optimizing these architectures, exploring alternatives to traditional multilayer perceptrons (MLPs) such as Kolmogorov-Arnold Networks (KANs) and Fourier Analysis Networks (FANs), and investigating techniques like layer distillation and frequency shifting for improved performance and reduced computational cost. These advancements have significant implications for diverse applications, including music generation, image processing, natural language processing, and scientific computing, by enabling faster, more accurate, and more efficient models.
Papers
LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors
Yusuf Dalva, Yijun Li, Qing Liu, Nanxuan Zhao, Jianming Zhang, Zhe Lin, Pinar Yanardag
PoTable: Programming Standardly on Table-based Reasoning Like a Human Analyst
Qingyang Mao, Qi Liu, Zhi Li, Mingyue Cheng, Zheng Zhang, Rui Li
Training MLPs on Graphs without Supervision
Zehong Wang, Zheyuan Zhang, Chuxu Zhang, Yanfang Ye
A Layered Architecture for Developing and Enhancing Capabilities in Large Language Model-based Software Systems
Dawen Zhang, Xiwei Xu, Chen Wang, Zhenchang Xing, Robert Mao
Error-Feedback Model for Output Correction in Bilateral Control-Based Imitation Learning
Hiroshi Sato, Masashi Konosu, Sho Sakaino, Toshiaki Tsuji
Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity
Zichen Song, Sitan Huang, Yuxin Wu, Zhongfeng Kang
Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Xiaofeng Zhang, Yihao Quan, Chaochen Gu, Chen Shen, Xiaosong Yuan, Shaotian Yan, Hao Cheng, Kaijie Wu, Jieping Ye