Autoregressive Transformer
Autoregressive transformers are a class of neural network models that generate sequential data by predicting the next element of a sequence, one element at a time. Current research focuses on improving their efficiency and extending them to diverse data types, including time series, images, 3D shapes, and even analog circuit simulations, often through novel attention mechanisms and training strategies such as packing and contrastive learning. These advances are significant because they enable the generation of high-quality, complex data across many domains, with impact ranging from image synthesis and 3D modeling to natural language processing and scientific simulation.
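To make the next-element prediction loop concrete, here is a minimal PyTorch sketch of a decoder-only autoregressive transformer with greedy decoding. It is illustrative only and not drawn from any of the papers listed below; the class name, hyperparameters, and the `generate` helper are assumptions chosen for brevity.

```python
# Minimal sketch of autoregressive (next-token) generation with a causal transformer.
# Hyperparameters and names are illustrative, not taken from the papers listed below.
import torch
import torch.nn as nn


class TinyAutoregressiveTransformer(nn.Module):
    def __init__(self, vocab_size=256, d_model=64, n_heads=4, n_layers=2, max_len=128):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        self.pos_emb = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=n_heads, dim_feedforward=4 * d_model,
            batch_first=True, norm_first=True,
        )
        self.blocks = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):
        # tokens: (batch, seq_len) integer ids
        batch, seq_len = tokens.shape
        pos = torch.arange(seq_len, device=tokens.device)
        x = self.tok_emb(tokens) + self.pos_emb(pos)
        # Causal mask: position i may only attend to positions <= i.
        mask = nn.Transformer.generate_square_subsequent_mask(seq_len).to(tokens.device)
        x = self.blocks(x, mask=mask)
        return self.lm_head(x)  # (batch, seq_len, vocab_size) next-element logits


@torch.no_grad()
def generate(model, prompt, n_new_tokens=16):
    # Greedy autoregressive decoding: append one predicted element at a time
    # and feed the extended sequence back into the model.
    tokens = prompt.clone()
    for _ in range(n_new_tokens):
        logits = model(tokens)                                # logits for every position
        next_token = logits[:, -1].argmax(-1, keepdim=True)   # prediction for the next element
        tokens = torch.cat([tokens, next_token], dim=1)
    return tokens


if __name__ == "__main__":
    model = TinyAutoregressiveTransformer()
    prompt = torch.randint(0, 256, (1, 4))   # toy prompt of 4 token ids
    print(generate(model, prompt).shape)     # (1, 20): 4 prompt + 16 generated tokens
```

The same loop underlies autoregressive generation of images, 3D shapes, or time series once the data are tokenized into a discrete (or otherwise ordered) sequence; the papers below vary the tokenization, attention mechanism, and training strategy rather than this basic recurrence.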
Papers
Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers
Seungwook Han, Jinyeop Song, Jeff Gore, Pulkit Agrawal
Extrapolating Jet Radiation with Autoregressive Transformers
Anja Butter, François Charton, Javier Mariño Villadamigo, Ayodele Ore, Tilman Plehn, Jonas Spinner
Generalizable autoregressive modeling of time series through functional narratives
Ran Liu, Wenrui Ma, Ellen Zippi, Hadi Pouransari, Jingyun Xiao, Chris Sandino, Behrooz Mahasseni, Juri Minxha, Erdrin Azemi, Eva L. Dyer, Ali Moin
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Jiatao Gu, Yuyang Wang, Yizhe Zhang, Qihang Zhang, Dinghuai Zhang, Navdeep Jaitly, Josh Susskind, Shuangfei Zhai