Transformer Based
Transformer-based models are revolutionizing various fields by leveraging self-attention mechanisms to capture long-range dependencies in sequential data, achieving state-of-the-art results in tasks ranging from natural language processing and image recognition to time series forecasting and robotic control. Current research focuses on improving efficiency (e.g., through quantization and optimized architectures), enhancing generalization capabilities, and addressing challenges like handling long sequences and endogeneity. These advancements are significantly impacting diverse scientific communities and practical applications, leading to more accurate, efficient, and robust models across numerous domains.
Papers
Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer
Xiao Huo, Junhui Ho, Shuai Wan, Fuzheng Yang
Joint multi-dimensional dynamic attention and transformer for general image restoration
Huan Zhang, Xu Zhang, Nian Cai, Jianglei Di, Yun Zhang
Circuit Complexity Bounds for RoPE-based Transformer Architecture
Bo Chen, Xiaoyu Li, Yingyu Liang, Jiangxuan Long, Zhenmei Shi, Zhao Song
Transformer verbatim in-context retrieval across time and scale
Kristijan Armeni, Marko Pranjić, Senja Pollak
Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction
Miguel Antunes-García, Luis M. Bergasa, Santiago Montiel-Marín, Rafael Barea, Fabio Sánchez-García, Ángel Llamazares
Spatially Constrained Transformer with Efficient Global Relation Modelling for Spatio-Temporal Prediction
Ashutosh Sao, Simon Gottschalk
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer, Jack Spruyt
Transformers to Predict the Applicability of Symbolic Integration Routines
Rashid Barket, Uzma Shafiq, Matthew England, Juergen Gerhard
Context-Aware Token Selection and Packing for Enhanced Vision Transformer
Tianyi Zhang, Baoxin Li, Jae-sun Seo, Yu Cap