End to End
"End-to-end" systems aim to streamline complex processes by integrating multiple stages into a single, unified model, eliminating the need for intermediate steps and potentially improving efficiency and performance. Current research focuses on applying this approach across diverse fields, utilizing architectures like transformers, reinforcement learning, and spiking neural networks to tackle challenges in autonomous driving, robotics, speech processing, and natural language processing. This approach offers significant potential for improving the accuracy, speed, and robustness of various applications, while also simplifying development and deployment.
Papers
Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation
Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji, Andros Tjandra, Sakriani Sakti
Integrating Lattice-Free MMI into End-to-End Speech Recognition
Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Binbin Zhang, Di Wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu
End-to-End Transformer Based Model for Image Captioning
Yiyu Wang, Jungang Xu, Yingfei Sun
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
Yingfei Liu, Tiancai Wang, Xiangyu Zhang, Jian Sun
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim, Seung Shin, Yoonsik Kim, Han-Cheol Cho, Taeho Kil, Jaeheung Surh, Seunghyun Park, Bado Lee, Youngmin Baek
An End-to-End Approach for Seam Carving Detection using Deep Neural Networks
Thierry P. Moreira, Marcos Cleison S. Santana, Leandro A. Passos João Paulo Papa, Kelton Augusto P. da Costa
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
Lixiang Ru, Yibing Zhan, Baosheng Yu, Bo Du