End to End
"End-to-end" systems aim to streamline complex processes by integrating multiple stages into a single, unified model, eliminating the need for intermediate steps and potentially improving efficiency and performance. Current research focuses on applying this approach across diverse fields, utilizing architectures like transformers, reinforcement learning, and spiking neural networks to tackle challenges in autonomous driving, robotics, speech processing, and natural language processing. This approach offers significant potential for improving the accuracy, speed, and robustness of various applications, while also simplifying development and deployment.
Papers
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping
Kevin Zhang, Luka Chkhetiani, Francis McCann Ramirez, Yash Khare, Andrea Vanzo, Michael Liang, Sergio Ramirez Martin, Gabriel Oexle, Ruben Bousbib, Taufiquzzaman Peyash, Michael Nguyen, Dillon Pulliam, Domenic Donato
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness
Xincan Feng, Akifumi Yoshimoto
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Wenbo Huang, Yunsong Li
rFaceNet: An End-to-End Network for Enhanced Physiological Signal Extraction through Identity-Specific Facial Contours
Dali Zhu, Wenli Zhang, Hualin Zeng, Xiaohao Liu, Long Yang, Jiaqi Zheng
An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning
Heitor R. GuimarĂ£es, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk
EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Ziqi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu