End to End
"End-to-end" systems aim to streamline complex processes by integrating multiple stages into a single, unified model, eliminating the need for intermediate steps and potentially improving efficiency and performance. Current research focuses on applying this approach across diverse fields, utilizing architectures like transformers, reinforcement learning, and spiking neural networks to tackle challenges in autonomous driving, robotics, speech processing, and natural language processing. This approach offers significant potential for improving the accuracy, speed, and robustness of various applications, while also simplifying development and deployment.
Papers
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification
Fei Jia, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg
SAN: a robust end-to-end ASR model architecture
Zeping Min, Qian Ge, Guanhua Huang
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation
Fernando López, Jordi Luque
AskYourDB: An end-to-end system for querying and visualizing relational databases using natural language
Manu Joseph, Harsh Raj, Anubhav Yadav, Aaryamann Sharma
ELIAS: End-to-End Learning to Index and Search in Large Output Spaces
Nilesh Gupta, Patrick H. Chen, Hsiang-Fu Yu, Cho-Jui Hsieh, Inderjit S Dhillon