Semantic Encoder
Semantic encoders are computational models designed to extract and represent the meaning (semantics) from various data types, including text, audio, images, and even graph structures. Current research focuses on improving the efficiency and robustness of these encoders, often employing architectures like transformers, graph neural networks, and normalizing flows, and integrating them into end-to-end systems for tasks such as topic segmentation, semantic communication, and multi-modal data processing. This work is significant because effective semantic encoding is crucial for advancing numerous applications, including efficient data transmission, improved machine understanding of complex data, and enhanced performance in tasks like fraud detection and image editing.
Papers
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley
Transformer-Enhanced Motion Planner: Attention-Guided Sampling for State-Specific Decision Making
Lei Zhuang, Jingdong Zhao, Yuntao Li, Zichun Xu, Liangliang Zhao, Hong Liu