Mixture Component

Mixture component models are a powerful class of machine learning techniques that combine multiple specialized models (experts) to improve performance and efficiency on complex tasks. Current research focuses on developing novel architectures, such as mixtures of experts (MoE), and applying them to diverse fields including natural language processing, computer vision, and signal processing, often incorporating techniques like low-rank adaptation (LoRA) for parameter efficiency. These advancements are significant because they enable the creation of larger, more capable models while mitigating computational costs and improving generalization across heterogeneous datasets, leading to improved accuracy and efficiency in various applications.

Papers

September 25, 2024

SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA
Siyue Zhang, Anh Tuan Luu, Chen Zhao
Mixture Component Text to SQL End 2 End Table Question Answering Answer Selection Table Question

September 24, 2024

September 23, 2024

September 20, 2024

On-device Collaborative Language Modeling via a Mixture of Generalists and Specialists
Dongyang Fan, Bettina Messmer, Martin Jaggi
Language Model Mixture Component Expert Knowledge Generalist Learner

September 18, 2024

September 17, 2024

LPT++: Efficient Training on Mixture of Long-tailed Experts
Bowen Dong, Pan Zhou, Wangmeng Zuo
Mixture Component Efficient Training Long Tailed Learning Long Tail Long Tailed Classification TIP Generation Memory Efficient Adaptation

September 11, 2024

Zero-Shot Machine-Generated Text Detection Using Mixture of Large Language Models
Matthieu Dubois, François Yvon, Pablo Piantanida
Zero Shot Generative AI Mixture Component Machine Generated Text Human Written Text Text Generation Capability

September 10, 2024

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw
Large Language Model Pre Trained Mixture Component Lightweight Encoder

September 9, 2024

Adapted-MoE: Mixture of Experts with Test-Time Adaption for Anomaly Detection
Tianwu Lei, Silin Chen, Bohan Wang, Zhengkai Jiang, Ningmu Zou
Anomaly Detection Representation Learning Mixture Component Expert Knowledge Test Time Adaptation Unsupervised Anomaly Detection Decision Boundary Feature Distribution Unseen Test Distribution

September 5, 2024

ChartMoE: Mixture of Expert Connector for Advanced Chart Understanding
Zhengzhuo Xu, Bowen Qu, Yiyan Qi, Sinan Du, Chengjin Xu, Chun Yuan, Jian Guo
Multimodal Large Language Model Mixture Component LD Align Chart Comprehension Alignment Training Chart Component

September 2, 2024

August 30, 2024

Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts
Rhui Dih Lee, Laura Wynter, Raghu Kiran Ganti
Full Model Mixture Component Extension Study Easy to Use Toolkit Domain Expert

August 28, 2024