Mixture Component

Mixture component models combine multiple specialized models (experts), typically through a learned gating or weighting mechanism, to improve performance and efficiency on complex tasks. Current research focuses on new architectures, such as mixtures of experts (MoE), and on applications in natural language processing, computer vision, and signal processing, often combined with techniques like low-rank adaptation (LoRA) for parameter efficiency. These advances make it possible to build larger, more capable models at lower computational cost and with better generalization across heterogeneous datasets.
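As a rough illustration of the idea of combining experts, the following is a minimal sketch of sparse top-k routing in plain Python. It is not taken from any of the papers listed here: the function names (`softmax`, `moe_forward`), the linear gate, the toy experts, and the gate weights are all assumptions chosen only to keep the example small.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(x, experts, gate_weights, top_k=2):
    """Route input x to the top_k experts and mix their outputs.

    x            -- list of floats (input features)
    experts      -- list of callables, each mapping x to a float
    gate_weights -- one weight vector per expert (hypothetical linear gate)
    """
    # Gate score per expert: dot product of its weight vector with x.
    scores = [sum(w * xi for w, xi in zip(ws, x)) for ws in gate_weights]
    # Keep only the top_k highest-scoring experts (sparse routing).
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:top_k]
    # Renormalise the gate probabilities over the chosen experts only.
    probs = softmax([scores[i] for i in chosen])
    # Only the chosen experts are evaluated; the others cost nothing here.
    output = sum(p * experts[i](x) for p, i in zip(probs, chosen))
    return output, dict(zip(chosen, probs))


# Toy experts and gate weights, purely illustrative.
experts = [
    lambda x: sum(x),
    lambda x: max(x),
    lambda x: x[0] - x[1],
]
gate_weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]

y, routing = moe_forward([2.0, 1.0], experts, gate_weights, top_k=2)
print(y, routing)
```

With `top_k` smaller than the number of experts, the third expert is never called for this input, which is the sense in which such models can grow in capacity without a matching growth in per-input compute.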

Papers

August 28, 2024

Automated Mixture Analysis via Structural Evaluation
Zachary T. P. Fried, Brett A. McGuire
Mixture Component, Mixture Model, Organic Chemistry, Spectroscopic Data, Structural Analysis, Rotational Spectrum

August 20, 2024

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
Yin-Jyun Luo, Kin Wai Cheuk, Woosung Choi, Toshimitsu Uesaka, Keisuke Toyama, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Wei-Hsiang Liao, Simon Dixon, Yuki Mitsufuji
Latent Representation, Mixture Component, Speech Representation, Disentanglement, Musical Instrument

August 15, 2024

August 2, 2024

MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts
Lin Ning, Harsh Lara, Meiqi Guo, Abhinav Rastogi
Large Language Model, Mixture Component, Mixture of Expert, LLM Adaptation, Dyadic Interaction, Task Specialization, Multi Task Adaptation

July 31, 2024

MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts
Xi Victoria Lin, Akshat Shrivastava, Liang Luo, Srinivasan Iyer, Mike Lewis, Gargi Ghosh, Luke Zettlemoyer, Armen Aghajanyan
Mixture Component, Mixture of Expert, Modality Specific, Multimodal AI, Effective Fusion, Multi Modal Pre Training, Modality Aware

July 29, 2024

July 28, 2024

Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models
Mohammed Al-Maamari, Mehdi Ben Amor, Michael Granitzer
Large Language Model, Language Model, Knowledge Distillation, Mixture Component, Modular System, Source Domain, Distilling Knowledge

July 26, 2024

Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection
Steven Adams, Andrea Patanè, Morteza Lahijanian, Luca Laurenti
Neural Network, Deep Neural Network, Gaussian Process, Mixture Component, Probabilistic Model, Error Bound, Gaussian Model, Prior Correction

July 25, 2024

HANNA: Hard-constraint Neural Network for Consistent Activity Coefficient Prediction
Thomas Specht, Mayank Nagda, Sophie Fellenz, Stephan Mandt, Hans Hasse, Fabian Jirasek
Neural Network, Mixture Component, Thermodynamic Integration, Activity Coefficient

July 24, 2024

M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis
Junyu Li, Ye Zhang, Wen Shu, Xiaobing Feng, Yingchun Wang, Pengju Yan, Xiaolin Li, Chulin Sha, Min He
Mixture Component, Multiple Instance Learning, Computational Pathology, Image Analysis, Isocitrate Dehydrogenase Mutation

July 20, 2024

EEGMamba: Bidirectional State Space Model with Mixture of Experts for EEG Multi-task Classification
Yiyu Gui, MingZhi Chen, Yuqi Su, Guibo Luo, Yuchao Yang
Mixture Component, Expert Knowledge, EEG Data, EEG Signal, EEG Classification, Bidirectional State Space Model

July 19, 2024

Mixture of Experts with Mixture of Precisions for Tuning Quality of Service
HamidReza Imani, Abdolah Amirany, Tarek El-Ghazawi
Language Model Mixture Component Expert Knowledge Mixture of Expert Multidimensional Local Precision Rate Service Provider Hyper Tune Quantization Level

July 18, 2024

Mixture of Experts based Multi-task Supervise Learning from Crowds
Tao Han, Huaixuan Shi, Xinyi Ding, Xiao Ma, Huamao Gu, Yili Fang
Mixture Component, Expert Knowledge, Crowdsourcing Context, Crowded Environment, Truth Inference

July 17, 2024

July 10, 2024

July 8, 2024

Thermodynamics-Consistent Graph Neural Networks
Jan G. Rittig, Alexander Mitsos
Graph Neural Network, Mixture Component, GNN Architecture, Free Energy, Thermodynamic Integration, Activity Coefficient

Mixture Component

Papers

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models

UniFed: A Universal Federation of a Mixture of Highly Heterogeneous Medical Image Classification Tasks

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

MoVEInt: Mixture of Variational Experts for Learning Human-Robot Interactions from Demonstrations

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
