Causal Disentanglement

Causal disentanglement aims to learn data representations in which each component corresponds to an independent causal factor of the data-generating process, rather than to a mere statistical correlation. Current research focuses on algorithms, often based on variational autoencoders (VAEs) or graph neural networks, that identify and separate these causal factors even in the presence of confounding variables, using techniques such as interventional data or self-supervised learning. The field is significant because disentangled representations improve model interpretability, robustness (e.g., to adversarial attacks), and generalization across contexts, with applications ranging from recommendation systems and hate speech detection to biological modeling and fault diagnosis.
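
To make the VAE-based approach concrete, the sketch below shows one common construction: the encoder produces exogenous noise variables, which a learned causal layer (here a linear structural causal model with an acyclic adjacency matrix) maps to causally related latent factors before decoding. This is a minimal illustration under assumed choices (latent and hidden dimensions, a fixed causal ordering via an upper-triangular mask, an MSE reconstruction term); it is not the implementation of any specific paper.

```python
# Minimal sketch of a VAE with a causal latent layer, assuming a *linear*
# structural causal model (SCM) over the latent factors. Class names,
# dimensions, and the DAG mask are illustrative assumptions, not the API
# of any particular paper's released code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class CausalDisentangleVAE(nn.Module):
    def __init__(self, x_dim=784, z_dim=4, h_dim=256):
        super().__init__()
        self.z_dim = z_dim
        self.encoder = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.to_mu = nn.Linear(h_dim, z_dim)
        self.to_logvar = nn.Linear(h_dim, z_dim)
        self.decoder = nn.Sequential(
            nn.Linear(z_dim, h_dim), nn.ReLU(), nn.Linear(h_dim, x_dim)
        )
        # Learnable weighted adjacency over latent factors; an
        # upper-triangular mask fixes a causal ordering so the latent
        # graph is acyclic by construction.
        self.A = nn.Parameter(torch.zeros(z_dim, z_dim))

    def causal_layer(self, eps):
        # Linear SCM: z = A^T z + eps  =>  z = (I - A^T)^{-1} eps
        mask = torch.triu(torch.ones_like(self.A), diagonal=1)
        A = self.A * mask
        eye = torch.eye(self.z_dim, device=eps.device)
        return eps @ torch.inverse(eye - A.t()).t()

    def forward(self, x):
        h = self.encoder(x)
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        noise = torch.randn_like(mu)
        eps = mu + noise * torch.exp(0.5 * logvar)  # exogenous noise
        z = self.causal_layer(eps)                  # causal latent factors
        x_hat = self.decoder(z)
        recon = F.mse_loss(x_hat, x, reduction="sum") / x.size(0)
        kl = -0.5 * torch.mean(
            torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=1)
        )
        return x_hat, recon + kl


# Toy usage: one forward/backward pass on random data.
model = CausalDisentangleVAE()
x = torch.rand(8, 784)
x_hat, loss = model(x)
loss.backward()
```

In practice, methods in this area typically add further terms on top of this skeleton, for example a sparsity or acyclicity penalty on the adjacency matrix, or supervision from labels and interventions that align individual latents with known causal factors.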

Papers