Root Cause
Root cause analysis (RCA) aims to identify the underlying causes of system failures or anomalies across diverse domains, from autonomous driving to manufacturing and healthcare. Current research heavily emphasizes data-driven approaches, employing machine learning models like graph neural networks, transformers, and causal discovery algorithms (including Bayesian networks) to analyze complex datasets (often multimodal) and infer causal relationships. These advancements are improving the accuracy and efficiency of RCA, leading to faster troubleshooting, enhanced system reliability, and more informed decision-making in various industries. The development of large, publicly available datasets is also a significant focus, facilitating more robust benchmarking and algorithm comparison.
Papers
AI Hazard Management: A framework for the systematic management of root causes for AI risks
Ronald Schnitzer, Andreas Hapfelmeier, Sven Gaube, Sonja Zillner
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
Zefan Wang, Zichuan Liu, Yingying Zhang, Aoxiao Zhong, Lunting Fan, Lingfei Wu, Qingsong Wen