Data Analysis Pipeline
A data analysis pipeline is a structured sequence of computational steps that transforms raw data into actionable insights. Current research emphasizes automating pipeline construction and execution, often using large language models (LLMs) to generate code, produce visualizations, and interpret results. It also seeks to improve statistical rigor through techniques such as selective inference. This work aims to enhance the reproducibility, efficiency, and accessibility of data analysis across diverse domains, from business intelligence and scientific workflows to sensitive applications such as medical image analysis and child safety.
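The idea of a pipeline as an ordered sequence of steps can be illustrated with a minimal Python sketch. The step names (`drop_missing`, `to_float`), the `run_pipeline` helper, and the sample data are all hypothetical and chosen only for illustration; real pipelines in the research described here are typically far more elaborate.

```python
from functools import reduce
from statistics import mean
from typing import Callable, List

# A step is any function that takes a list of records and returns a new list.
Step = Callable[[list], list]


def drop_missing(rows: list) -> list:
    """Cleaning step: discard records that are missing (None)."""
    return [r for r in rows if r is not None]


def to_float(rows: list) -> list:
    """Transformation step: parse each remaining record as a float."""
    return [float(r) for r in rows]


def run_pipeline(data: list, steps: List[Step]) -> list:
    """Apply each step in order, feeding the output of one into the next."""
    return reduce(lambda acc, step: step(acc), steps, data)


# Hypothetical raw input: numeric strings with one missing value.
raw = ["3.5", None, "4.0", "2.5"]

cleaned = run_pipeline(raw, [drop_missing, to_float])

# Final "insight" step: a small summary of the cleaned data.
summary = {"n": len(cleaned), "mean": mean(cleaned)}
print(summary)
```

Keeping each step as a plain function with the same input and output shape makes the sequence easy to reorder, test in isolation, and rerun, which is one simple way to support the reproducibility goal mentioned above.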
Papers
November 1, 2024
October 11, 2024
September 27, 2024
June 27, 2024
June 17, 2024
March 19, 2024
December 3, 2023
November 3, 2023
September 19, 2023
May 30, 2023
April 17, 2023
December 7, 2022
April 29, 2022