Data Science Pipeline
Data science pipelines automate the process of extracting knowledge from data, encompassing data ingestion, analysis, visualization, and reporting. Current research emphasizes improving accessibility and usability through natural language interfaces and automated machine learning (AutoML), particularly focusing on integrating large language models (LLMs) and autonomous agents to streamline workflows. This focus aims to enhance reproducibility, transparency, and efficiency in data science, ultimately enabling broader application across diverse fields like medicine and fact-checking, while also addressing challenges like ensuring safety and interpretability of results.
Papers
October 27, 2024
October 8, 2024
October 7, 2024
September 30, 2024
September 15, 2024
August 4, 2024
July 21, 2024
November 30, 2023
November 12, 2023
October 15, 2023
February 28, 2023