Large-Scale Datasets
Large-scale datasets are driving advances in many machine learning applications, with research focusing on efficient data management, improved model training, and the mitigation of issues such as data bias and leakage. Current efforts include developing new algorithms for clustering, feature selection, and causal inference, often using transformer-based models and techniques such as knowledge distillation to improve performance and scalability. The availability and effective use of these datasets are crucial to extending AI capabilities across diverse fields, from scientific discovery to industrial applications.