Large Scale Datasets
Large-scale datasets are driving advancements in numerous machine learning applications, with research focusing on efficient data management, improved model training, and mitigating issues like data bias and leakage. Current efforts involve developing novel algorithms for clustering, feature selection, and causal inference, often leveraging transformer-based models and techniques like knowledge distillation to enhance performance and scalability. The availability and effective utilization of these datasets are crucial for pushing the boundaries of AI capabilities across diverse fields, from scientific discovery to industrial applications.
Papers
November 18, 2024
November 1, 2024
October 18, 2024
October 15, 2024
October 12, 2024
September 30, 2024
September 27, 2024
September 26, 2024
September 22, 2024
September 2, 2024
August 28, 2024
August 13, 2024
August 2, 2024
June 26, 2024
June 25, 2024
June 11, 2024
May 27, 2024
April 30, 2024
March 20, 2024