Private Datasets

Private datasets present a significant challenge in leveraging data for machine learning while safeguarding sensitive information. Current research focuses on developing privacy-preserving techniques, such as differential privacy, federated learning, and secure multi-party computation, often employing generative models (like normalizing flows) and advanced algorithms (e.g., zeroth-order optimization) to enable collaborative model training and inference without direct data sharing. These advancements are crucial for unlocking the potential of sensitive data in various fields, including healthcare, finance, and agriculture, while mitigating privacy risks and promoting responsible data usage. The ultimate goal is to balance the utility of data-driven insights with robust protection of individual privacy.

Papers