Diverse Set
Diverse sets, encompassing varied data points or model outputs, are a central focus in current machine learning research, aiming to improve model robustness, generalization, and explainability. Researchers are exploring diverse set generation and evaluation across various domains, employing techniques like diffusion models, contrastive learning, and gradient-based methods to achieve both diversity and quality in outputs, such as image generation, text generation, and audio captioning. This focus on diversity is crucial for addressing biases, enhancing model performance on underrepresented data, and improving the reliability and trustworthiness of AI systems in real-world applications. The development of new metrics and benchmarks for evaluating diversity is also a key area of ongoing work.
Papers
Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech
Jae-Sung Bae, Jinhyeok Yang, Tae-Jun Bak, Young-Sun Joo
DiversiTree: A New Method to Efficiently Compute Diverse Sets of Near-Optimal Solutions to Mixed-Integer Optimization Problems
Izuwa Ahanor, Hugh Medal, Andrew C. Trapp