Paper ID: 2406.04257
Data Measurements for Decentralized Data Markets
Charles Lu, Mohammad Mohammadi Amiri, Ramesh Raskar
Decentralized data markets can provide more equitable forms of data acquisition for machine learning. However, to realize practical marketplaces, efficient techniques for seller selection need to be developed. We propose and benchmark federated data measurements to allow a data buyer to find sellers with relevant and diverse datasets. Diversity and relevance measures enable a buyer to make relative comparisons between sellers without requiring intermediate brokers and training task-dependent models.
Submitted: Jun 6, 2024