Paper ID: 2406.04257

Data Measurements for Decentralized Data Markets

Charles Lu, Mohammad Mohammadi Amiri, Ramesh Raskar

Decentralized data markets can provide more equitable forms of data acquisition for machine learning. However, practical marketplaces require efficient techniques for seller selection. We propose and benchmark federated data measurements that allow a data buyer to find sellers with relevant and diverse datasets. Diversity and relevance measures enable a buyer to make relative comparisons between sellers without relying on intermediate brokers or training task-dependent models.

Submitted: Jun 6, 2024