Paper ID: 2212.13892

Cross-Dataset Propensity Estimation for Debiasing Recommender Systems

Fengyu Li, Sarah Dean

Datasets for training recommender systems are often subject to distribution shift induced by users' and recommenders' selection biases. In this paper, we study the impact of selection bias on datasets with different quantization. We then leverage two differently quantized datasets from different source distributions to mitigate distribution shift by applying the inverse probability scoring method from causal inference. Empirically, our approach gains significant performance improvement over single-dataset methods and alternative ways of combining two datasets.

Submitted: Dec 22, 2022