Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback

作者: Kazuhide Nakata , Hayato Sakata , Yuta Saito , Suguru Yaginuma , Yuta Nishino

DOI:

关键词:

摘要: Recommender systems widely use implicit feedback such as click data because of its general availability. Although the presence clicks signals users' preference to some extent, lack does not necessarily indicate a negative response from users, it is possible that users were exposed items (positive-unlabeled problem). This leads difficulty in predicting preferences feedback. Previous studies addressed positive-unlabeled problem by uniformly upweighting loss for positive or estimating confidence each having relevance information via EM-algorithm. However, these methods failed address missing-not-at-random which popular frequently recommended are more likely be clicked than other even if user have considerable interest them. To overcome limitations, we first define an ideal function optimized realize recommendations maximize and propose unbiased estimator loss. Subsequently, analyze variance proposed further clipped includes special case. We demonstrate expected improve performance recommender system, considering bias-variance trade-off. conduct semi-synthetic real-world experiments method largely outperforms baselines. In particular, works better rare less observed training data. The findings can achieve objective recommending with highest relevance.

参考文章(29)
Xiao-Li Li, Bing Liu, None, Learning from Positive and Unlabeled Examples with Different Data Distributions Machine Learning: ECML 2005. pp. 218- 229 ,(2005) , 10.1007/11564096_24
Dawen Liang, Laurent Charlin, James McInerney, David M. Blei, Modeling User Exposure in Recommendation the web conference. pp. 951- 961 ,(2016) , 10.1145/2872427.2883090
Yehuda Koren, Robert Bell, Chris Volinsky, Matrix Factorization Techniques for Recommender Systems IEEE Computer. ,vol. 42, pp. 30- 37 ,(2009) , 10.1109/MC.2009.263
Yifan Hu, Yehuda Koren, Chris Volinsky, Collaborative Filtering for Implicit Feedback Datasets international conference on data mining. pp. 263- 272 ,(2008) , 10.1109/ICDM.2008.22
Charles Elkan, Keith Noto, Learning classifiers from only positive and unlabeled data Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 213- 220 ,(2008) , 10.1145/1401890.1401920
Donald B. Rubin, Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. ,vol. 66, pp. 688- 701 ,(1974) , 10.1037/H0037350
Andriy Mnih, Ruslan R Salakhutdinov, Probabilistic Matrix Factorization neural information processing systems. ,vol. 20, pp. 1257- 1264 ,(2007)
PAUL R. ROSENBAUM, DONALD B. RUBIN, The central role of the propensity score in observational studies for causal effects Biometrika. ,vol. 70, pp. 41- 55 ,(1983) , 10.1093/BIOMET/70.1.41
Jiahui Liu, Peter Dolan, Elin Rønby Pedersen, Personalized news recommendation based on click behavior intelligent user interfaces. pp. 31- 40 ,(2010) , 10.1145/1719970.1719976
Thorsten Joachims, Adith Swaminathan, The self-normalized estimator for counterfactual learning neural information processing systems. ,vol. 28, pp. 3231- 3239 ,(2015)