Large-scale Causal Approaches to Debiasing Post-click Conversion Rate Estimation with Multi-task Learning

作者: Xiao-Yang Liu , Ramin Ramezani , Hong Wen , Keping Yang , Quan Lin

DOI:

关键词: InferenceSelection biasTask (project management)Computer scienceMissing dataDebiasingRecommender systemArtificial intelligenceMulti-task learningMachine learningEstimator

摘要: Post-click conversion rate (CVR) estimation is a critical task in e-commerce recommender systems. This deemed quite challenging under the industrial setting with two major issues: 1) selection bias caused by user self-selection, and 2) data sparsity due to rare click events. A successful typically has following sequential events: "exposure -> conversion". Conventional CVR estimators are trained space, but inference done entire exposure space. They fail account for causes of missing treat them as at random. Hence, their estimations highly likely deviate from real values large. In addition, issue can also handicap many which usually have large parameter spaces. In this paper, we propose principled, efficient effective estimation, namely, Multi-IPW Multi-DR. The proposed models approach causal perspective not our methods based on multi-task learning framework mitigate issue. Extensive experiments industrial-level datasets show that outperform state-of-the-art models.

参考文章(44)
Elias Bareinboim, Judea Pearl, Jin Tian, Recovering from selection bias in causal and statistical inference national conference on artificial intelligence. pp. 2410- 2416 ,(2014)
Craig K. Enders, Applied Missing Data Analysis ,(2010)
Miroslav Dudik, Lihong Li, John Langford, Doubly Robust Policy Evaluation and Learning arXiv: Learning. ,(2011)
Dawen Liang, Laurent Charlin, James McInerney, David M. Blei, Modeling User Exposure in Recommendation the web conference. pp. 951- 961 ,(2016) , 10.1145/2872427.2883090
Gary M. Weiss, Mining with rarity ACM SIGKDD Explorations Newsletter. ,vol. 6, pp. 7- 19 ,(2004) , 10.1145/1007730.1007734
Harald Steck, Training and testing of recommender systems on data missing not at random knowledge discovery and data mining. pp. 713- 722 ,(2010) , 10.1145/1835804.1835895
Karel Vermeulen, Stijn Vansteelandt, Bias-Reduced Doubly Robust Estimation Journal of the American Statistical Association. ,vol. 110, pp. 1024- 1036 ,(2015) , 10.1080/01621459.2014.958155
Kuang-chih Lee, Burkay Orten, Ali Dasdan, Wentong Li, None, Estimating conversion rate in display advertising from past erformance data Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12. pp. 768- 776 ,(2012) , 10.1145/2339530.2339651
Olav Kallenberg, Foundations of modern probability ,(1997)