A microeconomic data mining problem: customer-oriented catalog segmentation

作者: Martin Ester , Rong Ge , Wen Jin , Zengjian Hu

DOI: 10.1145/1014052.1014119

关键词:

摘要: The microeconomic framework for data mining [7] assumes that an enterprise chooses a decision maximizing the overall utility over all customers where contribution of customer is function available on customer. In Catalog Segmentation, wants to design k product catalogs size r maximize number catalog products purchased. However, there are many applications customer, once attracted enterprise, would purchase more beyond ones contained in catalog. Therefore, this paper, we investigate alternative problem formulation, call Customer-Oriented measured by have at least specified minimum interest t catalogs. We formally introduce Segmentation and discuss its complexity. Then two different paradigms efficient, approximate algorithms problem, greedy (deterministic) randomized algorithms. Since may be trapped local optimum crucially depend reasonable initial solution, explore combination these paradigms. Our experimental evaluation synthetic real demonstrates new yield significantly higher compared classical

参考文章(13)
Yiyu Yao, Eric K. Louie, Tsau-Young Lin, Mining value added association rules pacific-asia conference on knowledge discovery and data mining. ,(2002)
Ke Wang, Senqiang Zhou, Jiawei Han, Profit Mining: From Patterns to Actions extending database technology. pp. 70- 87 ,(2002) , 10.1007/3-540-45876-X_7
Jon Kleinberg, Christos Papadimitriou, Prabhakar Raghavan, A Microeconomic View of Data Mining Data Mining and Knowledge Discovery. ,vol. 2, pp. 311- 324 ,(1998) , 10.1023/A:1009726428407
Renato Coppi, A theoretical framework for data mining: the informational paradigm Computational Statistics & Data Analysis. ,vol. 38, pp. 501- 515 ,(2002) , 10.1016/S0167-9473(01)00075-5
Heikki Mannila, Theoretical frameworks for data mining ACM SIGKDD Explorations Newsletter. ,vol. 1, pp. 30- 32 ,(2000) , 10.1145/846183.846191
Dachuan Xu*, Yinyu Ye, Jiawei Zhang‡, Approximating the 2-catalog segmentation problem using semidefinite programming relaxations Optimization Methods & Software. ,vol. 18, pp. 705- 719 ,(2003) , 10.1080/10556780310001634082
Tom Brijs, Bart Goethals, Gilbert Swinnen, Koen Vanhoof, Geert Wets, A data mining framework for optimal product selection in retail supermarket data: the generalized PROFSET model knowledge discovery and data mining. pp. 300- 304 ,(2000) , 10.1145/347090.347156
Michael Steinbach, George Karypis, Vipin Kumar, Efficient Algorithms for Creating Product Catalogs Defense Technical Information Center. ,(2000) , 10.21236/ADA439548
Ke Wang, Ming-Yen Thomas Su, Item selection by "hub-authority" profit ranking Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '02. pp. 652- 657 ,(2002) , 10.1145/775047.775144