Improving on-demand learning to rank through parallelism

作者: Daniel Xavier De Sousa , Thierson Couto Rosa , Wellington Santos Martins , Rodrigo Silva , Marcos André Gonçalves

DOI: 10.1007/978-3-642-35063-4_38

关键词:

摘要: Traditional Learning to Rank (L2R) is usually conducted in a batch mode which single ranking function learned order results for future queries. This approach not flexible since queries may differ considerably from those present the training set and, consequently, work properly. Ideally, distinct learning should be on demand each query. Nevertheless, on-demand L2R significantly degrade query processing time, as has on-the-fly before it can applied. In this paper we parallel implementation of an technique that reduces drastically response time previous serial implementation. Our makes use thousands threads GPU learn query, and takes advantage reduced obtained through active learning. Experiments with LETOR benchmark show our proposed achieves mean speedup 127x when compared sequential version, while producing very competitive effectiveness.

参考文章(18)
Rodrigo Silva, Marcos A Gonçalves, Adriano Veloso, None, Rule-Based Active Sampling for Learning to Rank Machine Learning and Knowledge Discovery in Databases. pp. 240- 255 ,(2011) , 10.1007/978-3-642-23808-6_16
Tarek Abudawood, Peter Flach, Evaluation Measures for Multi-class Subgroup Discovery european conference on machine learning. pp. 35- 50 ,(2009) , 10.1007/978-3-642-04180-8_20
Humberto Mossri de Almeida, Marcos André Gonçalves, Marco Cristo, Pável Calado, A combined component approach for finding collection-adapted ranking functions based on genetic programming international acm sigir conference on research and development in information retrieval. pp. 399- 406 ,(2007) , 10.1145/1277741.1277810
Shuaiqiang Wang, Byron J. Gao, Ke Wang, Hady W. Lauw, Parallel learning to rank for information retrieval international acm sigir conference on research and development in information retrieval. pp. 1083- 1084 ,(2011) , 10.1145/2009916.2010060
Stephen Tyree, Kilian Q. Weinberger, Kunal Agrawal, Jennifer Paykin, Parallel boosted regression trees for web search ranking Proceedings of the 20th international conference on World wide web - WWW '11. pp. 387- 396 ,(2011) , 10.1145/1963405.1963461
Tao Qin, Tie-Yan Liu, Jun Xu, Hang Li, LETOR: A benchmark collection for research on learning to rank for information retrieval Information Retrieval. ,vol. 13, pp. 346- 374 ,(2010) , 10.1007/S10791-009-9123-Y
George Teodoro, Nathan Mariano, Wagner Meira Jr., Renato Ferreira, Tree Projection-Based Frequent Itemset Mining on Multicore CPUs and GPUs symposium on computer architecture and high performance computing. pp. 47- 54 ,(2010) , 10.1109/SBAC-PAD.2010.15
L.A. Barroso, J. Dean, U. Holzle, Web search for a planet: The Google cluster architecture IEEE Micro. ,vol. 23, pp. 22- 28 ,(2003) , 10.1109/MM.2003.1196112
Adriano A Veloso, Humberto M Almeida, Marcos A Gonçalves, Wagner Meira Jr, None, Learning to rank at query-time using association rules Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 267- 274 ,(2008) , 10.1145/1390334.1390381
Bo Wang, Tianji Wu, Feng Yan, Ruirui Li, Ningyi Xu, Yu Wang, RankBoost Acceleration on both NVIDIA CUDA and ATI Stream Platforms international conference on parallel and distributed systems. pp. 284- 291 ,(2009) , 10.1109/ICPADS.2009.115