Towards a Combined Grouping and Aggregation Algorithm for Fast Query Processing in Columnar Databases with GPUs

作者: Sina Meraji , John Keenleyside , Sunil Kamath , Bob Blainey

DOI: 10.1109/IPDPSW.2015.21

关键词: AlgorithmOnline aggregationsortAggregate (data warehouse)SargableParallel computingHash functionComputer scienceQuery languageDatabaseQuery optimization

摘要: Column-store in-memory databases have received a lot of attention because their fast query processing response times on modern multi-core machines. Among different database operations, group by/aggregate is an important and potentially costly operation. Moreover, sort-based hash-based algorithms are the most common ways queries. While used in traditional Data Base Management Systems (DBMS), hash based can be applied for faster new columnar databases. Besides, Graphical Processing Units (GPU) utilized as fast, high bandwidth co-processors to improve performance The focus this article prototype operations that we created exploit GPUs. We show GPU. One parameters affect algorithm number groups hashing algorithm. get up 7.6x improvement kernel compared CPU implementation when use partitioned multi-level using GPU shared global memories.

参考文章(23)
Ren Wu, Bin Zhang, Meichun Hsu, Qiming Chen, GPU-accelerated predicate evaluation on column store web age information management. pp. 570- 581 ,(2010) , 10.1007/978-3-642-14246-8_56
Vijayshankar Raman, Gopi Attaluri, Ronald Barber, Naresh Chainani, David Kalmuk, Vincent KulandaiSamy, Jens Leenstra, Sam Lightstone, Shaorong Liu, Guy M. Lohman, Tim Malkemus, Rene Mueller, Ippokratis Pandis, Berni Schiefer, David Sharpe, Richard Sidle, Adam Storm, Liping Zhang, DB2 with BLU acceleration Proceedings of the VLDB Endowment. ,vol. 6, pp. 1080- 1091 ,(2013) , 10.14778/2536222.2536233
Franz Färber, Sang Kyun Cha, Jürgen Primsch, Christof Bornhövd, Stefan Sigg, Wolfgang Lehner, SAP HANA database ACM SIGMOD Record. ,vol. 40, pp. 45- 51 ,(2012) , 10.1145/2094114.2094126
Tim Kaldewey, Guy Lohman, Rene Mueller, Peter Volk, GPU join processing revisited data management on new hardware. pp. 55- 62 ,(2012) , 10.1145/2236584.2236592
Ambuj Shatdal, Jeffrey F. Naughton, Adaptive parallel aggregation algorithms international conference on management of data. ,vol. 24, pp. 104- 114 ,(1995) , 10.1145/223784.223801
Dan A Alcantara, Andrei Sharf, Fatemeh Abbasinejad, Shubhabrata Sengupta, Michael Mitzenmacher, John D Owens, Nina Amenta, None, Real-time parallel hashing on the GPU international conference on computer graphics and interactive techniques. ,vol. 28, pp. 154- ,(2009) , 10.1145/1618452.1618500
John Nickolls, Ian Buck, Michael Garland, Kevin Skadron, Scalable parallel programming with CUDA ACM SIGGRAPH 2008 classes on - SIGGRAPH '08. ,vol. 6, pp. 40- 53 ,(2008) , 10.1145/1401132.1401152
Yang Ye, Kenneth A. Ross, Norases Vesdapunt, Scalable aggregation on multicore processors Proceedings of the Seventh International Workshop on Data Management on New Hardware - DaMoN '11. pp. 1- 9 ,(2011) , 10.1145/1995441.1995442
Bingsheng He, Ke Yang, Rui Fang, Mian Lu, Naga Govindaraju, Qiong Luo, Pedro Sander, Relational joins on graphics processors Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08. pp. 511- 524 ,(2008) , 10.1145/1376616.1376670
Naga K. Govindaraju, Brandon Lloyd, Wei Wang, Ming Lin, Dinesh Manocha, Fast computation of database operations using graphics processors international conference on management of data. pp. 215- 226 ,(2004) , 10.1145/1007568.1007594