Towards predicting query execution time for concurrent and dynamic database workloads

作者: Wentao Wu , Yun Chi , Hakan Hacígümüş , Jeffrey F. Naughton

DOI: 10.14778/2536206.2536219

关键词:

摘要: Predicting query execution time is crucial for many database management tasks including admission control, scheduling, and progress monitoring. While a number of recent papers have explored this problem, the bulk existing work either considers prediction single query, or static workload concurrent queries, where by "static" we mean that queries to be run are fixed known. In paper, consider more general problem dynamic workloads. Unlike most previous on prediction, our proposed framework based analytic modeling rather than machine learning. We first use optimizer's cost model estimate I/O CPU requirements each pipeline in isolation, then combination queueing buffer pool merges requests from predict running times. compare approach with machine-learning variant work. Our experiments show analytic-model can lead competitive often better accuracy its counterpart.

参考文章(32)
David J. DeWitt, Ravishankar Ramamurthy, Buffer Pool Aware Query Optimization conference on innovative data systems research. pp. 250- 261 ,(2005)
Kenneth C. Sevcik, Data base system performance prediction using an analytical model (invited paper) very large data bases. pp. 182- 198 ,(1981)
Edward D. Lazowska, G. Scott Graham, John Zahorjan, Kenneth C. Sevcik, Quantitative system performance: computer system analysis using queueing network models Int. CMG Conference. pp. 468- 470 ,(1984)
Neven Tomov, Euan Dempster, M. Howard Williams, Albert Burger, Hamish Taylor, Peter J.B. King, Phil Broughton, Some Results from a New Technique for Response Time Estimation in Parallel DBMS ieee international conference on high performance computing data and analytics. pp. 713- 721 ,(1999) , 10.1007/BFB0100632
Ted J. Wasserman, Patrick Martin, David B. Skillicorn, Haider Rizvi, Developing a characterization of business intelligence workloads for sizing new database systems data warehousing and olap. pp. 7- 13 ,(2004) , 10.1145/1031763.1031766
M. Reiser, S. S. Lavenberg, Mean-Value Analysis of Closed Multichain Queuing Networks Journal of the ACM. ,vol. 27, pp. 313- 322 ,(1980) , 10.1145/322186.322195
Jennie Duggan, Ugur Cetintemel, Olga Papaemmanouil, Eli Upfal, None, Performance prediction for concurrent database workloads international conference on management of data. pp. 337- 348 ,(2011) , 10.1145/1989323.1989359
Rasha Osman, Irfan Awan, Michael E. Woodward, Application of Queueing Network Models in the Performance Evaluation of Database Designs Electronic Notes in Theoretical Computer Science. ,vol. 232, pp. 101- 124 ,(2009) , 10.1016/J.ENTCS.2009.02.053
N. Tomov, E. Dempster, M.H. Williams, A. Burger, H. Taylor, P.J.B. King, P. Broughton, Analytical response time estimation in parallel relational database systems parallel computing. ,vol. 30, pp. 249- 283 ,(2004) , 10.1016/J.PARCO.2003.11.003
Jiexing Li, Rimma V. Nehme, Jeffrey Naughton, GSLPI: A Cost-Based Query Progress Indicator 2012 IEEE 28th International Conference on Data Engineering. pp. 678- 689 ,(2012) , 10.1109/ICDE.2012.74