Host load prediction in a Google compute cloud with a Bayesian model

作者: Walfredo Cirne , Derrick Kondo , Sheng Di

DOI: 10.5555/2388996.2389025

关键词: Interval (mathematics)Cloud computingHost (network)PredictabilityData miningMoving averageComputer scienceResource allocationBayesian inferenceMean squared errorBayes' theorem

摘要: Prediction of host load in Cloud systems is critical for achieving service-level agreements. However, accurate prediction Clouds extremely challenging because it fluctuates drastically at small timescales. We design a method based on Bayes model to predict the mean over long-term time interval, as well consecutive future intervals. identify novel predictive features that capture expectation, predictability, trends and patterns load. also determine most effective combinations these prediction. evaluate our using detailed one-month trace Google data center with thousands machines. Experiments show achieves high accuracy squared error 0.0014. Moreover, improves by 5.6 -- 50% compared other state-of-the-art methods moving averages, auto-regression, and/or noise filters.

参考文章(27)
M. B. WILK, R. GNANADESIKAN, Probability plotting methods for the analysis of data Biometrika. ,vol. 55, pp. 1- 17 ,(1968) , 10.1093/BIOMET/55.1.1
Bikash Sharma, Victor Chudnovsky, Joseph L. Hellerstein, Rasekh Rifaat, Chita R. Das, Modeling and synthesizing task placement constraints in Google compute clusters symposium on cloud computing. pp. 3- ,(2011) , 10.1145/2038916.2038919
Bradley J. Barnes, Barry Rountree, David K. Lowenthal, Jaxk Reeves, Bronis de Supinski, Martin Schulz, A regression-based approach to scalability prediction Proceedings of the 22nd annual international conference on Supercomputing - ICS '08. pp. 368- 377 ,(2008) , 10.1145/1375527.1375580
Christopher Dabrowski, Fern Hunt, Using Markov chain analysis to study dynamic behaviour in large-scale grid systems grid computing. pp. 29- 40 ,(2009) , 10.5555/1862805.1862814
Juan M. Tirado, Daniel Higuero, Florin Isaila, Jesus Carretero, Multi-model prediction for enhancing content locality in elastic server infrastructures ieee international conference on high performance computing, data, and analytics. pp. 1- 9 ,(2011) , 10.1109/HIPC.2011.6152728
Tamer Basar, A New Approach to Linear Filtering and Prediction Problems Journal of Basic Engineering. ,vol. 82, pp. 35- 45 ,(1960) , 10.1115/1.3662552
S. Akioka, Y. Muraoka, Extended forecast of CPU and network load on computational Grid cluster computing and the grid. pp. 765- 772 ,(2004) , 10.1109/CCGRID.2004.1336711
Yongwei Wu, Kai Hwang, Yulai Yuan, Weiming Zheng, Adaptive Workload Prediction of Grid Performance in Confidence Windows IEEE Transactions on Parallel and Distributed Systems. ,vol. 21, pp. 925- 938 ,(2010) , 10.1109/TPDS.2009.137
Asit K. Mishra, Joseph L. Hellerstein, Walfredo Cirne, Chita R. Das, Towards characterizing cloud backend workloads: insights from Google compute clusters measurement and modeling of computer systems. ,vol. 37, pp. 34- 41 ,(2010) , 10.1145/1773394.1773400