The Fast and The Frugal: Tail Latency Aware Provisioning for Coping with Load Variations

作者: Adithya Kumar , Iyswarya Narayanan , Timothy Zhu , Anand Sivasubramaniam

DOI: 10.1145/3366423.3380117

关键词:

摘要: Small and medium sized enterprises use the cloud for running online, user-facing, tail latency sensitive applications with well-defined fixed monthly budgets. For these applications, adequate system capacity must be provisioned to extract maximal performance despite challenges of uncertainties in load request-sizes. In this paper, we address problem provisioning under budget constraints goal minimizing latency. To tackle problem, propose building systems using a heterogeneous mix low expensive resources cheap that provide high throughput per dollar. As changes through day, more faster reduce during periods cheaper handle periods. achieve benefits, introduce novel heterogeneity-aware scheduling autoscaling algorithms are designed Using software prototypes by experiments on public cloud, show our approach can outperform existing reducing as much 45% fixed-budget settings.

参考文章(63)
Byung-Gon Chun, Gunho Lee, H. Katz, Heterogeneity-aware resource allocation and scheduling in the cloud ieee international conference on cloud computing technology and science. pp. 4- 4 ,(2011) , 10.5555/2170444.2170448
Bhuvan Urgaonkar, Byung Chul Tak, Anand Sivasubramaniam, To move or not to move: the economics of cloud computing ieee international conference on cloud computing technology and science. pp. 5- 5 ,(2011) , 10.5555/2170444.2170449
Ishai Menache, Ohad Shamir, Navendu Jain, On-demand, Spot, or Both: Dynamic Resource Allocation for Executing Batch Jobs in the Cloud. international conference on autonomic computing. pp. 177- 187 ,(2014)
Bhuvan Urgaonkar, Aman Kansal, Iyswarya Narayanan, Sriram Govindan, Anand Sivasubramaniam, Towards a leaner geo-distributed cloud infrastructure ieee international conference on cloud computing technology and science. pp. 3- 3 ,(2014)
B. Urgaonkar, P. Shenoy, A. Chandra, P. Goyal, Dynamic Provisioning of Multi-tier Internet Applications international conference on autonomic computing. pp. 217- 228 ,(2005) , 10.1109/ICAC.2005.27
Qiumin Xu, Huzefa Siyamwala, Mrinmoy Ghosh, Tameesh Suri, Manu Awasthi, Zvika Guz, Anahita Shayesteh, Vijay Balakrishnan, Performance analysis of NVMe SSDs and their implication on real world databases acm international conference on systems and storage. pp. 6- ,(2015) , 10.1145/2757667.2757684
Sungkap Yeo, Hsien-Hsin S. Lee, Using Mathematical Modeling in Provisioning a Heterogeneous Cloud Computing Environment IEEE Computer. ,vol. 44, pp. 55- 62 ,(2011) , 10.1109/MC.2011.96
Yu-Ju Hong, Jiachen Xue, Mithuna Thottethodi, Dynamic server provisioning to minimize cost in an IaaS cloud measurement and modeling of computer systems. ,vol. 39, pp. 147- 148 ,(2011) , 10.1145/1993744.1993799