DeepPlace: Learning to Place Applications in Multi-Tenant Clusters

作者: Subrata Mitra , Shanka Subhra Mondal , Nikhil Sheoran , Neeraj Dhake , Ravinder Nehra

DOI: 10.1145/3343737.3343741

关键词:

摘要: Large multi-tenant production clusters often have to handle a variety of jobs and applications with complex resource usage characteristics. It is non-trivial non-optimal manually create placement rules for scheduling that would decide which should co-locate. In this paper, we present DeepPlace, scheduler learns exploits various temporal patterns using Deep Reinforcement Learning (Deep RL) reduce competition across running in the same machine while at time optimizing overall cluster utilization.

参考文章(52)
Lidong Zhou, Jaliya Ekanayake, Bing Shi, Ming Wu, Wei Lin, Eric Boutin, Jingren Zhou, Zhengping Qian, Apollo: scalable and coordinated scheduling for cloud-scale computing operating systems design and implementation. pp. 285- 300 ,(2014) , 10.5555/2685048.2685071
Amiya K. Maji, Subrata Mitra, Saurabh Bagchi, ICE: An Integrated Configuration Engine for Interference Mitigation in Cloud Services international conference on autonomic computing. pp. 91- 100 ,(2015) , 10.1109/ICAC.2015.48
Peter Bodik, Jonathan Mace, Madanlal Musuvathi, Rodrigo Fonseca, Retro: Targeted resource management in multi-tenant distributed systems networked systems design and implementation. pp. 589- 603 ,(2015)
Volodymyr Mnih, Ioannis Antonoglou, Koray Kavukcuoglu, Daan Wierstra, Martin A. Riedmiller, Alex Graves, David Silver, Playing Atari with Deep Reinforcement Learning arXiv: Learning. ,(2013)
Lalith Suresh, Marco Canini, Stefan Schmid, Anja Feldmann, None, C3: cutting tail latency in cloud data stores via adaptive replica selection networked systems design and implementation. pp. 513- 527 ,(2015)
Yunjing Xu, Zachary Musgrave, Brian Noble, Michael Bailey, None, Bobtail: avoiding long tails in the cloud networked systems design and implementation. pp. 329- 342 ,(2013)
Scott Shenker, Ali Ghodsi, Matei Zaharia, Andy Konwinski, Benjamin Hindman, Ion Stoica, Dominant resource fairness: fair allocation of multiple resource types networked systems design and implementation. pp. 323- 336 ,(2011) , 10.5555/1972457.1972490
Lucian Popa, Gautam Kumar, Mosharaf Chowdhury, Arvind Krishnamurthy, Sylvia Ratnasamy, Ion Stoica, FairCloud: sharing the network in cloud computing acm special interest group on data communication. ,vol. 42, pp. 187- 198 ,(2012) , 10.1145/2342356.2342396
Jens Kober, J. Andrew Bagnell, Jan Peters, Reinforcement learning in robotics: A survey The International Journal of Robotics Research. ,vol. 32, pp. 1238- 1274 ,(2013) , 10.1177/0278364913495721
Xiaoqi Ren, Ganesh Ananthanarayanan, Adam Wierman, Minlan Yu, Hopper: Decentralized Speculation-aware Cluster Scheduling at Scale acm special interest group on data communication. ,vol. 45, pp. 379- 392 ,(2015) , 10.1145/2785956.2787481