On the efficacy, efficiency and emergent behavior of task replication in large distributed systems

作者: Walfredo Cirne , Francisco Brasileiro , Daniel Paranhos , Luís Fabrício W. Góes , William Voorsluys

DOI: 10.1016/J.PARCO.2007.01.002

关键词:

摘要: Large distributed systems challenge traditional schedulers, as it is often hard to determine a priori how long each task will take complete on resource, information that input for such schedulers. Task replication has been applied in variety of scenarios way circumvent this problem. consists dispatching multiple replicas and using the result from first replica finish. Replication schedulers (i.e. employ replication) are able achieve good performance even absence tasks resources. They also smaller complexity than making them better suitable large systems. On other hand, waste cycles with not Moreover, extra consumption resources raises severe concerns about system-wide system multiple, competing This paper presents comprehensive study replication, comparing against information-based establishing their efficacy (the delivered application), efficiency amount wasted), emergent behavior schedulers). We introduce simple access control strategy can be implemented locally by resource greatly improves overall which compete

参考文章(29)
P. D. Coddington, K. A. Hawickand, H. A. James, Scheduling Independent Tasks on Metacomputing Systems ,(1999)
Cynthia Bailey Lee, Yael Schwartzman, Jennifer Hardy, Allan Snavely, Are user runtime estimates inherently inaccurate job scheduling strategies for parallel processing. pp. 253- 263 ,(2004) , 10.1007/11407522_14
Sung-Ju Lee, Puneet Sharma, Sujata Banerjee, Sujoy Basu, Rodrigo Fonseca, Measuring Bandwidth Between PlanetLab Nodes Lecture Notes in Computer Science. pp. 292- 305 ,(2005) , 10.1007/978-3-540-31966-5_23
Wael R. Elwasif, Martin Swany, James S. Plank, Rich Wolski, Micah Beck, Terence Moore, The Internet Backplane Protocol: Storage in the Network ,(1999)
Aliandro Lima, Walfredo Cirne, Francisco Brasileiro, Daniel Fireman, A case for event-driven distributed objects international conference on move to meaningful internet systems. pp. 1705- 1721 ,(2006) , 10.1007/11914952_46
Daniel Paranhos da Silva, Walfredo Cirne, Francisco Vilar Brasileiro, Trading Cycles for Information: Using Replication to Schedule Bag-of-Tasks Applications on Computational Grids european conference on parallel processing. pp. 169- 180 ,(2003) , 10.1007/978-3-540-45209-6_26
Elizeu Santos-Neto, Walfredo Cirne, Francisco Brasileiro, Aliandro Lima, Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids job scheduling strategies for parallel processing. pp. 210- 232 ,(2004) , 10.1007/11407522_12