Work Stealing with Parallelism Feedback

作者: Kunal Agrawal , Charles E. Leiserson , Yuxiong He

DOI:

关键词:

摘要: We present a randomized work-stealing thread scheduler for fork-join multithreaded jobs that provides continual parallelism feedback to the job in form of processor requests. Our A-STEAL algorithm is appropriate large parallel servers where many share common multiprocessor resource and which number processors available particular may vary during job’s execution. Assuming never allots more than requested by scheduler, guarantees completes near-optimal time while utilizing at least constant fraction allotted processors. analysis models as scheduler’s adversary, challenging be robust system environment administrative policies. For example, can make huge exactly when has little use them. To analyze performance our adaptive under this stringent adversarial assumption, we introduce new technique called “trim analysis,” allows us prove performs poorly on most small steps, exhibiting behavior vast majority. precise, suppose work T1 critical-path length T∞. On machine with P processors, expected O(T1/P +T∞+ L lgP ) scheduling quantum denotes O(T∞ +L )-trimmed availability. This quantity average availability over all but + steps having highest When dominates trimmed available, is, ≪ T1/T∞, achieves nearly perfect linear speedup. Conversely, mean research was supported part Singapore-MIT Alliance NSF Grants ACI-0324974 CNS-0305606. Yuxiong He Visiting Scholar MIT CSAIL Ph.D. candidate National University Singapore. parallelism, asymptotic running its critical path. measured simulated using synthetic workloads. sufficient experiments indicate almost speedup across variety profiles. In these experiments, typically wasted less 20% cycles job. compared ABP algorithm, an workstealing developed Arora, Blumofe, Plaxton does not employ feedback. moderateto heavy-loaded machines predetermined profiles, completed twice quickly, despite being same or fewer every step, wasting only 10% ABP.

参考文章(63)
Jeff Edmonds, Xiaotie Deng, Donald D. Chinn, Tim Brecht, Non-clairvoyant multiprocessor scheduling of jobs with changing execution characteristics symposium on the theory of computing. ,(1997)
N. Bansal, K. Dhamdhere, J. Könemann, A. Sinha, Non-clairvoyant Scheduling for Minimizing Mean Slowdown symposium on theoretical aspects of computer science. pp. 260- 270 ,(2003) , 10.1007/3-540-36494-3_24
Mark S. Squillante, On the Benefits and Limitations of Dynamic Partitioning in Parallel Computer Systems job scheduling strategies for parallel processing. pp. 219- 238 ,(1995) , 10.1007/3-540-60153-8_31
Dionisios Papadopoulos, Robert D. Blumofe, The Performance of Work Stealing in Multiprogrammed Environments University of Texas at Austin. ,(1998)
Jeff Edmonds, Donald D. Chinn, Tim Brecht, Xiaotie Deng, Non-clair voy ant multiprocessor scheduling of jobs with changing execution characteristics Journal of Scheduling. ,vol. 6, pp. 231- 250 ,(2003) , 10.1023/A:1022952324290
Thu D. Nguyen, Raj Vaswani, John Zahorjan, Using Runtime Measured Workload Characteristics in Parallel Processor Scheduling job scheduling strategies for parallel processing. pp. 155- 174 ,(1996) , 10.1007/BFB0022293
Dror G. Feitelson, Packing Schemes for Gang Scheduling job scheduling strategies for parallel processing. pp. 89- 110 ,(1996) , 10.1007/BFB0022289
Wolfgang Kreutzer, Kasper Østerbye, BetaSIM: A framework for discrete event modelling and simulation Simulation Practice and Theory. ,vol. 6, pp. 573- 599 ,(1998) , 10.1016/S0928-4869(98)00009-3
Eric W. Parsons, Kenneth C. Sevcik, Multiprocessor Scheduling for High-Variability Service Time Distributions job scheduling strategies for parallel processing. pp. 127- 145 ,(1995) , 10.1007/3-540-60153-8_26
Robert D Blumofe, None, Executing multithreaded programs efficiently Massachusetts Institute of Technology. ,(1995)