ASAP: Automatic speculative acyclic parallelization for clusters

作者: Hanjun Kim

DOI:

关键词:

摘要: While clusters of commodity servers and switches are the most popular form large-scale parallel computers, many programs not easily parallelized for due to high internode communication cost lack globally shared memory. Speculative Decoupled Software Pipelining (Spec-DSWP) is a promising automatic parallelization technique that speculatively partitions loop into multiple threads communicate in pipelined manner. Speculation can complement conservative static analysis, making more robust applicable. allows Spec-DSWP speculate only rarely occurring dependences while respecting other through among threads. Acyclic patterns pipelining make tolerant latency clusters. However, since iteration (a transaction) sub-transactions across according pipeline stages, special runtime system required supports multi-threaded transactions (MTXs). This dissertation proposes Automatic Parallelization (ASAP) enables without any hardware modification. The ASAP various speculation techniques require different validation costs, automatically parallelizes sequential loops using SpecDSWP transformation with optimal application techniques. efficiently MTXs correctly execute transformed on With synergistic combination speculation, acyclic communication, support, this approach achieves or demonstrates path achieve scalable performance speedup up 109× wide range applications

参考文章(81)
Silvius Rus, Lawrence Rauchwerger, Jay Hoeflinger, Hybrid analysis: static & dynamic memory reference analysis International Journal of Parallel Programming. ,vol. 31, pp. 251- 283 ,(2003) , 10.1023/A:1024597010150
Katherine Yelick, William Carlson, Thomas Sterling, Tarek El-Ghazawi, UPC: Distributed Shared Memory Programming (Wiley Series on Parallel and Distributed Computing) UPC: Distributed Shared Memory Programming (Wiley Series on Parallel and Distributed Computing). ,(2005)
Mohamed M. Saad, Binoy Ravindran, Snake: control flow distributed software transactional memory international conference on stabilization safety and security of distributed systems. pp. 238- 252 ,(2011) , 10.1007/978-3-642-24550-3_19
Stanford University. Computer Systems Laboratory, Software and Hardware for Exploiting Speculative Parallelism with a Multiprocessor Stanford University. ,(1997)
Alain Darte, Frédéric Vivien, Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs International Journal of Parallel Programming. ,vol. 25, pp. 447- 496 ,(1997) , 10.1023/A:1025168022993
Bo Zhang, Binoy Ravindran, Brief Announcement: Relay: A Cache-Coherence Protocol for Distributed Transactional Memory international conference on principles of distributed systems. pp. 48- 53 ,(2009) , 10.1007/978-3-642-10877-8_6
Pierre Boulet, Michèle Dion, Code generation in Bouclettes euromicro workshop on parallel and distributed processing. pp. 273- 280 ,(1997)
Gabriel Rivera, Chau-Wen Tseng, A Comparison of Compiler Tiling Algorithms compiler construction. pp. 168- 182 ,(1999) , 10.1007/978-3-540-49051-7_12
John L. Hennessy, David A. Patterson, Computer Organization and Design: the Hardware/Software Interface ,(1993)