On index set splitting

作者: Martin Griebl , Paul Feautrier , Christian Lengauer

DOI: 10.5555/520793.825729

关键词:

摘要: There are many algorithms for the space-time mapping of nested loops. Some them even make optimal choices within their framework. We propose a preprocessing phase in polytope model, which extends model and yields mappings whose schedule is, some cases, orders magnitude faster. These cases dependence graph has small irregularities. The basic idea is to split iteration domain loop nests into parts with regular structure apply existing these individually. work based on seminal more limited context parallelization at code level. elevate level (our model), increases its applicability by providing clearer wider range an acceptable analysis cost. Index set splitting one facet effort extend power enable generation competitive target code.

参考文章(13)
Rumen Andonov, Sanjay Rajopadhye, Nicola Yanev, Optimal Orthogonal Tiling european conference on parallel processing. pp. 480- 490 ,(1998) , 10.1007/BFB0057891
Paul Feautrier, Automatic Parallelization in the Polytope Model The Data Parallel Programming Model: Foundations, HPF Realization, and Scientific Applications. pp. 79- 103 ,(1996) , 10.1007/3-540-61736-1_44
Christian Lengauer, Loop Parallelization in the Polytope Model international conference on concurrency theory. pp. 398- 416 ,(1993) , 10.1007/3-540-57208-2_28
Rajeev Barua, David Kranz, Anant Agarwal, Communication-minimal partitioning of parallel loops and data arrays for cache-coherent distributed-memory multiprocessors Languages and Compilers for Parallel Computing. pp. 350- 368 ,(1997) , 10.1007/BFB0017263
Martin Griebl, Christian Lengauer, The loop parallelizer LooPo—announcement Languages and Compilers for Parallel Computing. pp. 603- 604 ,(1997) , 10.1007/BFB0017283
Alain Darte, Frédéric Vivien, On the Optimality of Allen and Kennedy's Algorithm for Parallel Extraction in Nested Loops european conference on parallel processing. ,vol. 1123, pp. 379- 388 ,(1996) , 10.1007/3-540-61626-8_50
Paul Feautrier, Some efficient solutions to the affine scheduling problem: I. One-dimensional time International Journal of Parallel Programming. ,vol. 21, pp. 313- 348 ,(1992) , 10.1007/BF01407835
Corinne Ancourt, François Irigoin, Scanning polyhedra with DO loops acm sigplan symposium on principles and practice of parallel programming. ,vol. 26, pp. 39- 50 ,(1991) , 10.1145/109625.109631
William Pugh, David Wonnacott, Eliminating false data dependences using the Omega test programming language design and implementation. ,vol. 27, pp. 140- 151 ,(1992) , 10.1145/143095.143129
Zaher Mahjoub, Mohamed Jemni, On the parallelization of single dynamic conditional loops Simulation Practice and Theory. ,vol. 4, pp. 141- 154 ,(1996) , 10.1016/0928-4869(95)00041-0