Unifying data and control transformations for distributed shared-memory machines

作者: Michał Cierniak , Wei Li

DOI: 10.1145/207110.207145

关键词:

摘要: We present a unified approach to locality optimization that employs both data and control transformations. Data transformations include changing the array layout in memory. Control involve execution order of programs. have developed new techniques for compiler optimizations distributed shared-memory machines, although same can be used sequential machines with memory hierarchy.Our are based on an algebraic representation mappings model. pure transformation algorithm unifying While there has been much work transformations, opportunities largely neglected. In fact, advantage being applicable programs cannot optimized The algorithm, which performs simultaneously, offers improvement over obtained by applying separately.The experimental results using set applications parallel machine show improve performance significantly. These further analyzed metrics instrumentation simulation.

参考文章(35)
William J. Bolosky, Michael L. Scott, False sharing and its effect on shared memory performance Sedms'93 USENIX Systems on USENIX Experiences with Distributed and Multiprocessor Systems - Volume 4. pp. 3- 3 ,(1993)
K. W. Kennedy, Allan Kennedy Porterfield, Software methods for improvement of cache performance on supercomputer applications Rice University. ,(1989)
Tor E. Jeremiassen, Susan J. Eggers, Eliminating False Sharing. international conference on parallel processing. pp. 377- 381 ,(1991)
Wei Li, Compiler Optimizations for Cache Locality and Coherence University of Rochester. ,(1994)
John L. Hennessy, David A. Patterson, Computer Architecture: A Quantitative Approach ,(1989)
Anna Nagurney, Charles F. Nicholson, Phillip M. Bishop, Spatial Price Equilibrium Models with Discriminatory Ad Valorem Tariffs: Formulation and Comparative Computation Using Variational Inequalities Recent Advances in Spatial Equilibrium Modelling. pp. 179- 200 ,(1996) , 10.1007/978-3-642-80080-1_9
Thomas J. Watson IBM Research Center, On Estimating and Enhancing Cache Effectiveness languages and compilers for parallel computing. pp. 328- 343 ,(1991) , 10.1007/BFB0038674
J. Li, M. Chen, Index domain alignment: minimizing cost of cross-referencing between distributed arrays symposium on frontiers of massively parallel computation. pp. 424- 433 ,(1990) , 10.1109/FMPC.1990.89493
Wei Li, Compiler cache optimizations for banded matrix problems Proceedings of the 9th international conference on Supercomputing - ICS '95. pp. 21- 30 ,(1995) , 10.1145/224538.224541
Steven Mark Carr, Memory-hierarchy management Rice University. ,(1993)