Task selection for a multiscalar processor

Authors: Gurindar S. Sohi, T. N. Vijaykumar

DOI: 10.5555/290940.290963

Keywords: Instruction-level parallelism, Control flow, Compiler, Computer science, Computer architecture, Heuristics

Abstract: The multiscalar architecture advocates a distributed processor organization and task-level speculation to exploit high degrees of instruction level parallelism (ILP) in sequential programs without impeding improvements in clock speeds. The main goal of this paper is to understand the key implications of the architectural features for compiler task selection from the point of view of performance. We identify the fundamental performance issues to be: control flow speculation, data communication, data dependence speculation, load imbalance, and task overhead. We show that these issues are intimately related to a few key characteristics of tasks: task size, inter-task control flow, and inter-task data dependence. We describe compiler heuristics to select tasks with favorable characteristics. We report experimental results to show that the heuristics are successful in boosting overall performance by establishing larger ILP windows.

References (23)
Alfred V. Aho, Ravi Sethi, Jeffrey D. Ullman. Compilers: Principles, Techniques, and Tools (1986).
E. P. Markatos, T. J. LeBlanc. Load Balancing vs. Locality Management in Shared-Memory Multiprocessors. International Conference on Parallel Processing, pp. 258-267 (1991).
Jenn-Yuan Tsai, Pen-Chung Yew. The superthreaded architecture: thread pipelining with run-time data dependence checking and control speculation. International Conference on Parallel Architectures and Compilation Techniques, pp. 35-46 (1996). DOI: 10.1109/PACT.1996.552553
J. G. Steffan, T. C. Mowry. The potential for using thread-level data speculation to facilitate automatic parallelization. High-Performance Computer Architecture, pp. 2-13 (1998). DOI: 10.1109/HPCA.1998.650541
Manoj Franklin, Gurindar S. Sohi. The expandable split window paradigm for exploiting fine-grain parallelism. Proceedings of the 19th Annual International Symposium on Computer Architecture (ISCA '92), vol. 20, pp. 58-67 (1992). DOI: 10.1145/139669.139703
Vivek Sarkar, John Hennessy. Partitioning parallel programs for macro-dataflow. International Conference on Functional Programming, pp. 202-211 (1986). DOI: 10.1145/319838.319863
P. Y. T. Hsu, E. S. Davidson. Highly concurrent scalar processing. International Symposium on Computer Architecture, vol. 14, pp. 386-395 (1986). DOI: 10.1145/17356.17401
E. Waingold, M. Taylor, D. Srikrishna, V. Sarkar, W. Lee, V. Lee, J. Kim, M. Frank, P. Finch, R. Barua, J. Babb, S. Amarasinghe, A. Agarwal. Baring it all to software: Raw machines. IEEE Computer, vol. 30, pp. 86-93 (1997). DOI: 10.1109/2.612254