Achieving High Instruction Cache Performance With An Optimizing Compiler

作者: W. W. Hwu , P. P. Chang

DOI: 10.1145/74925.74953

关键词:

摘要: Increasing the execution power requires a high instruction issue bandwidth, and decreasing encoding applying some code improving techniques cause expansion. Therefore, memory hierarchy performance has become an important factor of system performance. An placement algorithm been implemented in IMPACT-I (Illinois Microarchitecture Project using Advanced Compiler Technology - Stage I) C compiler to maximize sequential spatial localities, minimize mapping conflicts. This approach achieves low cache miss ratios traffic for small, fast caches with little hardware overhead. For ten realistic UNIX* programs, we report (average 0.5%) 8%) 2048-byte, direct-mapped 64-byte blocks. result compares favorably fully associative results reported by other researchers. We also present effect size, block sectoring, partial loading on The optimization is shown be stable across architectures different density.

参考文章(30)
John Hennessy, Norman Jouppi, Forest Baskett, John Gill, MIPS: A VLSI Processor Architecture VLSI Systems and Computations. pp. 337- 346 ,(1981) , 10.1007/978-3-642-68402-9_37
Patterson, Sequin, A VLSI RISC IEEE Computer. ,vol. 15, pp. 8- 21 ,(1982) , 10.1109/MC.1982.1654133
Frederick Chow, John Hennessy, Register allocation by priority-based coloring Proceedings of the 1984 SIGPLAN symposium on Compiler construction - SIGPLAN '84. ,vol. 39, pp. 222- 232 ,(1984) , 10.1145/502874.502896
R. J. Eickenmeyer, J. H. Patel, Performance evaluation of on-chip register and cache organizations international symposium on computer architecture. ,vol. 16, pp. 64- 72 ,(1988) , 10.1145/633625.52408
Alan Jay Smith, Cache evaluation and the impact of workload choice international symposium on computer architecture. ,vol. 13, pp. 64- 73 ,(1985) , 10.1145/327070.327132
D.B. Alpert, M.J. Flynn, Performance trade-offs for microprocessor cache memories IEEE Micro. ,vol. 8, pp. 44- 54 ,(1988) , 10.1109/40.7771
S.J. Hartley, Compile-time program restructuring in multiprogrammed virtual memory systems IEEE Transactions on Software Engineering. ,vol. 14, pp. 1640- 1644 ,(1988) , 10.1109/32.9051
Domenico Ferrari, Improving locality by critical working sets Communications of the ACM. ,vol. 17, pp. 614- 620 ,(1974) , 10.1145/361179.361195
P. Chow, M. Horowitz, Architectural tradeoffs in the design of MIPS-X international symposium on computer architecture. pp. 300- 308 ,(1987) , 10.1145/30350.30384