Using the compiler to improve cache replacement decisions

Authors: Kathryn S. McKinley, Zhenlin Wang, Arnold L. Rosenberg, Charles C. Weems

DOI: 10.5555/645989.674328

Abstract: Memory performance is increasingly determining microprocessor performance, and technology trends are exacerbating this problem. Most architectures use set-associative caches with LRU replacement policies to combine fast access with relatively low miss rates. To improve replacement decisions in set-associative caches, we develop a new set of compiler algorithms that predict which data will not be reused and provide these hints to the architecture. We prove that the hints either match or improve hit rates over LRU. We describe a practical one-bit cache-line tag implementation of our algorithm, called evict-me. On a cache replacement, the architecture will replace a line for which the evict-me bit is set, or, if none is set, it will use the LRU bits. We implement our compiler analysis and its output in the Scale compiler. On a variety of scientific programs, using the evict-me algorithm in both the level 1 and level 2 caches improves simulated cycle times by up to 34% over the LRU policy by increasing hit rates. In addition, a combination of simple hardware prefetching and evict-me works together to further improve performance.
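The replacement rule described in the abstract (evict a line whose evict-me bit is set, otherwise fall back to LRU) can be illustrated with a minimal sketch of a single cache set. This is not the authors' implementation: the class name `EvictMeSet`, the `access` method, and its `evict_me` parameter are hypothetical. Two details are assumptions that the abstract does not specify: when several lines have the bit set, the least recently used of them is chosen, and every access (hit or miss) overwrites the line's bit with the hint it carries.

```python
from collections import OrderedDict


class EvictMeSet:
    """One set of a set-associative cache: LRU order plus a one-bit evict-me tag.

    Hypothetical illustration only; names and tie-breaking are assumptions.
    """

    def __init__(self, ways):
        self.ways = ways
        # Maps tag -> evict-me bit. Dict order tracks recency, oldest first.
        self.lines = OrderedDict()

    def access(self, tag, evict_me=False):
        """Touch `tag` with a compiler hint; return True on a hit, False on a miss."""
        hit = tag in self.lines
        if hit:
            self.lines.move_to_end(tag)  # mark as most recently used
        elif len(self.lines) >= self.ways:
            self.lines.pop(self._victim())  # set is full, make room
        # Assumption: the hint on this access overwrites the line's bit.
        self.lines[tag] = evict_me
        return hit

    def _victim(self):
        # Prefer a line whose evict-me bit is set
        # (assumption: the least recently used such line).
        for tag, bit in self.lines.items():
            if bit:
                return tag
        # No bit is set, so fall back to plain LRU.
        return next(iter(self.lines))
```

In a 2-way set, accessing `A`, then `B` with `evict_me=True`, then `C` evicts `B` rather than the older `A`, so a later access to `A` still hits. Plain LRU would have evicted `A` instead.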
