Exploiting Single-Usage for Effective Memory Management

作者: Thomas Piquet , Olivier Rochecouste , André Seznec , None

DOI: 10.1007/978-3-540-74309-5_11

关键词:

摘要: Efficient memory management is crucial when designing high performance processors. Upon a miss, the conventional operation mode of cache hierarchy to retrieve missing block from lower levels and store it into all levels. It however difficult assert that storing intermediate will be really useful. In particular, this unnecessary if accessed only once before getting evicted - i.e. single-usage block. This paper typically concerned with reducing number blocks. Our observations reveal blocks are significant at runtime especially in lowest level. We show using an address-based prediction mechanism sufficient identify phenomenon. Two schemes examined remove pollution caused by blocks: bypass scheme replacement policy. results leveraging beneficial memory-intensive applications running on superscalar multi-core architectures.

参考文章(14)
Haakon Dybdahl, Per Stenström, Enhancing Last-Level Cache Performance by Block Bypassing and Early Miss Determination Advances in Computer Systems Architecture. pp. 52- 66 ,(2006) , 10.1007/11859802_6
Lisa R. Hsu, Steven K. Reinhardt, Ravishankar Iyer, Srihari Makineni, Communist, utilitarian, and capitalist cache policies on CMPs: caches as a shared resource international conference on parallel architectures and compilation techniques. pp. 13- 22 ,(2006) , 10.1145/1152154.1152161
Allan Snavely, Dean M. Tullsen, Geoff Voelker, Symbiotic jobscheduling with priorities for a simultaneous multithreading processor measurement and modeling of computer systems. ,vol. 30, pp. 66- 76 ,(2002) , 10.1145/511334.511343
José-Lorenzo Cruz, Antonio González, Mateo Valero, Nigel P. Topham, Multiple-banked register file architectures international symposium on computer architecture. ,vol. 28, pp. 316- 325 ,(2000) , 10.1145/339647.339708
J.A. Rivers, E.S. Davidson, Reducing conflicts in direct-mapped caches with a temporality-based design international conference on parallel processing. ,vol. 1, pp. 154- 163 ,(1996) , 10.1109/ICPP.1996.537156
John W. C. Fu, Janak H. Patel, Bob L. Janssens, Stride directed prefetching in scalar processors ACM SIGMICRO Newsletter. ,vol. 23, pp. 102- 110 ,(1992) , 10.1145/144965.145006
Matthew Farrens, Andrew R. Pleszkun, Gary Tyson, John Matthews, A modified approach to data cache management international symposium on microarchitecture. pp. 93- 103 ,(1995) , 10.5555/225160.225177
William Y. Chen, Scott A. Mahlke, Pohua P. Chang, Wen-mei W. Hwu, Data access microarchitectures for superscalar processors with compiler-assisted data prefetching Proceedings of the 24th annual international symposium on Microarchitecture - MICRO 24. pp. 69- 73 ,(1991) , 10.1145/123465.123478
W.A. Wong, J.-L. Baer, Modified LRU policies for improving second-level cache behavior high performance computer architecture. pp. 49- 60 ,(2000) , 10.1109/HPCA.2000.824338
C.-H. Chi, H. Dietz, Improving cache performance by selective cache bypass hawaii international conference on system sciences. ,vol. 1, pp. 277- 285 ,(1989) , 10.1109/HICSS.1989.47168