The shared regions approach to software cache coherence on multiprocessors

作者: Harjinder S. Sandhu , Benjamin Gamsa , Songnian Zhou

DOI: 10.1145/155332.155356

关键词:

摘要: The effective management of caches is critical to the performance applications on shared-memory multiprocessors. In this paper, we discuss a technique for software cache coherence tht based upon integration program-level abstraction shared data with . abstraction, called Shared Regions, explicitly relates synchronization objects they protect. Cache algorithms are presented which use information provided by region primitives, and ensure that regions always cacheable processors accessing them. Measurements experiments Region approach multiprocessors multiprocessor shown. Comparisons other strategies, including user-controlled strategy an operating system-based strategy, show able deliver better performance, relatively low corresponding overhead only small increase in programming effort. Compared compiler-based Regions still performs than compiler can achieve 90% accuracy allowing cacheing, as long few hundred bytes or larger, re-used times cache.

参考文章(18)
Steve Karlovsky, Kevin P. McAuliffe, Ron Cytron, Automatic Management of Programmable Caches. international conference on parallel processing. pp. 229- 238 ,(1988)
Steve McGrogan, Robert Olson, Nell Toda, Parallelizing Large Existing Programs: Methodology and Experiences. COMPCON. pp. 458- 466 ,(1986)
Gottlieb, Grishman, Kruskal, McAuliffe, Rudolph, Snir, The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer IEEE Transactions on Computers. ,vol. 32, pp. 175- 189 ,(1983) , 10.1109/TC.1983.1676201
Allan Gottlieb, Ralph Grishman, Clyde P. Kruskal, Kevin P. McAuliffe, Larry Rudolph, Marc Snir, The NYU Ultracomputer—designing a MIMD, shared-memory parallel machine (Extended Abstract) ACM SIGARCH Computer Architecture News. ,vol. 10, pp. 27- 42 ,(1982) , 10.1145/1067649.801711
Michael J. Feeley, Henry M. Levy, Distributed shared memory with versioned objects conference on object oriented programming systems languages and applications. ,vol. 27, pp. 247- 262 ,(1992) , 10.1145/141936.141957
M. Reiser, S. S. Lavenberg, Mean-Value Analysis of Closed Multichain Queuing Networks Journal of the ACM. ,vol. 27, pp. 313- 322 ,(1980) , 10.1145/322186.322195
H. Cheong, A.V. Veidenbaum, Compiler-directed cache management in multiprocessors IEEE Computer. ,vol. 23, pp. 39- 47 ,(1990) , 10.1109/2.55499
Monica S. Lam, Martin C. Rinard, Coarse-grain parallel programming in Jade acm sigplan symposium on principles and practice of parallel programming. ,vol. 26, pp. 94- 105 ,(1991) , 10.1145/109625.109636
Sarita V. Adve, Vikram S. Adve, Mark D. Hill, Mary K. Vernon, Comparison of hardware and software cache coherence schemes Proceedings of the 18th annual international symposium on Computer architecture - ISCA '91. ,vol. 19, pp. 298- 308 ,(1991) , 10.1145/115952.115982
D. R. Cheriton, H. A. Goosen, P. D. Boyle, Multi-level shared caching techniques for scalability in VMP-M/C international symposium on computer architecture. ,vol. 17, pp. 16- 24 ,(1989) , 10.1145/74925.74928