Extending the scalable coherent interface for large-scale shared-memory multiprocessors

作者: Ross Evan Johnson

DOI:

关键词: Distributed computingShared memoryNetwork topologyProgrammerCacheComputer scienceMassively parallelInteroperabilityBinary treeScalable Coherent Interface

摘要: Massively parallel machines promise to provide enormous computing power using an amalgamation of low-cost parts. We believe many these will be shared-memory machines, since they do not burden the programmer with data placement and nonuniform access semantics. However, efficient kiloprocessor solution for paradigm has proven elusive due bottlenecks associated accesses rapidly changing data. The Scalable Coherent Interface (SCI) is IEEE ANSI standard multiprocessors, specifying a topology-independent network cache-coherence protocol. The goal this dissertation investigate ways efficiently share frequently among thousands processors. SCI platform in which methods are investigated. Before investigating protocols, we demonstrate that arbitrary topology can constructed from set interwoven rings, such as rings. This result important because it would impossible realize performance advantages our new protocols without network. Our investigation rings leads scheme deadlock avoidance does require resource partitioning. also compare various topologies. Next, two employ trees cache lines. first adds shortcuts, called temporary pointers, list created by SCI. several protocol variations request combining temporary-pointer structures. give lower bounds on latency numerous sets assumptions against variations. second transforms into probabilistically balanced, binary tree. Again, variations, comparing derive, extend support fetch-and-add severe constraints other mechanisms. show both compatible networks correctly interoperate SCI's protocol. We have small increase traffic. They similar or than SCI, significantly when there even amount global sharing.

参考文章(104)
Gregory F. Pfister, Manoj Kumar, The Onset of Hot-Spot Contention. international conference on parallel processing. pp. 28- 34 ,(1986)
James R Goodman, Ross E Johnson, Interconnect Topologies With Point-To-Point Rings University of Wisconsin-Madison Department of Computer Sciences. ,(1991)
Joseph E. Hoag, Eugene D. Brooks, A Scalable Coherent Cache System With Incomplete Directory State. international conference on parallel processing. pp. 553- 554 ,(1990)
James R. Goodman, Ross E. Johnson, Synthesizing General Topologies from Rings. international conference on parallel processing. pp. 86- 95 ,(1992)
Tom Lovett, Shreekant S. Thakkar, The Symmetry Multiprocessor System. international conference on parallel processing. pp. 303- 310 ,(1988)
Arthur W. Burks, Herman H. Goldstine, John Neumann, Preliminary discussion of the logical design of an electronic computing instrument (1946) Perspectives on the computer revolution. pp. 39- 48 ,(1989) , 10.1007/978-3-642-61812-3_32
Stein Gjessing, Stein Krogdahl, Ellen Munthe-Kaas, A Top Down Approach to the Formal Specification of SCI Cache Coherence computer aided verification. pp. 83- 91 ,(1991) , 10.1007/3-540-55179-4_9
Censier, Feautrier, A New Solution to Coherence Problems in Multicache Systems IEEE Transactions on Computers. ,vol. 27, pp. 1112- 1118 ,(1978) , 10.1109/TC.1978.1675013
Jaswinder Pal Singh, Wolf-Dietrich Weber, Anoop Gupta, SPLASH: Stanford parallel applications for shared-memory ACM Sigarch Computer Architecture News. ,vol. 20, pp. 5- 44 ,(1992) , 10.1145/130823.130824
W. Daniel Hillis, Guy L. Steele, Data parallel algorithms Communications of the ACM. ,vol. 29, pp. 1170- 1183 ,(1986) , 10.1145/7902.7903