The Performance of SCI Memory Hierarchies

作者: R.A. Hexsel , Nigel Topham

DOI:

关键词:

摘要: This paper presents a simulation-based performance evaluation of shared-memory multiprocessor using the Scalable Coherent Interface (IEEE 1596). The machines are assembled with one to 16 processors connected in ring. multiprocessor's memory hierarchy consists split primary caches, coherent secondary caches and memory. For workload two parallel loops three thread-based programs, cache latency has strongest impact on performance. programs high miss ratios, 16-node rings exhibit network congestion whereas 4and 8-node perform better. With these same doubling processor speed yields between 20 70% gains higher smaller rings.

参考文章(19)
Ross Evan Johnson, Extending the scalable coherent interface for large-scale shared-memory multiprocessors University of Wisconsin at Madison. ,(1993)
Clyde P. Kruskal, Richard C. Paige, Parallel Algorithms for Shortest Path Problems. international conference on parallel processing. pp. 14- 20 ,(1985)
Kai Hwang, Advanced Computer Architecture: Parallelism,Scalability,Programmability McGraw-Hill Higher Education. ,(1992)
John L. Hennessy, David A. Patterson, Computer Architecture: A Quantitative Approach ,(1989)
D.V. James, A.T. Laundrie, S. Gjessing, G.S. Sohi, Distributed-directory scheme: scalable coherent interface IEEE Computer. ,vol. 23, pp. 74- 77 ,(1990) , 10.1109/2.55503
Daniel A. Menasće, Luiz André Barroso, A methodology for performance evaluation of parallel applications on multiprocessors Journal of Parallel and Distributed Computing. ,vol. 14, pp. 1- 14 ,(1992) , 10.1016/0743-7315(92)90093-3
Luis André Barroso, Michel Dubois, The performance of cache-coherent ring-based multiprocessors Proceedings of the 20th annual international symposium on Computer architecture - ISCA '93. ,vol. 21, pp. 268- 277 ,(1993) , 10.1145/165123.165162
Daniel Lenoski, James Laudon, Truman Joe, David Nakahira, Luis Stevens, Anoop Gupta, John Hennessy, The DASH prototype Proceedings of the 19th annual international symposium on Computer architecture - ISCA '92. ,vol. 20, pp. 418- 429 ,(1992) , 10.1145/139669.139706
Steven Lee Scott, Toward the design of large-scale shared-memory multiprocessors University of Wisconsin at Madison. ,(1992)
Håkon O. Bugge, Ernst H. Kristiansen, Bjørn O. Bakka, Trace-driven simulations for a two-level cache design in open bus systems ACM SIGARCH Computer Architecture News. ,vol. 18, pp. 250- 259 ,(1990) , 10.1145/325096.325151