Cache performance in vector supercomputers

Authors: Leonidas I Kontothanassis, Rabin A Sugumar, GJ Faanes, James E Smith, Michael L Scott

Abstract: Traditional supercomputers use a flat multi-bank SRAM memory organization to supply high bandwidth at low latency. Most other computers use a hierarchical organization with a small SRAM cache and a slower, cheaper DRAM for the main memory. Such systems rely heavily on data locality for achieving optimum performance. This paper evaluates cache-based memory systems for vector supercomputers. We develop a simulation model for a cache-based version of the Cray Research C90 and use the NAS parallel benchmarks to provide a large-scale workload. We show that while caches reduce memory traffic and improve the performance of plain DRAM memory, they still lag behind cacheless SRAM. We identify the performance bottlenecks in DRAM-based memory systems and quantify their contribution to program performance degradation. We find the data fetch strategy to be a significant parameter affecting …
