A memory system design framework

作者: Amin Firoozshahian , Alex Solomatnikov , Ofer Shacham , Zain Asgar , Stephen Richardson

DOI: 10.1145/1555754.1555805

关键词:

摘要: As CPU cores become building blocks, we see a great expansion in the types of on-chip memory systems proposed for CMPs. Unfortunately, designing cache and protocol controllers to support these is complex, their concurrency latency characteristics significantly affect performance any CMP. To address this problem, paper presents microarchitecture framework controllers, which can aid generating RTL new systems. The consists three pipelined engines' request-tracking, state-manipulation, data movement' are programmed implement higher-level model. This approach simplifies design verification CMP by decomposing model into sequences state manipulations. Moreover, implementing itself produces polymorphic system.To validate approach, implemented scalable, flexible silicon. system was then disparate models' coherent shared memory, streams transactional memory. Measured overheads seem promising. Our generates with less than 20% compared an ideal controller zero internal latency. Even overhead directly fully programmable modest. While it did double controller's area, amortized effective area grew roughly 7%.

参考文章(30)
Jim Larus, Ravi Rajwar, Transactional Memory (Synthesis Lectures on Computer Architecture) Morgan & Claypool Publishers. ,(2007)
D. Pham, S. Asano, M. Bolliger, M.N. Day, H.P. Hofstee, C. Johns, J. Kahle, A. Kameyama, J. Keaty, Y. Masubuchi, M. Riley, D. Shippy, D. Stasiak, M. Suzuoki, M. Wang, J. Warnock, S. Weitzel, D. Wendel, T. Yamazaki, K. Yazawa, The design and implementation of a first-generation CELL processor international solid-state circuits conference. pp. 184- 592 ,(2005) , 10.1109/ISSCC.2005.1493930
David E. Culler, Jaswinder Pal Singh, Anoop Gupta, Parallel Computer Architecture: A Hardware/Software Approach ,(1998)
Lance Hammond, Vicky Wong, Mike Chen, Brian D. Carlstrom, John D. Davis, Ben Hertzberg, Manohar K. Prabhu, Honggo Wijaya, Christos Kozyrakis, Kunle Olukotun, Transactional Memory Coherence and Consistency ACM SIGARCH Computer Architecture News. ,vol. 32, pp. 102- 113 ,(2004) , 10.1145/1028176.1006711
P. Conway, B. Hughes, The AMD Opteron Northbridge Architecture IEEE Micro. ,vol. 27, pp. 10- 21 ,(2007) , 10.1109/MM.2007.43
P. Kongetira, K. Aingaran, K. Olukotun, Niagara: a 32-way multithreaded Sparc processor IEEE Micro. ,vol. 25, pp. 21- 29 ,(2005) , 10.1109/MM.2005.35
L. Hammond, B.A. Hubbert, M. Siu, M.K. Prabhu, M. Chen, K. Olukolun, The Stanford Hydra CMP IEEE Micro. ,vol. 20, pp. 71- 84 ,(2000) , 10.1109/40.848474
B. Towles, A. Chang, S. Rixner, B. Khailany, W.J. Dally, U.J. Kapasi, P. Mattson, J. Namkoong, J.D. Owens, Imagine: media processing with streams IEEE Micro. ,vol. 21, pp. 35- 46 ,(2001) , 10.1109/40.918001
J. Gregory Steffan, Christopher Colohan, Antonia Zhai, Todd C. Mowry, The STAMPede approach to thread-level speculation ACM Transactions on Computer Systems. ,vol. 23, pp. 253- 300 ,(2005) , 10.1145/1082469.1082471
J. Carter, W. Hsieh, L. Stoller, M. Swanson, Lixin Zhang, E. Brunvand, A. Davis, Chen-Chi Kuo, R. Kuramkote, M. Parker, L. Schaelicke, T. Tateyama, Impulse: building a smarter memory controller high-performance computer architecture. pp. 70- 79 ,(1999) , 10.1109/HPCA.1999.744334