CRIB

作者: Erika Gunadi , Mikko H. Lipasti

DOI: 10.1145/2000064.2000068

关键词: Parallel computingMicroarchitectureRegister fileEmbedded systemComputer scienceDatapathRenameOut-of-order executionRegister renamingScheduling (computing)

摘要: Conventional high-performance processors utilize register renaming and complex broadcast-based scheduling logic to steer instructions into a small number of heavily-pipelined execution lanes. This requires multiple structures repeated dependency resolution, imposing significant dynamic power overhead. paper advocates in-place instructions, power-saving, pipeline-free approach that consolidates rename, issue, bypass one structure---the CRIB---while simultaneously eliminating the need for multiported file, instead storing architected state in simple rank latches. CRIB achieves high IPC an out-of-order machine while keeping core clean, simple, low power. The datapath within structure is purely combinational, most clocked elements fully synchronous yet high-frequency design. Experimental results match cycle time baseline design reducing energy consumption by more than 60% affected structures.

参考文章(81)
Nam Sung Kim, Trevor Mudge, Reducing register ports using delayed write-back queues and operand pre-fetch international conference on supercomputing. pp. 172- 182 ,(2003) , 10.1145/782814.782839
Isidro González, Marco Galluzzi, Alex Veidenbaum, Marco A Ramirez, Adrián Cristal, Mateo Valero, None, A distributed processor state management architecture for large-window processors international symposium on microarchitecture. pp. 11- 22 ,(2008) , 10.1109/MICRO.2008.4771775
Mayan Moudgill, Keshav Pingali, Stamatis Vassiliadis, Register renaming and dynamic speculation: an alternative approach international symposium on microarchitecture. pp. 202- 213 ,(1993) , 10.5555/255235.255288
Srikanth T. Srinivasan, Ravi Rajwar, Haitham Akkary, Amit Gandhi, Mike Upton, Continual flow pipelines Proceedings of the 11th international conference on Architectural support for programming languages and operating systems - ASPLOS-XI. ,vol. 38, pp. 107- 119 ,(2004) , 10.1145/1024393.1024407
H. Patil, R. Cohn, M. Charney, R. Kapoor, A. Sun, A. Karunanidhi, Pinpointing Representative Portions of Large Intel® Itanium® Programs with Dynamic Instrumentation international symposium on microarchitecture. pp. 81- 92 ,(2004) , 10.1109/MICRO.2004.28
Ramon Canal, Antonio González, James E. Smith, Very low power pipelines using significance compression Proceedings of the 33rd annual ACM/IEEE international symposium on Microarchitecture - MICRO 33. ,vol. 2000, pp. 181- 190 ,(2000) , 10.1145/360128.360147
William H. Mangione-Smith, Johnson Kin, Munish Gupta, The filter cache: an energy efficient memory structure international symposium on microarchitecture. pp. 184- 193 ,(1997) , 10.5555/266800.266818
Timothy Sherwood, Erez Perelman, Greg Hamerly, Brad Calder, Automatically characterizing large scale program behavior Tenth international conference on architectural support for programming languages and operating systems on Proceedings of the 10th international conference on architectural support for programming languages and operating systems (ASPLOS-X) - ASPLOS '02. ,vol. 30, pp. 45- 57 ,(2002) , 10.1145/605397.605403
R. Iris Bahar, Gianluca Albera, Srilatha Manne, Power and performance tradeoffs using various caching strategies international symposium on low power electronics and design. pp. 64- 69 ,(1998) , 10.1145/280756.295115
Dana S. Henry, Bradley C. Kuszmaul, Gabriel H. Loh, Rahul Sami, Circuits for wide-window superscalar processors international symposium on computer architecture. ,vol. 28, pp. 236- 247 ,(2000) , 10.1145/339647.339689