Row-based configuration mechanism for a 2-D processing element array in coarse-grained reconfigurable architecture

作者: LeiBo Liu , YanSheng Wang , ShouYi Yin , Min Zhu , Xing Wang

DOI: 10.1007/S11432-013-4973-8

关键词:

摘要: Using the coarser operand grain and simplified interconnection patterns, CGRA (coarse grained reconfigurable architectures) has been proven to be energy efficient in several specific domains. As we know, speed at which contexts are applied a PEA (processing element array) directly determines performance of CGRA. In this paper, design space is further developed from configuration granularity perspective by one middle-grained granularity—the row-based mechanism (RCM). The most prominent feature RCM that large DFG (data flow graph) can mapped onto small array once reconfiguration, carried out on row-by-row basis. Compared with an ordinary DFG-partitioning solution, reconfiguration time data transfer well reduced. Furthermore, proposed offers much more storage for contexts. partitioning boosted 2.6% 57.8%, while area penalty only 4.79% power 7.22%. used processor called REMUS HPA (reconfigurable multi-media system, high version advanced). implemented 50.5 mm2 silicon TSMC 65 nm technology. Simulation shows 1920×1088@37 fps achieved H.264 high-profile decoding when exploiting 200 MHz working frequency. XPP (one commercial processor), 247% boosted.

参考文章(25)
Bjorn De Sutter, Praveen Raghavan, Andy Lambrechts, Coarse-Grained Reconfigurable Array Architectures signal processing systems. pp. 553- 592 ,(2013) , 10.1007/978-1-4419-6345-1_17
Ming-Hau Lee, Hartej Singh, Guangming Lu, Nader Bagherzadeh, Fadi J. Kurdahi, Eliseu M.C. Filho, Vladimir Castro Alves, Design and Implementation of the MorphoSys Reconfigurable ComputingProcessor signal processing systems. ,vol. 24, pp. 147- 164 ,(2000) , 10.1023/A:1008189221436
R. Hartenstein, M. Herz, T. Hoffmann, U. Nageldinger, Mapping Applications onto Reconfigurable Kress Arrays field-programmable logic and applications. pp. 385- 390 ,(1999) , 10.1007/978-3-540-48302-1_42
Xinning LIU, Chen MEI, Peng CAO, Min ZHU, Longxing SHI, Date Flow Optimization of Dynamically Coarse Grain Reconfigurable Architecture for Multimedia Applications IEICE Transactions on Information and Systems. ,vol. 95, pp. 374- 382 ,(2012) , 10.1587/TRANSINF.E95.D.374
John Shield, Peter Sutton, Philip Machanick, Dynamic Cache Switching in Reconfigurable Embedded Systems field-programmable logic and applications. pp. 111- 116 ,(2007) , 10.1109/FPL.2007.4380634
João M. P. Cardoso, Pedro C. Diniz, Markus Weinhardt, Compiling for reconfigurable computing: A survey ACM Computing Surveys. ,vol. 42, pp. 13- ,(2010) , 10.1145/1749603.1749604
Bo LIU, Peng CAO, Min ZHU, Jun YANG, Leibo LIU, Shaojun WEI, Longxing SHI, Reconfiguration Process Optimization of Dynamically Coarse Grain Reconfigurable Architecture for Multimedia Applications IEICE Transactions on Information and Systems. ,vol. 95, pp. 1858- 1871 ,(2012) , 10.1587/TRANSINF.E95.D.1858
Mahendra Kumar Angamuthu Ganesan, Sundeep Singh, Frank May, Jurgen Becker, H. 264 Decoder at HD Resolution on a Coarse Grain Dynamically Reconfigurable Architecture field-programmable logic and applications. pp. 467- 471 ,(2007) , 10.1109/FPL.2007.4380691
S.C. Goldstein, H. Schmit, M. Budiu, S. Cadambi, M. Moe, R.R. Taylor, PipeRench: a reconfigurable architecture and compiler Computer. ,vol. 33, pp. 70- 77 ,(2000) , 10.1109/2.839324
C. Ebeling, C. Fisher, Guanbin Xing, Manyuan Shen, Hui Liu, Implementing an OFDM receiver on the RaPiD reconfigurable architecture IEEE Transactions on Computers. ,vol. 53, pp. 1436- 1448 ,(2004) , 10.1109/TC.2004.98