SMEFF: A scalable memory extension fabric for FPGA

Authors: Wei Li, Yangyang Zhao, Yuhang Liu, Mingyu Chen

DOI: 10.1109/FPT.2017.8280119

Keywords:

Abstract: In resource-constrained FPGA systems, off-chip memory plays an important role in both prototype verification and acceleration systems for big data. As the scale of applications becomes increasingly large and complex, the data to be processed grows exponentially. In contrast, FPGAs provide limited memory capacity and bandwidth, severely limiting the performance of FPGA systems. Furthermore, data movement is expected to be a dominant consumer of energy, so inefficient data movement between different DRAM modules also incurs significant energy penalties. This paper proposes a practical design: A Scalable Memory Extension Fabric (SMEFF), which uses an asynchronous memory access mechanism and exploits cascaded memory extension technology to solve the problem of limited memory capacity and bandwidth. SMEFF uses two key technologies to achieve capacity and bandwidth improvements and to shrink the latency overhead of data movement. The first is an FPGA-based high-speed serial bus, used to build a multi-level memory extension fabric instead of a traditional parallel bus, which solves the signal integrity problem. The second is memory module to memory module (M-To-M) DMA technology, which reduces the overhead of data movement between memory modules. We implement SMEFF on FPGA to demonstrate the feasibility of our approach. Experimental results show that SMEFF provides a 5x [...] increase and up to a 3.6x [...] improvement compared to the state-of-the-art [...], and outperforms PCIe-based [...]. M-To-M's [...] obtains a 3x [...] reduction, an average 21.1% [...] 61.1% [...] reduction [...] increases. More importantly, the SMEFF architecture provides opportunities to design scalable, cost-effective memory subsystems.
