作者: Steen Larsen , Ben Lee , Jin-Hyuk Yoon , Jae-Yeun Yun
关键词:
摘要: Current I/O devices communicate based on the PCIe protocol, and by default, all traffic passes through CPU-memory complex. However, this approach causes bottleneck in system throughput, which increases latency power as CPU processes device specific protocols to move data between devices. This paper examines cost of centralized proposes a new method perform direct device-to-device communication. Our proof-of-concept implementation using NetFPGA shows that can be reduced more than 2×, utilization up 18%, decreased 31 W.