A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits (CMU-PDL-09-101)

Authors: Gregory R. Ganger, Amar Phanishayee, Elie Krevat, Vijay Vasudevan, Hiral Shah

DOI:

Keywords:

Abstract: This paper presents a practical solution to the problem of high-fan-in, high-bandwidth synchronized TCP workloads in datacenter Ethernets: the Incast problem. In these networks, receivers often experience a drastic reduction in throughput when simultaneously requesting data from many servers using TCP. Inbound data overfills small switch buffers, leading to TCP timeouts lasting hundreds of milliseconds. For many datacenter workloads that have a synchronization requirement (e.g., filesystem reads and parallel data-intensive queries), incast can reduce throughput by up to 90%. Our solution for incast uses high-resolution timers in TCP to allow for microsecond-granularity timeouts. We show that this technique is effective in avoiding incast using simulation and real-world experiments. Last, we show that eliminating the minimum retransmission timeout bound is safe for all environments, including the wide-area.

Acknowledgements: We would like to thank Brian Mueller at Panasas Inc. for helping us to conduct experiments on their systems. We also would like to thank our partners in the Petascale Data Storage Institute, including Andrew Shewmaker, HB Chen, Parks Fields, Gary Grider, Ben McClelland, and James Nunez at Los Alamos National Lab, for help with obtaining packet header traces. We thank the members and companies of the PDL Consortium (including APC, Cisco, Data Domain, EMC, Facebook, Google, Hewlett-Packard, Hitachi, IBM, Intel, LSI, NetApp, Oracle, Seagate, Sun Microsystems, Symantec, and VMware) for their interest, insights, feedback, and support. Finally, we'd like to thank Michael Stroucken for his help managing the PDL cluster. This material is based on research sponsored in part by the National Science Foundation, via grants #CNS-0546551, #CNS-0326453 and #CCF-0621499, by the Army Research Office under agreement number DAAD19–02–1–0389, by the Department of Energy under Award Number #DE-FC02-06ER25767, and by DARPA under grant #HR00110710025.
