Efficient Coflow Transmission for Distributed Stream Processing

作者: Wenxin Li , Xu Yuan , Wenyu Qu , Heng Qi , Xiaobo Zhou

DOI: 10.1109/INFOCOM41043.2020.9155511

关键词:

摘要: Distributed streaming applications require the underlying network flows to transmit packets continuously keep their output results fresh. These will become stale if no updates come, and staleness is determined by slowest flow. At this point, coflows can be semantically comprised. Hence, efficient coflow transmission critical for applications. However, prior coflow-based solutions have significant limitations. They use a one-shot performance metric—CCT (coflow completion time), which cannot reflect of application.To end, we propose new metric—coflow age (CA), generated distributed The CA tracks longest time-since-last-service among all in coflow. In such context, consider data center with multiple that between source-destination pairs address problem minimizing average long-term while simultaneously satisfying throughput constraints from coflows. To solve efficiently, design randomized algorithm drift-plus-age algorithm, show they make achieve nearly two times arbitrarily close optimal value, respectively. Through extensive simulations, further demonstrate both proposed algorithms significantly reduce coflows, without violating requirement any coflow, when compared state-of-the-art solution.

参考文章(23)
Ariel Rabkin, Matvey Arye, Michael J. Freedman, Vivek S. Pai, Siddhartha Sen, Aggregation and degradation in JetStream: streaming analytics in the wide area networked systems design and implementation. pp. 275- 288 ,(2014) , 10.5555/2616448.2616474
Yuan Yao, Longbo Huang, Abhihshek Sharma, Leana Golubchik, Michael Neely, Data centers power reduction: A two time scale approach for delay tolerant workloads international conference on computer communications. pp. 1431- 1439 ,(2012) , 10.1109/INFCOM.2012.6195508
Mosharaf Chowdhury, Ion Stoica, Coflow: a networking abstraction for cluster applications hot topics in networks. pp. 31- 36 ,(2012) , 10.1145/2390231.2390237
Mosharaf Chowdhury, Ion Stoica, Efficient Coflow Scheduling Without Prior Knowledge acm special interest group on data communication. ,vol. 45, pp. 393- 406 ,(2015) , 10.1145/2785956.2787480
Jonathan Perry, Amy Ousterhout, Hari Balakrishnan, Devavrat Shah, Hans Fugal, Fastpass: a centralized "zero-queue" datacenter network acm special interest group on data communication. ,vol. 44, pp. 307- 318 ,(2014) , 10.1145/2619239.2626309
Mosharaf Chowdhury, Yuan Zhong, Ion Stoica, Efficient coflow scheduling with Varys acm special interest group on data communication. ,vol. 44, pp. 443- 454 ,(2014) , 10.1145/2619239.2626315
Lidong Zhou, Junwei Xu, Sen Yang, Wei Lin, Jingren Zhou, Zhengping Qian, Haochuan Fan, STREAMSCOPE: continuous reliable distributed processing of big data streams networked systems design and implementation. pp. 439- 453 ,(2016)
Yupeng Li, Shaofeng H.-C. Jiang, Haisheng Tan, Chenzi Zhang, Guihai Chen, Jipeng Zhou, Francis C. M. Lau, Efficient online coflow routing and scheduling mobile ad hoc networking and computing. pp. 161- 170 ,(2016) , 10.1145/2942358.2942367