Coded computation for multicore setups

作者: Kangwook Lee , Ramtin Pedarsani , Dimitris Papailiopoulos , Kannan Ramchandran

DOI: 10.1109/ISIT.2017.8006962

关键词:

摘要: Consider a distributed computing setup consisting of master node and n worker nodes, each equipped with p cores, function f (x) = g(f 1 (x), 2 (x),…, fk(x)), where i can be computed independently the rest. Assuming that computational times have exponential tails, what is minimum possible time for f? Can we use coding theory principles to speed up this computation? In [1], it shown linear functions expedited by applying erasure codes. However, not clear if codes computation ‘nonlinear’ as well. To resolve problem, propose sparse codes, exploiting modern multicore processing architecture. We show 1) our solution achieves order optimal runtime, 2) at least Θ(√log n) faster than any uncoded schemes number workers n.

参考文章(13)
Tom Richardson, Ruediger Urbanke, Modern Coding Theory Cambridge University Press. ,(2008) , 10.1017/CBO9780511791338
Manmohan Chaubey, Erik Saule, Replicated Data Placement for Uncertain Scheduling international parallel and distributed processing symposium. pp. 464- 472 ,(2015) , 10.1109/IPDPSW.2015.50
Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Ion Stoica, Randy Katz, Improving MapReduce performance in heterogeneous environments operating systems design and implementation. pp. 29- 42 ,(2008) , 10.5555/1855741.1855744
Scott Shenker, Ali Ghodsi, Ganesh Ananthanarayanan, Ion Stoica, Effective straggler mitigation: attack of the clones networked systems design and implementation. pp. 185- 198 ,(2013)
Jeffrey Dean, Luiz André Barroso, The tail at scale Communications of The ACM. ,vol. 56, pp. 74- 80 ,(2013) , 10.1145/2408776.2408794
Kristen Gardner, Samuel Zbarsky, Sherwin Doroudi, Mor Harchol-Balter, Esa Hyytia, Reducing Latency via Redundant Requests: Exact Analysis measurement and modeling of computer systems. ,vol. 43, pp. 347- 360 ,(2015) , 10.1145/2745844.2745873
Da Wang, Gauri Joshi, Gregory Wornell, Efficient task replication for fast response times in parallel computation measurement and modeling of computer systems. ,vol. 42, pp. 599- 600 ,(2014) , 10.1145/2591971.2592042
Ganesh Ananthanarayanan, Srikanth Kandula, Albert Greenberg, Ion Stoica, Yi Lu, Bikas Saha, Edward Harris, None, Reining in the outliers in map-reduce clusters using Mantri operating systems design and implementation. pp. 265- 278 ,(2010) , 10.5555/1924943.1924962
Jianmin Chen, Xinghao Pan, Rajat Monga, Samy Bengio, Rafal Jozefowicz, Revisiting Distributed Synchronous SGD arXiv: Learning. ,(2016)
Nihar B. Shah, Kangwook Lee, Kannan Ramchandran, When do redundant requests reduce latency allerton conference on communication, control, and computing. pp. 731- 738 ,(2013) , 10.1109/ALLERTON.2013.6736597