A consensus-based decentralized training algorithm for deep neural networks with communication compression

作者: Zhengtao Ding , Bo Liu

DOI: 10.1016/J.NEUCOM.2021.01.020

关键词:

摘要: Abstract Facing the challenge of distributed computing on processing large-scale data, this paper proposes a consensus-based decentralized training method with communication compression. First, is designed based topology to reduce burden busiest agent and avoid any revealing its locally stored data. The convergence algorithm then analyzed, which demonstrates that trained model can reach minimal empirical risk whole dataset, without sharing data samples. Furthermore, compression combined error-compensated considered costs during process. At last, simulation study shows proposed applicable for both IID non-IID datasets, exhibits much better performance than local method. Besides, an appropriate rate comparable centralized training, while saving lot costs.

参考文章(31)
Yuncheng Li, Yijun Huang, Xiangru Lian, Ji Liu, Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization arXiv: Optimization and Control. ,(2015)
Simone Scardapane, Dianhui Wang, Massimo Panella, A decentralized training algorithm for Echo State Networks in distributed big data applications Neural Networks. ,vol. 78, pp. 65- 74 ,(2016) , 10.1016/J.NEUNET.2015.07.006
Tianqing Zhu, Ping Xiong, Gang Li, Wanlei Zhou, None, Correlated Differential Privacy: Hiding Information in Non-IID Data Set IEEE Transactions on Information Forensics and Security. ,vol. 10, pp. 229- 242 ,(2015) , 10.1109/TIFS.2014.2368363
R. Olfati-Saber, R.M. Murray, Consensus problems in networks of agents with switching topology and time-delays IEEE Transactions on Automatic Control. ,vol. 49, pp. 1520- 1533 ,(2004) , 10.1109/TAC.2004.834113
Alex J Smola, David G Andersen, Kai Yu, Mu Li, Communication Efficient Distributed Machine Learning with the Parameter Server neural information processing systems. ,vol. 27, pp. 19- 27 ,(2014)
Borja Peleato, Stephen Boyd, Neal Parikh, Jonathan Eckstein, Eric Chu, Distributed Optimization and Statistical Learning Via the Alternating Direction Method of Multipliers ,(2011)
Amir Vahid Dastjerdi, Rajkumar Buyya, None, Fog Computing: Helping the Internet of Things Realize Its Potential IEEE Computer. ,vol. 49, pp. 112- 116 ,(2016) , 10.1109/MC.2016.245
Jakub Konečný, Ananda Theertha Suresh, Dave Bacon, Felix X. Yu, Peter Richtarik, H. Brendan McMahan, Federated Learning: Strategies for Improving Communication Efficiency arXiv: Learning. ,(2016)
Ananda Theertha Suresh, Felix X. Yu, Sanjiv Kumar, H. Brendan McMahan, Distributed mean estimation with limited communication international conference on machine learning. pp. 3329- 3337 ,(2017)