Authors: Tie-Yan Liu, Qi Meng, Shiqi Gong, Zhi-Ming Ma, Wei Chen
DOI:
Keywords:
Abstract: … with state-dependent noise. Specifically, we show that the covariance of the noise of SGD in … Inspired by our theory, we propose to add additional state-dependent noise into (large-batch…