Authors: Jingfeng Wu, Wenqing Hu, Haoyi Xiong, Jun Huan, Vladimir Braverman
DOI:
Keywords: Covariance, Matrix (mathematics), Algorithm, Gaussian, Computer science, Deep learning, Artificial intelligence, Noise, Generalization, Sampling (statistics), Gradient noise, Regularization (mathematics), Gradient descent
Abstract: … the impact of the noise class. On the other hand, thanks to the flexibility of choosing noise class, we are allowed to use noisy gradient descent with best fitted noises based on practical …