Authors: Razvan Pascanu, Grzegorz Swirszcz, Wojciech Marian Czarnecki
DOI:
Keywords: Value (mathematics), Artificial neural network, Nonlinear system, Initialization, Mathematics, Maxima and minima, Weight space, Training (civil), Mathematical optimization, Error surface
Abstract: There has been a lot of recent interest in trying to characterize the error surface of deep models. This stems from a long-standing question. Given that deep networks are highly …