Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective

Authors: Molei Tao, Tuo Zhao, Yuqing Wang, Kaixuan Huang

DOI:

Keywords:

Abstract: … networks [21], as the width of deep residual networks increases to infinity, training residual networks … , we first briefly review feedforward networks, residual networks and dual kernels …
