Characterization of training errors in supervised learning using gradient-based rules

作者: Jun Wang , B. Malakooti

DOI: 10.1016/S0893-6080(09)80019-1

关键词:

摘要: In the majority of existing supervised learning paradigms, a neural network is trained by minimizing an error function using rule. The commonly used rules are gradient-based such as popular backpropagation algorithm. This paper addresses important issue on minimization in networks rules. characterizes asymptotic properties training errors for various forms and discusses their practical implications designing via remarks examples. analytical results presented this reveal dependency quality rank samples associated steady activation stales. also complexity achieving zero error.

参考文章(18)
Eduardo D. Sontag, Héctor J. Sussmann, Backpropagation Can Give Rise to Spurious Local Minima Even for Networks without Hidden Layers. Complex Systems. ,vol. 3, ,(1989)
J. Wang, E.P. Teixeira, On the design principles of the functional link nets IEEE International Conference on Systems Engineering. pp. 613- 616 ,(1990) , 10.1109/ICSYSE.1990.203232
Ken-Ichi Funahashi, On the approximate realization of continuous mappings by neural networks Neural Networks. ,vol. 2, pp. 183- 192 ,(1989) , 10.1016/0893-6080(89)90003-8
Fernando J. Pineda, Recurrent backpropagation and the dynamical approach to adaptive neural computation Neural Computation. ,vol. 1, pp. 161- 172 ,(1989) , 10.1162/NECO.1989.1.2.161
B. W. White, Frank Rosenblatt, PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS American Journal of Psychology. ,vol. 76, pp. 705- ,(1961) , 10.2307/1419730
Ronald J. Williams, David Zipser, A learning algorithm for continually running fully recurrent neural networks Neural Computation. ,vol. 1, pp. 270- 280 ,(1989) , 10.1162/NECO.1989.1.2.270
M. Gori, A. Tesi, On the problem of local minima in backpropagation IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 14, pp. 76- 86 ,(1992) , 10.1109/34.107014
Jun Wang, B. Malakooti, A feedforward neural network for multiple criteria decision making Computers & Operations Research. ,vol. 19, pp. 151- 167 ,(1992) , 10.1016/0305-0548(92)90089-N
M.L. Brady, R. Raghavan, J. Slawny, Back propagation fails to separate where perceptrons succeed IEEE Transactions on Circuits and Systems. ,vol. 36, pp. 665- 674 ,(1989) , 10.1109/31.31314
C. Lee Giles, Tom Maxwell, Learning, invariance, and generalization in high-order neural networks Applied Optics. ,vol. 26, pp. 4972- 4978 ,(1987) , 10.1364/AO.26.004972