Improving Convergence of Back-Propagation by Handling Flat-Spots in the Output Layer

作者: Karthik Balakrishnan , Vasant Honavar

DOI: 10.1016/B978-0-444-89488-5.50039-7

关键词: Sigmoid functionAlgorithmSpeedupComputer scienceBackpropagationBenchmark (computing)Convergence (routing)Learning ruleArtificial neural networkSet (abstract data type)

摘要: Abstract Back-propagation (BP) [9, 5] is one of the most widely used procedures for training multi-layer artificial neural networks with sigmoid units. Though successful in a number applications, its convergence to set desired weights can be excruciatingly slow. Several modifications have been proposed improving learning speed [2, 4, 8, 1, 6]. The phenomenon flat-spots known play significant role slow BP [2]. formulation Learning rule prevents network from effectively presence flat-spots. In this paper we propose new approach minimize error such that occurring output layer are appropriately handled, thereby permitting learn even improvement provided by technique demonstrated on standard benchmark data-sets. More importantly, speedup obtained little or no increase computational requirements each iteration.

参考文章(6)
P. Werbos, Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences Ph. D. dissertation, Harvard University. ,(1974)
J. Yang, V. Honavar, Experiments with the cascade-correlation algorithm international joint conference on neural network. pp. 2428- 2433 ,(1991) , 10.1109/IJCNN.1991.170752
Tariq Samad, Back propagation with expected source values Neural Networks. ,vol. 4, pp. 615- 618 ,(1991) , 10.1016/0893-6080(91)90015-W
Vasant Honavar, Leonard Uhr, Generative learning structures and processes for generalized connectionist networks Information Sciences. ,vol. 70, pp. 75- 108 ,(1993) , 10.1016/0020-0255(93)90049-R
Vasant G. Honavar, Leonard Uhr, Generative learning structures for generalized connectionist networks The University of Wisconsin - Madison. ,(1990)
Jude W. Shavlik, Raymond J. Mooney, Geoffrey G. Towell, Symbolic and neural learning algorithms: an experimental comparison Machine Learning. ,vol. 6, pp. 111- 143 ,(1991) , 10.1023/A:1022602303196