Withdrawing an example from the training set: An analytic estimation of its effect on a non-linear parameterised model

作者: Gaétan Monari , Gérard Dreyfus

DOI: 10.1016/S0925-2312(00)00325-8

关键词:

摘要: Abstract For a non-linear parameterised model, the effects of withdrawing an example from training set can be predicted. We focus on prediction error left-out example, and confidence interval for this example. derive rigorous expression first-order expansion, in parameter space, gradient quadratic cost function, specify its validity conditions. As consequence, we approximate expressions given thereof, had been withdrawn set. show that influence model summarised by single parameter. These results are applicable to leave-one-out cross-validation, with considerable decrease computation time respect conventional leave-one-out. The paper focuses theoretical aspects question; both academic illustrations large-scale industrial examples described [9].

参考文章(10)
Christopher M. Bishop, Neural networks for pattern recognition ,(1995)
Lars Kai Hansen, Jan Larsen, Linear unlearning for cross-validation Advances in Computational Mathematics. ,vol. 5, pp. 269- 280 ,(1996) , 10.1007/BF02124747
Leo Breiman, Heuristics of instability and stabilization in model selection Annals of Statistics. ,vol. 24, pp. 2350- 2383 ,(1996) , 10.1214/AOS/1032181158
Babak Hassibi, David G. Stork, Second order derivatives for network pruning: Optimal Brain Surgeon neural information processing systems. ,vol. 5, pp. 164- 171 ,(1992)
Robert Tibshirani, A comparison of some error estimates for neural network models Neural Computation. ,vol. 8, pp. 152- 163 ,(1996) , 10.1162/NECO.1996.8.1.152
A. J. Lawrance, Deletion Influence and Masking in Regression Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 57, pp. 181- 189 ,(1995) , 10.1111/J.2517-6161.1995.TB02023.X
Jan Larsen, Lars Kai Hansen, Paul Haase Sørensen, Peter Magnus Nørgård, Cross validation in LULOO international conference on neural information processing. ,(1996)
CJ Seber, GAF, and Wild, None, Nonlinear Regression Wiley Series in Probability and Statistics. ,(1989) , 10.1002/0471725315