Improving neural networks by preventing co-adaptation of feature detectors

作者: Ilya Sutskever , Geoffrey E. Hinton , Alex Krizhevsky , Ruslan R. Salakhutdinov , Nitish Srivastava

DOI:

关键词: Computer scienceCognitive neuroscience of visual object recognitionContext (language use)Dropout (neural networks)OverfittingArtificial neural networkFeature (computer vision)Feedforward neural networkBenchmark (computing)Training setMachine learningArtificial intelligence

摘要: When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This "overfitting" is greatly reduced by randomly omitting half …

参考文章(18)
Matthew Zeiler, Rob Fergus, Li Wan, Yann Le Cun, Sixin Zhang, Regularization of Neural Networks using DropConnect international conference on machine learning. pp. 1058- 1066 ,(2013)
David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams, Learning representations by back-propagating errors Nature. ,vol. 323, pp. 696- 699 ,(1988) , 10.1038/323533A0
Geoffrey Hinton, Radford M. Neal, Bayesian learning for neural networks ,(1995)
Abdel-rahman Mohamed, George E. Dahl, Geoffrey Hinton, Acoustic Modeling Using Deep Belief Networks IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 20, pp. 14- 22 ,(2012) , 10.1109/TASL.2011.2109382
Jorge Sanchez, Florent Perronnin, High-dimensional signature compression for large-scale image classification CVPR 2011. pp. 1665- 1672 ,(2011) , 10.1109/CVPR.2011.5995504
Geoffrey E Hinton, Ruslan R Salakhutdinov, Reducing the Dimensionality of Data with Neural Networks Science. ,vol. 313, pp. 504- 507 ,(2006) , 10.1126/SCIENCE.1127647
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, Li Fei-Fei, ImageNet: A large-scale hierarchical image database computer vision and pattern recognition. pp. 248- 255 ,(2009) , 10.1109/CVPR.2009.5206848
Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition Proceedings of the IEEE. ,vol. 86, pp. 2278- 2324 ,(1998) , 10.1109/5.726791
Geoffrey E. Hinton, Training products of experts by minimizing contrastive divergence Neural Computation. ,vol. 14, pp. 1771- 1800 ,(2002) , 10.1162/089976602760128018
A. Livnat, C. Papadimitriou, J. Dushoff, M. W. Feldman, A mixability theory for the role of sex in evolution. Proceedings of the National Academy of Sciences of the United States of America. ,vol. 105, pp. 19803- 19808 ,(2008) , 10.1073/PNAS.0803596105