作者: Stefan Wager , Sida Wang , Percy S Liang
DOI:
关键词: Mathematics 、 Regularization (mathematics) 、 Overfitting 、 Generalized linear model 、 Fisher information 、 Diagonal 、 Document classification 、 Machine learning 、 Scaling 、 Inverse 、 Artificial intelligence
摘要: … , dropout performs a form of adaptive regularization. Using this viewpoint, we show that the dropout … operates by repeatedly solving linear dropout-regularized problems. By casting …