Task-similarity Aware Meta-learning through Nonparametric Kernel Regression

作者: Bo Wahlberg , Arun Venkitaraman , Anders Hansson

DOI:

关键词:

摘要: This paper investigates the use of nonparametric kernel-regression to obtain a task- similarity aware meta-learning algorithm. Our hypothesis is that helps when available tasks are limited and may contain outlier/ dissimilar tasks. While existing approaches implicitly assume as being similar, it generally unclear how this task-similarity could be quantified used in learning. As result, most popular meta- learning do not actively similarity/dissimilarity between tasks, but rely on availability huge number for their working. contribution novel framework explicitly uses form kernels an associated We model task-specific parameters belong reproducing kernel Hilbert space where function captures across The proposed algorithm iteratively learns meta-parameter which assign descriptor every task. task descriptors then quantify through function. show our approach conceptually generalizes model-agnostic (MAML) Meta-stochastic gradient descent (Meta-SGD) approaches. Numerical experiments with regression classification outperforms these limited, even presence out- lier or supports improve performance task-limited adverse settings.

参考文章(32)
Christopher M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics) Springer-Verlag New York, Inc.. ,(2006)
Ethem Alpaydın, Mehmet Gönen, Multiple Kernel Learning Algorithms Journal of Machine Learning Research. ,vol. 12, pp. 2211- 2268 ,(2011)
Simon J. Godsill, Arnaud Doucet, Jaco Vermaak, Sequential Bayesian Kernel Regression neural information processing systems. ,vol. 16, pp. 113- 120 ,(2003)
Gunnar Rätsch, Sören Sonnenburg, Christin Schäfer, A General and Efficient Multiple Kernel Learning Algorithm neural information processing systems. ,vol. 18, pp. 1273- 1280 ,(2005)
Sinno Jialin Pan, Qiang Yang, A Survey on Transfer Learning IEEE Transactions on Knowledge and Data Engineering. ,vol. 22, pp. 1345- 1359 ,(2010) , 10.1109/TKDE.2009.191
Thomas Hofmann, Bernhard Schölkopf, Alexander J. Smola, Kernel methods in machine learning Annals of Statistics. ,vol. 36, pp. 1171- 1220 ,(2008) , 10.1214/009053607000000677
Jing Lu, Steven CH Hoi, Jialei Wang, Peilin Zhao, Zhi-Yong Liu, None, Large scale online kernel learning Journal of Machine Learning Research. ,vol. 17, pp. 1613- 1655 ,(2016) , 10.5555/2946645.2946692
Matthew Botvinick, Sergey Bartunov, Daan Wierstra, Adam Santoro, Timothy Lillicrap, Meta-learning with memory-augmented neural networks international conference on machine learning. pp. 1842- 1850 ,(2016)
Kevin Swersky, Richard S. Zemel, Jake Snell, Prototypical Networks for Few-shot Learning neural information processing systems. ,vol. 30, pp. 4077- 4087 ,(2017)
Pieter Abbeel, Chelsea Finn, Sergey Levine, Model-agnostic meta-learning for fast adaptation of deep networks international conference on machine learning. pp. 1126- 1135 ,(2017)