Source-selection-free transfer learning

作者: Evan Wei Xiang , Qiang Yang , Sinno Jialin Pan , Weike Pan , Jian Su

DOI: 10.5591/978-1-57735-516-8/IJCAI11-392

关键词:

摘要: Transfer learning addresses the problems that labeled training data are insufficient to produce a high-performance model. Typically, given target task, most transfer approaches require select one or more auxiliary tasks as sources by designers. However, how right source enable effective knowledge automatically is still an unsolved problem, which limits applicability of learning. In this paper, we take step ahead and propose novel framework, known source-selection-free (SSFTL), free users from need domains. Instead asking for pairs, traditional does, SSFTL turns some online information such World Wide Web Wikipedia help. The can be hidden somewhere within large source, but do not know where they are. Based on sources, train number classifiers. Then, bridge built labels potential candidates domain in via social media with tag cloud label translator. An added advantage that, unlike many previous approaches, difficult scale up scale, highly scalable offset much work offline stage. We demonstrate effectiveness efficiency through extensive experiments several real-world datasets text classification.

参考文章(17)
Xiaoxiao Shi, Wei Fan, Qiang Yang, Jiangtao Ren, Relaxed Transfer of Different Classes via Spectral Partition european conference on machine learning. pp. 366- 381 ,(2009) , 10.1007/978-3-642-04174-7_24
Fan R K Chung, Spectral Graph Theory ,(1996)
Peter Prettenhofer, Benno Stein, Cross-Lingual Adaptation Using Structural Correspondence Learning ACM Transactions on Intelligent Systems and Technology. ,vol. 3, pp. 13- ,(2011) , 10.1145/2036264.2036277
Jing Gao, Wei Fan, Jing Jiang, Jiawei Han, Knowledge transfer via multiple model local structure mapping Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 283- 291 ,(2008) , 10.1145/1401890.1401928
Lixin Duan, Ivor W. Tsang, Dong Xu, Tat-Seng Chua, Domain adaptation from multiple sources via auxiliary classifiers Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 289- 296 ,(2009) , 10.1145/1553374.1553411
Mikhail Belkin, Partha Niyogi, Laplacian Eigenmaps for dimensionality reduction and data representation Neural Computation. ,vol. 15, pp. 1373- 1396 ,(2003) , 10.1162/089976603321780317
Novi Quadrianto, Alex J. Smola, Tibério S. Caetano, James Petterson, S.v.n. Vishwanathan, Multitask Learning without Label Correspondences neural information processing systems. ,vol. 23, pp. 1957- 1965 ,(2010)
Kilian Q. Weinberger, Olivier Chapelle, Large Margin Taxonomy Embedding for Document Categorization neural information processing systems. ,vol. 21, pp. 1737- 1744 ,(2008)
Sinno Jialin Pan, Xiaochuan Ni, Jian-Tao Sun, Qiang Yang, Zheng Chen, Cross-domain sentiment classification via spectral feature alignment the web conference. pp. 751- 760 ,(2010) , 10.1145/1772690.1772767
Samy Bengio, Jason Weston, David Grangier, Label Embedding Trees for Large Multi-Class Tasks neural information processing systems. ,vol. 23, pp. 163- 171 ,(2010)