System and method for domain adaption with partial observation

作者: Richard D. Lawrence , Vijil E. Chenthamarakshan , Yan Liu , Dan Zhang

DOI:

关键词:

摘要: System, method and computer program product provides a novel domain adaption/transfer learning approach applied to the problem of classifying abbreviated documents, e.g., short text messages, instant tweets. The proposed uses large number multi-labeled examples (source domain) improve on partial observations (target domain). Specifically, hidden, higher-level abstraction space is learned that meaningful for in source domain. This done by simultaneously minimizing document reconstruction error classification model hidden using known labels from target are then mapped same space, classified into label determined Exemplary results provided Twitter dataset demonstrate identifies topics useful classifications specific

参考文章(10)
Sudarshan Lamkhede, Srinivas Vadrevu, Anne Ya Zhang, Bo Long, Belle Tseng, System and method for cross domain learning for data augmentation ,(2009)
John P. Pestian, Robert A. Kowatch, Pawel Matykiewicz, Jacqueline M. Grupp-Phelan, Wlodzislaw Duch, Tracy A. Glauser, Processing text with domain-specific spreading activation methods ,(2008)
Yuchun Tang, Yan-Qing Zhang, N.V. Chawla, S. Krasser, SVMs Modeling for Highly Imbalanced Classification systems man and cybernetics. ,vol. 39, pp. 281- 288 ,(2009) , 10.1109/TSMCB.2008.2002909
Ritendra Datta, Dhiraj Joshi, Jia Li, James Z. Wang, Tagging over time Proceedings of the 15th international conference on Multimedia - MULTIMEDIA '07. pp. 393- 402 ,(2007) , 10.1145/1291233.1291328
Andrew Arnold, William W. Cohen, Intra-document structural frequency features for semi-supervised domain adaptation Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08. pp. 1291- 1300 ,(2008) , 10.1145/1458082.1458253
Qiang Yang, James T. Kwok, Sinno Jialin Pan, Dou Shen, Transferring localization models across space national conference on artificial intelligence. ,vol. 3, pp. 1383- 1388 ,(2008)
Rajat Raina, Alexis Battle, Honglak Lee, Benjamin Packer, Andrew Y. Ng, Self-taught learning: transfer learning from unlabeled data international conference on machine learning. pp. 759- 766 ,(2007) , 10.1145/1273496.1273592
Dan Zhang, Yan Liu, Richard D. Lawrence, Vijil Chenthamarakshan, ALPOS: A Machine Learning Approach for Analyzing Microblogging Data international conference on data mining. pp. 1265- 1272 ,(2010) , 10.1109/ICDMW.2010.154
Mark Dredze, John Blitzer, Fernando Pereira, Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification meeting of the association for computational linguistics. pp. 440- 447 ,(2007)
Higham Nj, Accuracy and Stability of Numerical Algorithms Society for Industrial and Applied Mathematics; 2002.. ,(2002)