A stratified strategy for efficient Kernel-based learning

作者: Roberto Basili , Simone Filice , Danilo Croce

DOI:

关键词: Data miningComputer scienceClassifier (UML)Kernel (linear algebra)Training setSentiment analysisMachine learningSupport vector machineArtificial intelligence

摘要: In Kernel-based Learning the targeted phenomenon is summarized by a set of explanatory examples derived from training set. When model size grows with complexity task, such approaches are so computationally demanding that adoption comprehensive models not always viable. this paper, general framework aimed at minimizing problem proposed: multiple classifiers stratified and dynamically invoked according to increasing levels corresponding incrementally more expressive representation spaces. Computationally expensive inferences thus adopted only when classification lower too uncertain over an individual instance. The application complex functions avoided where possible, significant reduction overall costs. proposed strategy has been integrated within two well-known algorithms: Support Vector Machines Passive-Aggressive Online classifier. A cost (up 90%), negligible performance drop, observed against Natural Language Processing tasks, i.e. Question Classification Sentiment Analysis in Twitter.

参考文章(29)
Simone Filice, Giuseppe Castellucci, Danilo Croce, Roberto Basili, Effective Kernelized Online Learning in Language Processing Tasks Lecture Notes in Computer Science. ,vol. 8416, pp. 347- 358 ,(2014) , 10.1007/978-3-319-06028-6_29
Alessandro Moschitti, Roberto Basili, Danilo Croce, Structured Lexical Similarity via Convolution Kernels on Dependency Trees empirical methods in natural language processing. pp. 1034- 1046 ,(2011)
Nello Cristianini, John Shawe-Taylor, Huma Lodhi, Latent Semantic Kernels international conference on machine learning. ,vol. 18, pp. 127- 152 ,(2001) , 10.1023/A:1013625426931
Nello Cristianini, John Shawe-Taylor, Kernel Methods for Pattern Analysis ,(2004)
Peter Brockhausen, Thorsten Joachims, Katharina Morik, Combining Statistical Learning with a Knowledge-Based Approach - A Case Study in Intensive Care Monitoring international conference on machine learning. pp. 268- 277 ,(1999)
D. Haussler, Convolution kernels on discrete structures Tech. Rep.. ,(1999)
Joakim Nivre, Josef Van Genabith, Jennifer Foster, Deirdre Hogan, Joachim Wagner, Joseph Le Roux, Stephen Hogan, Özlem Çetinoǧlu, #hardtoparse: POS tagging and parsing the twitterverse national conference on artificial intelligence. pp. 20- 25 ,(2011)
Nicolò Cesa-Bianchi, Claudio Gentile, Tracking the Best Hyperplane with a Simple Budget Perceptron Learning Theory. ,vol. 69, pp. 483- 498 ,(2006) , 10.1007/11776420_36
Slobodan Vucetic, Zhuang Wang, Online Passive-Aggressive Algorithms on a Budget international conference on artificial intelligence and statistics. pp. 908- 915 ,(2010)
Alessandro Moschitti, Roberto Basili, Complex Linguistic Features for Text Classification: A Comprehensive Study Lecture Notes in Computer Science. pp. 181- 196 ,(2004) , 10.1007/978-3-540-24752-4_14