Social Semantics And Its Evaluation By Means of Closed Topic Models: An SVM-Classification Approach Using Semantic Feature Replacement By Topic Generalization

作者: Ulli Waltinger , Rüdiger Gleim , Alexander Mehler

DOI:

关键词:

摘要: Text categorization is a fundamental part in many NLP applications. In general, the Vector Space Model, Latent Semantic Analysis and Support Machine implementation have been successfully applied within this area. However, feature extraction most challenging task when conducting experiments. Moreover, sensitive reduction needed order to reduce time space complexity especially deal with singular value decomposition or larger sized text collections. paper we examine of by means closed topic models. We propose replacement technique generalization comprising user generated concepts social ontology. Derived are then subsequently used enhance replace existing features gaining minimum representation twenty concepts. effect each step classification process using large corpus 29,086 texts 30 different categories. addition, offer an easy-to-use web interface as eHumanities Desktop test proposed classifiers.

参考文章(21)
Evgeniy Gabrilovich, Shaul Markovitch, Feature generation for text categorization using world knowledge international joint conference on artificial intelligence. pp. 1048- 1053 ,(2005)
Christiane Fellbaum, An Electronic Lexical Database Cambridge, MA: The MIT Press. ,(1998)
Hirotoshi Taira, Masahiko Haruno, Feature selection in SVM text categorization national conference on artificial intelligence. pp. 480- 486 ,(1999)
Ulli Waltinger, Gerhard Heyer, Alexander Mehler, TOWARDS AUTOMATIC CONTENT TAGGING - Enhanced Web Services in Digital Libraries using Lexical Chaining international conference on web information systems and technologies. pp. 231- 236 ,(2008)
Stephan Bloehdorn, Andreas Hotho, Boosting for text classification with semantic features web mining and web usage analysis. pp. 149- 166 ,(2004) , 10.1007/11899402_10
Ulli Waltinger, Tobias Feith, Dietmar Esch, R. Gleim, Alexandra Ernst, Alexander Mehler, eHumanities Desktop. Eine webbasierte Arbeitsumgebung für die geisteswissenschaftliche Fachinformatik Proceedings of the Symposium "Sprachtechnologie und eHumanities". ,(2009)
Gerard Salton, Automatic text processing: the transformation, analysis, and retrieval of information by computer Addison-Wesley Longman Publishing Co., Inc.. ,(1989)
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)