Named Entity Recognition for Highly Inflectional Languages: Effects of Various Lemmatization and Stemming Approaches

Authors: Michal Konkol, Miloslav Konopík

DOI: 10.1007/978-3-319-10816-2_33

Abstract: In this paper, we study the effects of various lemmatization and stemming approaches on the named entity recognition (NER) task for Czech, a highly inflectional language. Lemmatizers are seen as a necessary component of Czech NER systems, and they have been used in all published papers on Czech NER so far. Thus, it is of utmost importance to explore their benefits and limits, as well as the differences between simple and complex methods. Our experiments are evaluated on the standard Czech Named Entity Corpus 1.1 as well as the newly created 2.0 version.