Maximum entropy named entity recognition for Czech language

Authors: Michal Konkol, Miloslav Konopík

DOI: 10.1007/978-3-642-23538-2_26

Abstract: Named Entity Recognition (NER) is an important preprocessing tool for many Natural Language Processing tasks such as Information Retrieval, Question Answering, and Machine Translation. This paper focuses on NER for the Czech language. The proposed recognizer is based on knowledge and experience acquired on other languages and adapted to Czech. Our recognizer outperforms the previously introduced recognizers. The article also examines the use of semantic spaces for NER. Although no significant improvement has yet been achieved in this way, we believe that the research is worth sharing.
