作者: Michal Konkol , Miloslav Konopík
DOI: 10.1007/978-3-319-24033-6_7
关键词:
摘要: In this paper we study the effects of various segment representations in named entity recognition NER task. The representation is responsible for mapping multi-word entities into classes used chosen machine learning approach. Usually, choice a system arbitrary without proper tests. Some authors presented comparisons different such as BIO, BIEO, BILOU and usually compared only two representations. Our goal to show, that problem more complex selection best approach not straightforward. We provide experiments with wide set All are tested using popular algorithms: Conditional Random Fields Maximum Entropy. Furthermore, tests done on four languages, namely English, Spanish, Dutch Czech.