Enhancement of Medical Named Entity Recognition Using Graph-Based Features

作者: Sara Keretna , Chee Peng Lim , Doug Creighton

DOI: 10.1109/SMC.2015.331

关键词:

摘要: Named Entity Recognition (NER) is a crucial step in text mining. This paper proposes new graph-based technique for representing unstructured medical text. The representation used to extract discriminative features that are able enhance the NER performance. To evaluate usefulness of proposed technique, i2b2 medication challenge data set used. Specifically, 'treatment' named entities extracted evaluation using six different classifiers. F-measure results five classifiers enhanced, with an average improvement up 26%

参考文章(29)
George Hripcsak, Adam S Rothschild, Agreement, the F-Measure, and Reliability in Information Retrieval Journal of the American Medical Informatics Association. ,vol. 12, pp. 296- 298 ,(2005) , 10.1197/JAMIA.M1733
Yudong Cai, Yuan Yuan, Liuhuan Dong, Using Bagging classifier to predict protein domain structural class. Journal of Biomolecular Structure & Dynamics. ,vol. 24, pp. 239- 242 ,(2006)
Chikashi Nobata, Satoshi Sekine, Kiyoshi Sudo, Extended Named Entity Hierarchy language resources and evaluation. ,(2002)
K Fukuda, T Tsunoda, A Tamura, T Takagi, Toward information extraction: identifying protein names from biological papers. pacific symposium on biocomputing. pp. 707- 718 ,(1998)
Fangda Wang, Hongzhi Wang, Jianzhong Li, Hong Gao, Graph-based reference table construction to facilitate entity matching Journal of Systems and Software. ,vol. 86, pp. 1679- 1688 ,(2013) , 10.1016/J.JSS.2013.02.026
Michal Konkol, Tomáš Brychcín, Miloslav Konopík, Latent semantics in Named Entity Recognition Expert Systems With Applications. ,vol. 42, pp. 3470- 3479 ,(2015) , 10.1016/J.ESWA.2014.12.015
Yoshimasa Tsuruoka, Jun’ichi Tsujii, Improving the performance of dictionary-based approaches in protein name recognition Journal of Biomedical Informatics. ,vol. 37, pp. 461- 470 ,(2004) , 10.1016/J.JBI.2004.08.003
Lev Ratinov, Dan Roth, Design Challenges and Misconceptions in Named Entity Recognition conference on computational natural language learning. pp. 147- 155 ,(2009) , 10.3115/1596374.1596399
Sara Keretna, Chee Peng Lim, Doug Creighton, Khaled Bashir Shaban, Enhancing medical named entity recognition with an extended segment representation technique Computer Methods and Programs in Biomedicine. ,vol. 119, pp. 88- 100 ,(2015) , 10.1016/J.CMPB.2015.02.007
O. TUASON, L. CHEN, H. LIU, J.A BLAKE, C. FRIEDMAN, Biological nomenclatures: a source of lexical knowledge and ambiguity. pacific symposium on biocomputing. pp. 238- 249 ,(2003) , 10.1142/9789812704856_0023