Dictionary-based concept identification with UMLS

Authors: Max De Wilde, Roser Morante, W. Daelemans

DOI:

Keywords:

Abstract: Short common words such as "an" and "I" were systematically ruled out to avoid their being mistaken for medical abbreviations. Unrecognized words were further stemmed with a Python implementation of the Porter Stemmer in order to identify more terms. It is important to understand that lemmatization was not an option here, as it would have increased processing time dramatically. Each UMLS ID was subsequently mapped to a semantic type, and each type in turn to a broader group, as recommended by the challenge guidelines.
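
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the dictionaries are hypothetical toy data with placeholder concept IDs, the function names are invented for the example, and a crude suffix stripper stands in for the Porter Stemmer so that the block needs no external library.

```python
# Toy stand-ins for the real resources (all entries are hypothetical placeholders).
STOP_WORDS = {"an", "i", "a", "the", "and", "of", "in"}
TERM_TO_ID = {"fever": "ID_FEVER", "infect": "ID_INFECTION"}  # surface forms or stems
ID_TO_TYPE = {"ID_FEVER": "Sign or Symptom", "ID_INFECTION": "Disease or Syndrome"}
TYPE_TO_GROUP = {"Sign or Symptom": "Disorders", "Disease or Syndrome": "Disorders"}


def crude_stem(word):
    """Strip one common suffix. A simplification, NOT the Porter algorithm."""
    for suffix in ("ions", "ion", "ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def identify_concepts(text):
    """Return (token, concept ID, semantic type, semantic group) tuples."""
    results = []
    for raw in text.split():
        token = raw.strip(".,;:!?()").lower()
        # Rule out short common words before any dictionary lookup.
        if not token or token in STOP_WORDS:
            continue
        # Exact lookup first; stem only the tokens that were not recognized.
        concept_id = TERM_TO_ID.get(token)
        if concept_id is None:
            concept_id = TERM_TO_ID.get(crude_stem(token))
        if concept_id is None:
            continue
        # Map the concept ID to its semantic type, then the type to its group.
        sem_type = ID_TO_TYPE[concept_id]
        results.append((token, concept_id, sem_type, TYPE_TO_GROUP[sem_type]))
    return results


matches = identify_concepts("I had an infection and fevers.")
```

Here "I" and "an" are dropped before lookup, while "infection" and "fevers" are only found after the stemming fallback. Stemming is applied just to unrecognized tokens, which keeps the extra cost small compared with lemmatizing every token.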

References (2)
Kristina M Hettne, Erik M van Mulligen, Martijn J Schuemie, Bob JA Schijvenaars, Jan A Kors. Rewriting and suppressing UMLS terms for improved biomedical term identification. Journal of Biomedical Semantics, vol. 1, p. 5 (2010). doi:10.1186/2041-1480-1-5
Tom De Smedt, Vincent van Asch, Walter Daelemans. Memory-based shallow parser for Python (2010).