作者: George Gkotsis , Sumithra Velupillai , Anika Oellrich , Harry Dean , Maria Liakata
DOI: 10.18653/V1/W16-0310
关键词: Natural language processing 、 Set (psychology) 、 Information extraction 、 Computer science 、 Binary classification 、 Classifier (UML) 、 Parsing 、 Artificial intelligence 、 Negation 、 Domain knowledge
摘要: Mental Health Records (MHRs) contain freetext documentation about patients’ suicide and suicidality. In this paper, we address the problem of determining whether grammatic variants (inflections) word “suicide” are affirmed or negated. To achieve this, populate annotate a dataset with over 6,000 sentences originating from large repository MHRs. The resulting has high InterAnnotator Agreement ( 0.93). Furthermore, develop propose negation detection method that leverages syntactic features text 1 . Using parse trees, build set basic rules rely on minimum domain knowledge render as binary classification (affirmed vs. negated). Since overall goal is to identify patients who expected be at risk suicide, focus evaluation positive (affirmed) cases determined by our classifier. Our approach yields recall (sensitivity) value 94.6% for an accuracy 91.9%. We believe can integrated other clinical Natural Language Processing tools in order further advance information extraction capabilities.