Knowledge discovery and data mining to assist natural language understanding.

作者: George Hripcsak , Adam B. Wilcox

DOI:

关键词:

摘要: As natural language processing systems become more frequent in clinical use, methods for interpreting the output of these programs increasingly important. These require effort a domain expert, who must build specific queries and rules processor output. Knowledge discovery data mining tools can be used instead expert to automatically generate rules. C5.0, decision tree generator, was create rule base understanding system. A general-purpose using this tested on set 200 chest radiograph reports. When small reports, classified by physicians, as training set, generated performed well lay persons, but worse than physicians. larger ICD9 coding classify system, physicians persons. It appears that larger, accurate is needed increase performance method.

参考文章(11)
Peter Spyns, None, Natural language processing in medicine: An overview. Methods of Information in Medicine. ,vol. 35, pp. 285- 301 ,(1996) , 10.1055/S-0038-1634681
C. Ohmann, V. Moustakis, Q. Yang, K. Lang, Acute Abdominal Pain Study Group, Evaluation of automatic knowledge acquisition techniques in the diagnosis of acute abdominal pain Artificial Intelligence in Medicine. ,vol. 8, pp. 23- 36 ,(1996) , 10.1016/0933-3657(95)00018-6
C. Friedman, P. O. Alderson, J. H. M. Austin, J. J. Cimino, S. B. Johnson, A General Natural-language Text Processor for Clinical Radiology Journal of the American Medical Informatics Association. ,vol. 1, pp. 161- 174 ,(1994) , 10.1136/JAMIA.1994.95236146
Gregory Piatetsky-Shapiro, Usama M. Fayyad, Padhraic Smyth, From data mining to knowledge discovery: an overview knowledge discovery and data mining. pp. 1- 34 ,(1996)
George Hripcsak, Carol Friedman, Philip O Alderson, William DuMouchel, Stephen B Johnson, Paul D Clayton, Unlocking Clinical Data from Narrative Reports: A Study of Natural Language Processing Annals of Internal Medicine. ,vol. 122, pp. 681- 688 ,(1995) , 10.7326/0003-4819-122-9-199505010-00007
Michael L. Gundersen, Peter J. Haug, T.Allan Pryor, Rudy van Bree, Spence Koehler, Kay Bauer, Brenda Clemons, Development and evaluation of a computerized admission diagnosis encoding system Computers and Biomedical Research. ,vol. 29, pp. 351- 372 ,(1996) , 10.1006/CBMR.1996.0026
Øivind Braaten, Artificial intelligence in pediatrics: important clinical signs in newborn syndromes Computers and Biomedical Research. ,vol. 29, pp. 153- 161 ,(1996) , 10.1006/CBMR.1996.0013
Pierre Zweigenbaum, None, MENELAS: an access system for medical records using natural language. Computer Methods and Programs in Biomedicine. ,vol. 45, pp. 117- 120 ,(1994) , 10.1016/0169-2607(94)90029-9
William J. Long, John L. Griffith, Harry P. Selker, Ralph B. D'Agostino, A Comparison of Logistic Regression to Decision-Tree Induction in a Medical Domain Computers and Biomedical Research. ,vol. 26, pp. 74- 97 ,(1993) , 10.1006/CBMR.1993.1005