Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.

作者: Lionel T. E. Cheng , Jiaping Zheng , Guergana K. Savova , Bradley J. Erickson

DOI: 10.1007/S10278-009-9215-7

关键词:

摘要: Information in electronic medical records is often an unstructured free-text format. This format presents challenges for expedient data retrieval and may fail to convey important findings. Natural language processing (NLP) emerging technique rapid efficient clinical retrieval. While proven disease detection, the utility of NLP discerning progression from reports untested. We aimed (1) assess whether radiology contained sufficient information tumor status classification; (2) develop NLP-based extraction tool determine reports; (3) compare human classification outcomes. Consecutive follow-up brain magnetic resonance imaging (2000–­2007) a tertiary center were manually annotated using consensus guidelines on status. Reports randomized training (70%) or testing (30%) groups. The utilized support vector machines model with statistical rule-based Most had classification, although 0.8% did not describe despite reference prior examinations. Tumor size was unreported 68.7% documents, while 50.3% lacked change magnitude when there detectable regression. Using retrospective as gold standard, achieved 80.6% sensitivity 91.6% specificity determination (mean positive predictive value, 82.4%; negative 92.0%). In conclusion, most determination, though variable features used demonstrated good accuracy have novel application automated databases.

参考文章(33)
Christopher G. Chute, Serguei V Pakhomov, Veronique L. Roger, Susan A. Weston, Ryan Meverden, Steven J. Jacobsen, Electronic medical records for clinical research: application to the identification of heart failure. The American Journal of Managed Care. ,vol. 13, pp. 281- 288 ,(2007)
George Hripcsak, Adam S Rothschild, Agreement, the F-Measure, and Reliability in Information Retrieval Journal of the American Medical Informatics Association. ,vol. 12, pp. 296- 298 ,(2005) , 10.1197/JAMIA.M1733
N Cristianini, Icg Campbell, K Veropoulos, Controlling the Sensitivity of Support Vector Machines pp. 55- 60 ,(1999)
Yi Lin, Yoonkyung Lee, Grace Wahba, Support Vector Machines for Classification in Nonstandard Situations Machine Learning. ,vol. 46, pp. 191- 202 ,(2002) , 10.1023/A:1012406528296
I. A. McCowan, D. C. Moore, A. N. Nguyen, R. V. Bowman, B. E. Clarke, E. E. Duhig, M.-J. Fry, Collection of Cancer Stage Data by Classifying Free-text Medical Reports Journal of the American Medical Informatics Association. ,vol. 14, pp. 736- 745 ,(2007) , 10.1197/JAMIA.M2130
Paul Hanbury, Will Bridewell, Gregory F. Cooper, Wendy W. Chapman, Bruce G. Buchanan, Evaluation of negation phrases in narrative clinical reports. american medical informatics association annual symposium. pp. 105- 109 ,(2001)
George Hripcsak, Carol Friedman, Philip O Alderson, William DuMouchel, Stephen B Johnson, Paul D Clayton, Unlocking Clinical Data from Narrative Reports: A Study of Natural Language Processing Annals of Internal Medicine. ,vol. 122, pp. 681- 688 ,(1995) , 10.7326/0003-4819-122-9-199505010-00007
N. Reed Dunnick, Curtis P. Langlotz, The radiology report of the future: a summary of the 2007 Intersociety Conference. Journal of The American College of Radiology. ,vol. 5, pp. 626- 629 ,(2008) , 10.1016/J.JACR.2007.12.015
Chris L. Sistrom, Curtis P. Langlotz, A framework for improving radiology reporting. Journal of The American College of Radiology. ,vol. 2, pp. 159- 167 ,(2005) , 10.1016/J.JACR.2004.06.015
Ricky K. Taira, Stephen G. Soderland, Rex M. Jakobovits, Automatic Structuring of Radiology Free-Text Reports Radiographics. ,vol. 21, pp. 237- 245 ,(2001) , 10.1148/RADIOGRAPHICS.21.1.G01JA18237