Discovering interobserver variability in the cytodiagnosis of breast cancer using decision trees and Bayesian networks

作者: Nicandro Cruz-Ramírez , Héctor-Gabriel Acosta-Mesa , Humberto Carrillo-Calvet , Rocío-Erandi Barrientos-Martínez

DOI: 10.1016/J.ASOC.2009.05.004

关键词: Machine learningDecision support systemBreast cancerComputer scienceBayesian networkDecision treePattern recognitionClassifier (UML)Artificial intelligenceSoftware

摘要: We evaluate the performance of two decision tree procedures and four Bayesian network classifiers as potential support systems in cytodiagnosis breast cancer. In order to test their thoroughly, we use real-world databases containing 692 cases 322 collected by a single observer 19 observers, respectively. The results show that, general, there are considerable differences all tests (accuracy, sensitivity, specificity, PV+, PV- ROC) when specific classifier uses single-observer dataset compared those this same multiple-observer dataset. These suggest that different observers see things: problem known interobserver variability. graphically unveil such presenting structures trees networks resultant from running both databases.

参考文章(45)
Yoshiyuki Murata, Shoichi Nagatomo, Akihiro Tsukamoto, Data management systems ,(2001)
Nir Friedman, Daphne Koller, Learning Bayesian Networks from Data ,(2001)
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Vipin Kumar, Pang-Ning Tan, Michael M. Steinbach, Introduction to Data Mining ,(2013)
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Peter D. Grünwald, In Jae Myung, Mark A. Pitt, Advances in Minimum Description Length: Theory and Applications MIT Press. ,(2005)
Ron Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection international joint conference on artificial intelligence. ,vol. 2, pp. 1137- 1143 ,(1995)