Computer-based probabilistic-network construction

作者: Edward Herskovits

DOI:

关键词:

摘要: Faced with increasing amounts of data that they cannot analyze manually, biomedical researchers have turned increasingly to computational methods for exploring large databases. In particular, might benefit from a nonparametric, efficient, computer-based method determining the important associations among variables in domain, particularly when human expertise is not readily available. this dissertation, I demonstrate such algorithms are conceptually feasible, robust noise, computationally theoretically sound, and generate models can classify new cases accurately. I first describe two take as input database optional user-supplied prior knowledge, probabilistic network--in belief network--as output. The may incomplete data, contain noise. resulting network be used determine poorly understood or classifier were learning. After describing algorithms, present simple examples how these programs database. then results evaluating on databases several domains, including gynecologic pathology, lymph-node DNA-sequence analysis, poisonous-mushroom classification. most cases, networks test high accuracy. In addition discussing empirical results, an overview proofs based metrics will, number increases without limit, always prefer those more closely approximate true underlying distribution database; is, asymptotically correct. I conclude discussion work's contributions, list open research problems.

参考文章(69)
Jose M. Bernardo, Reference Posterior Distributions for Bayesian Inference Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 41, pp. 113- 128 ,(1979) , 10.1111/J.2517-6161.1979.TB01066.X
Peter Cheeseman, A method of computing generalized Bayesian probability values for expert systems international joint conference on artificial intelligence. pp. 198- 202 ,(1983)
James F. Fries, Dennis J. McShane, ARAMIS (the American Rheumatism Association Medical Information System). A prototypical national chronic-disease data bank. Western Journal of Medicine. ,vol. 145, pp. 798- 804 ,(1986)
Gregory O Stone, None, An analysis of the delta rule and the learning of statistical associations Parallel distributed processing: explorations in the microstructure of cognition, vol. 1. pp. 444- 459 ,(1986)
H. Heyer, Information and Sufficiency Springer, New York, NY. pp. 142- 173 ,(1982) , 10.1007/978-1-4613-8218-8_7
William B. Gevarter, Automatic probabilistic knowledge acquisition from data 1987 IEEE Third International Conference on Data Engineering. pp. 277- 280 ,(1987) , 10.1109/ICDE.1987.7272384
Pat Langley, Gary L. Bradshaw, Herbert A. Simon, Rediscovering Chemistry with the Bacon System Machine Learning. pp. 307- 329 ,(1983) , 10.1007/978-3-662-12405-5_10
D. E. Heckerman, E. J. Horvitz, B. N. Nathwani, Toward normative expert systems: Part I. The Pathfinder project. Methods of Information in Medicine. ,vol. 31, pp. 90- 105 ,(1992) , 10.1055/S-0038-1634867