Simple low cost causal discovery using mutual information and domain knowledge

Author: Adrian Joseph

DOI:

Keywords:

Abstract: Bayesian networks (BNs) provide a means for representing, displaying, and making available in a usable form the knowledge of experts in a given field. In this paper, we look at the performance of an expert constructed BN compared with other machine learning (ML) techniques for predicting the outcome (win, lose, or draw) of matches played by Tottenham Hotspur Football Club. The period under study was 1995–1997; the expert BN was constructed at the start of that period, based almost exclusively on subjective judgement. Our objective was to determine retrospectively the comparative accuracy of the expert BN compared to some alternative ML models that were built using data from the two-year period. The additional ML techniques considered were: MC4, a decision tree learner; Naive Bayesian learner; Data Driven Bayesian (a BN whose structure and node probability tables are learnt entirely from data); and a K-nearest neighbour learner. The results show that the expert BN is generally superior to the other techniques for this domain in predictive accuracy. The results are even more impressive for BNs given that, in a number of key respects, the study assumptions place them at a disadvantage. For example, we have assumed that a BN prediction is 'incorrect' if a BN predicts more than one outcome as equally most likely (whereas, in fact, such a prediction would prove valuable to somebody who could place an 'each way' bet on the outcome). Although the expert BN has now long been irrelevant (since it contains variables relating to key players who have retired or left the club), the results here tend to confirm the excellent potential of BNs when they are built by a reliable domain expert. The ability to provide accurate predictions without requiring much learning data is an obvious bonus in any domain where data are scarce. Moreover, the BN was relatively simple for the expert to build, and its structure could be used again in this and similar types of problems. © 2006 Elsevier B.V. All rights reserved.

References (334)
Risi Imre Kondor, John Lafferty. Diffusion kernels on graphs and other discrete structures. International Conference on Machine Learning (2002).
Thomas G. Dietterich, Hussein Almuallim. Learning with many irrelevant features. National Conference on Artificial Intelligence, pp. 547-552 (1991).
Sherry H. Walden, Timothy L. Acorn. SMART: support management automated reasoning technology for Compaq customer service. Innovative Applications of Artificial Intelligence, pp. 3-18 (1992).
Ron Kohavi, Dan Sommerfield. Feature subset selection using the wrapper method: overfitting and dynamic search space topology. Knowledge Discovery and Data Mining, pp. 192-197 (1995).
David K. Lewis. Philosophical Papers: Volume I (1983).
Herbert A. Simon. Causal Ordering and Identifiability. Springer, Dordrecht, pp. 53-80 (1977). 10.1007/978-94-010-9521-1_5
Mieczyslaw A. Klopotek. Learning belief network structure from data under causal insufficiency. European Conference on Machine Learning, pp. 379-382 (1994). 10.1007/3-540-57868-4_78
Michael J. Pazzani, Eamonn J. Keogh. Learning augmented Bayesian classifiers: A comparison of distribution-based and classification-based approaches. International Conference on Artificial Intelligence and Statistics (1999).
Joe Suzuki. Learning Bayesian Belief Networks Based on the MDL Principle: An Efficient Algorithm Using the Branch and Bound Technique. IEICE Transactions on Information and Systems, vol. 82, pp. 356-367 (1999).