New options for hoeffding trees

作者: Bernhard Pfahringer , Geoffrey Holmes , Richard Kirkby

DOI: 10.1007/978-3-540-76928-6_11

关键词:

摘要: Hoeffding trees are state-of-the-art for processing high-speed data streams. Their ingenuity stems from updating sufficient statistics, only addressing growth when decisions can be made that guaranteed to almost identical those would by conventional batch learning methods. Despite this guarantee, still subject limited lookahead and stability issues. In paper we explore Option Trees, a regular tree containing additional option nodes allow several tests applied, leading multiple as separate paths. We show how control in order generate mixture of paths, empirically determine reasonable number then evaluate spectrum variations: single trees, bagged trees. Finally, investigate pruning.We on some datasets pruned smaller more accurate than tree.

参考文章(13)
Knowledge discovery in databases : pkdd 2005 Published in <b>2005</b> in New York NY) by Springer. ,(2005) , 10.1007/11564126
Geoffrey Holmes, Richard Kirkby, Bernhard Pfahringer, Stress-Testing Hoeffding Trees Knowledge Discovery in Databases: PKDD 2005. pp. 495- 502 ,(2005) , 10.1007/11564126_50
Ron Kohavi, Clayton Kunz, Option Decision Trees with Majority Votes international conference on machine learning. pp. 161- 169 ,(1997)
Michael J. Pazzani, Kamal Mahmood Ali, Learning probabilistic relational concept descriptions University of California, Irvine. ,(1996)
Pedro Domingos, Geoff Hulten, Mining high-speed data streams knowledge discovery and data mining. pp. 71- 80 ,(2000) , 10.1145/347090.347107
C. L. Blake, UCI Repository of machine learning databases www.ics.uci.edu/〜mlearn/MLRepository.html. ,(1998)
João Gama, Ricardo Rocha, Pedro Medas, Accurate decision trees for mining high-speed data streams knowledge discovery and data mining. pp. 523- 528 ,(2003) , 10.1145/956750.956813
R. Agrawal, T. Imielinski, A. Swami, Database mining: a performance perspective IEEE Transactions on Knowledge and Data Engineering. ,vol. 5, pp. 914- 925 ,(1993) , 10.1109/69.250074
Wray Buntine, Learning classification trees Statistics and Computing. ,vol. 2, pp. 63- 73 ,(1992) , 10.1007/BF01889584
Eric Ziegel, Artificial intelligence and statistics Technometrics. ,vol. 31, pp. 130- 130 ,(1986) , 10.1080/00401706.1989.10488504