On the application of GP to streaming data classification tasks with label budgets

作者: Ali Vahdat , Aaron Atwater , Andrew R. McIntyre , Malcolm I. Heywood

DOI: 10.1145/2598394.2611385

关键词:

摘要: A framework is introduced for applying GP to streaming data classification tasks under label budgets. This a fundamental requirement if going adapt the challenge of environments. The proposes three elements: sampling policy, subset and archiving policy. policy establishes on what basis sampled from stream, therefore when information requested. used define individuals evolve against. composition such mixture forwarded historical identified through combination achieve decoupling between rate at which stream passes evolution commences. Benchmarking performed two artificial sets with specific forms sudden shift gradual drift as well known real-world set.

参考文章(38)
Tamraparni Dasu, Suresh Venkatasubramanian, Ke Yi, Shankar Krishnan, An Information-Theoretic Approach to Detecting Changes in Multi-Dimensional Data Streams Proc. Symposium on the Interface of Statistics, Computing Science, and Applications (Interface). ,(2006)
Indrė Žliobaitė, Albert Bifet, Bernhard Pfahringer, Geoff Holmes, Active Learning with Evolving Streaming Data Machine Learning and Knowledge Discovery in Databases. pp. 597- 612 ,(2011) , 10.1007/978-3-642-23808-6_39
Philip S. Yu, Wei Fan, Haixun Wang, Yi an Huang, Active mining of data streams siam international conference on data mining. pp. 457- 461 ,(2004)
Albert Bifet, Jesse Read, Indrė Žliobaitė, Bernhard Pfahringer, Geoff Holmes, Pitfalls in benchmarking data stream classification and how to avoid them european conference on machine learning. pp. 465- 479 ,(2013) , 10.1007/978-3-642-40988-2_30
Ian Dempsey, Anthony Brabazon, Michael O'Neill, Foundations in Grammatical Evolution for Dynamic Environments ,(2009)
Raquel Sebastião, João Gama, Change detection in learning histograms from data streams portuguese conference on artificial intelligence. pp. 112- 123 ,(2007) , 10.5555/1782254.1782265
Hai H. Dam, Chris Lokan, Hussein A. Abbass, Evolutionary Online Data Mining: An Investigation in a Dynamic Environment Evolutionary Computation in Dynamic and Uncertain Environments. pp. 153- 178 ,(2007) , 10.1007/978-3-540-49774-5_7
João Gama, Pedro Medas, Gladys Castillo, Pedro Rodrigues, Learning with Drift Detection Advances in Artificial Intelligence – SBIA 2004. pp. 286- 295 ,(2004) , 10.1007/978-3-540-28645-5_29
Gianluigi Folino, Giuseppe Papuzzo, Handling different categories of concept drifts in data streams using distributed GP european conference on genetic programming. pp. 74- 85 ,(2010) , 10.1007/978-3-642-12148-7_7