Proactive-Reactive Prediction for Data Streams

作者: Xindong Wu , Ying Yang , Xingquan Zhu

DOI:

关键词:

摘要: Prediction in streaming data is an important activity various branches of science such as sociology, economics and politics. Two major challenges offered by streams are (1) the underlying concept may change over time; (2) grow without limit so that it difficult to retain a long history raw data. Previous research has mainly focused on manipulating relatively recent The distinctive contribution this paper three folds. First, uses measure conceptual equivalence organize into concepts. Transition patterns among concepts can be learned from help prediction. Second, carries out prediction at two levels, general level predicting each oncoming specific instance’s class. Third, proposes system RePro incorporates reactive proactive mechanisms predict with efficacy efficiency. Experiments conducted compare representative existing methods benchmark sets represent diversified scenarios change. Empirical evidence offers inspiring insights suggests proposed methodology advisable solution for streams.

参考文章(17)
Steven L. Salzberg, Alberto Segre, Programs for Machine Learning ,(1994)
MARCOS SALGANICOFF, Tolerating concept and sampling shift in lazy learning using prediction error context switching Artificial Intelligence Review. ,vol. 11, pp. 133- 155 ,(1997) , 10.1023/A:1006515405170
Michael Harries, Kim Horn, Learning stable concepts in a changing world pacific rim international conference on artificial intelligence. pp. 106- 122 ,(1996) , 10.1007/3-540-64413-X_31
W. Nick Street, YongSeog Kim, A streaming ensemble algorithm (SEA) for large-scale classification knowledge discovery and data mining. pp. 377- 382 ,(2001) , 10.1145/502512.502568
Eamonn Keogh, Shruti Kasetty, On the need for time series data mining benchmarks Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '02. pp. 102- 111 ,(2002) , 10.1145/775047.775062
Haixun Wang, Wei Fan, Philip S. Yu, Jiawei Han, Mining concept-drifting data streams using ensemble classifiers knowledge discovery and data mining. pp. 226- 235 ,(2003) , 10.1145/956750.956778
Geoff Hulten, Laurie Spencer, Pedro Domingos, Mining time-changing data streams knowledge discovery and data mining. pp. 97- 106 ,(2001) , 10.1145/502512.502529