Considering Currency in Decision Trees in the Context of Big Data

作者: Diana Hristova

DOI:

关键词: Incremental decision treeMachine learningProbability theoryArtificial intelligenceData qualityDecision treeBig dataCurrencyContext (language use)Decision tree learningStructure (mathematical logic)Computer scienceData mining

摘要: In the current age of big data, decision trees are one most commonly applied data mining methods. However, for reliable results they require up-to-date input which is not always given in reality. We present a two-phase approach based on probability theory considering currency stored trees. Our efficient and thus suitable applications. Moreover, it independent particular tree classifier. Finally, context-specific since structure supplemental taken into account. demonstrate benefits novel by applying to three datasets. The show substantial increase classification success rate as opposed currency. Thus, our prevents wrong consequently decisions.

参考文章(57)
Diana Hristova, Bernd Heinrich, A FUZZY METRIC FOR CURRENCY IN THE CONTEXT OF BIG DATA european conference on information systems. ,(2014)
Adir Even, Alisa Wechsler, Using a Markov-Chain Model for Assessing Accuracy Degradation and Developing Data Maintenance Policies americas conference on information systems. ,(2012)
Yang Zhang, Chunquan Liang, Qun Song, Decision Tree for Dynamic and Uncertain Data Streams asian conference on machine learning. pp. 209- 224 ,(2010)
Charu C Aggarwal, Managing and Mining Sensor Data Springer US. ,(2013) , 10.1007/978-1-4614-6309-2
A. Blanton Godfrey, Thomas C. Redman, Data Quality For The Information Age ,(1997)
Michalis Vazirgiannis, Maria Halkidi, Dimitrious Gunopulos, None, Uncertainty handling and quality assessment in data mining ,(2003)
Michael Chau, Reynold Cheng, Ben Kao, Jackey Ng, Uncertain data mining: an example in clustering location data knowledge discovery and data mining. pp. 199- 204 ,(2006) , 10.1007/11731139_24
Carson Kai-Sang Leung, Yaroslav Hayduk, Mining Frequent Patterns from Uncertain Data with MapReduce for Big Data Analytics database systems for advanced applications. pp. 440- 455 ,(2013) , 10.1007/978-3-642-37487-6_33
Mathias Klier, Bernd Heinrich, A Novel Data Quality Metric for Timeliness considering Supplemental Data european conference on information systems. pp. 2651- 2662 ,(2009)
Irwin Miller, Marylees Miller, John E. Freund, John E. Freund's Mathematical Statistics with Applications ,(2003)