作者: C.K.-S. Leung , M.A.F. Mateo , A.J. Nadler
关键词: Data integrity 、 Data warehouse 、 Data modeling 、 Data efficiency 、 Data consistency 、 Computer science 、 Data mining 、 Data validation 、 Data quality 、 Data stream mining
摘要: Data mining aims to search for implicit, previously unknown, and potentially useful information that might be embedded in the data. It is well known "garbage in, garbage out". Hence, get meaningful results, a clean set of data essential. In this paper, we propose an effective model controlling quality Specifically, three-layer focuses on validity consistency. To elaborate, internal layer ensures observed are valid their values fall within reasonable ranges. The temporal consistent with behaviour. spatial neighbours. A case study applying our proposed real-life weather agricultural application shows improving quality, thus leading better results. important note not confined applications. We also discuss, how can effectively applicable control some other situations.