A Descriptive Classification of Causes of Data Quality Problems in Data Warehousing

Authors: Kawaljeet Singh, Ranjit Singh

DOI:

Keywords:

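Abstract: Data warehousing is gaining in eminence as organizations become aware of the benefits of decision-oriented and business-intelligence-oriented databases. However, there is one key stumbling block to the rapid development and implementation of quality data warehouses: data quality issues at the various stages of data warehousing. Specifically, problems arise in populating a warehouse with quality data. Over time, many researchers have contributed to the study of data quality issues, but no research has collectively gathered all the causes of data quality problems across all the phases of data warehousing, viz. 1) data sources, 2) data integration and data profiling, 3) data staging and ETL, and 4) data warehouse modeling and schema design. The purpose of this paper is to identify the reasons for data deficiencies, non-availability, or reachability problems at all of the aforementioned stages of data warehousing and to formulate a descriptive classification of these causes. We have identified a possible set of causes of data quality issues from an extensive literature review and in consultation with data warehouse practitioners working at renowned IT companies in India. We hope this will help developers and implementers of data warehouses to examine and analyze these issues before moving ahead with data integration and data warehouse solutions for quality decision-oriented and business-intelligence-oriented applications.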
