作者: Wenfei Fan , Floris Geerts , Shuai Ma , Nan Tang , Wenyuan Yu
DOI: 10.1007/978-3-642-41660-6_12
关键词: Data deduplication 、 Data science 、 Master data 、 Data integrity 、 Computer science 、 Data quality 、 Closed-world assumption 、 Data consistency 、 Data mining 、 Logical framework 、 Unification
摘要: Recent work on data quality has primarily focused repairing algorithms for improving consistency and record matching methods deduplication. This paper accentuates several other challenging issues that are essential to developing cleaning systems, namely, error correction with performance guarantees, unification of matching, relative information completeness, currency. We provide an overview recent advances in the study these issues, advocate need a logical framework uniform treatment issues.