Authors: Tianran Li, Deima T. Elnatour
DOI:
Keywords:
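Abstract: A sample of product listings is selected from a catalog. An audit process is performed to identify other listings in the catalog that are duplicates of listings in the sample. The probability that a listing would be included in a randomly selected sample is computed for each of the sampled and duplicate listings. A weight is assigned to each listing that is inversely proportional to the probability computed for that listing. The weights may then be utilized to compute a de-biased estimated duplication rate, which may be used to reduce an actual duplication rate in a database.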
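The weighting idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions that the abstract does not state: the sample is a simple random sample drawn without replacement; a listing shows up in the audit output whenever any member of its duplicate cluster is sampled; and the duplication rate is defined as the share of listings that have at least one duplicate. The function names `inclusion_probability` and `debiased_duplication_rate` are hypothetical.

```python
from math import comb


def inclusion_probability(cluster_size, sample_size, catalog_size):
    """Probability that at least one of `cluster_size` listings falls into a
    simple random sample of `sample_size` listings drawn without replacement
    from a catalog of `catalog_size` listings (assumed sampling scheme)."""
    missed = comb(catalog_size - cluster_size, sample_size)
    return 1.0 - missed / comb(catalog_size, sample_size)


def debiased_duplication_rate(cluster_sizes, sample_size, catalog_size):
    """Weighted share of listings that have at least one duplicate.

    `cluster_sizes` holds one entry per distinct duplicate cluster found by
    the audit; a value of 1 means the sampled listing had no duplicates.
    Every listing in a cluster gets the weight 1 / inclusion probability.
    """
    weighted_total = 0.0
    weighted_duplicated = 0.0
    for size in cluster_sizes:
        weight = 1.0 / inclusion_probability(size, sample_size, catalog_size)
        weighted_total += weight * size
        if size > 1:
            weighted_duplicated += weight * size
    return weighted_duplicated / weighted_total


# Toy audit: catalog of 10 listings, sample of 3, which revealed two
# singleton listings and one cluster of two duplicates.
rate = debiased_duplication_rate([1, 1, 2], sample_size=3, catalog_size=10)
print(round(rate, 4))
```

In this toy case the unweighted rate would be 2 of 4 listings, or 0.5. Larger clusters are more likely to be hit by the sample, so their listings receive smaller weights and the weighted estimate comes out lower.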