De-biased estimated duplication rate

Authors: Tianran Li, Deima T. Elnatour

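Abstract: A sample of product listings is selected from a catalog. An audit process is performed to identify other listings in the catalog that are duplicates of the listings in the sample. The probability that a listing would be included in a randomly selected sample is computed for each of the sampled listings and duplicate listings. A weight is assigned to each listing that is inversely proportional to the probability for that listing. The weights may then be utilized to compute a de-biased estimated duplication rate, which may in turn be utilized to reduce an actual duplication rate in the database.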
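
The weighting step in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: the sample is a simple random sample drawn without replacement; the audit finds every member of a duplicate cluster once any one member is sampled; and the duplication rate is defined as the fraction of listings that are redundant copies (a cluster of size k contributes k - 1). The function names `inclusion_probability` and `debiased_duplication_rate` are hypothetical.

```python
from math import comb


def inclusion_probability(cluster_size, sample_size, catalog_size):
    """Probability that at least one member of a duplicate cluster is drawn
    in a simple random sample without replacement (an assumed sampling scheme)."""
    miss = comb(catalog_size - cluster_size, sample_size) / comb(catalog_size, sample_size)
    return 1.0 - miss


def debiased_duplication_rate(cluster_sizes, sample_size, catalog_size):
    """Weighted ratio estimate of the duplication rate.

    cluster_sizes holds one entry per distinct cluster found by the sample
    plus the audit; a listing with no duplicates is a cluster of size 1.
    """
    redundant = 0.0
    total = 0.0
    for k in cluster_sizes:
        # Every listing in the cluster gets the same weight, inversely
        # proportional to the chance that the cluster was discovered.
        w = 1.0 / inclusion_probability(k, sample_size, catalog_size)
        redundant += w * (k - 1)  # weighted count of redundant copies
        total += w * k            # weighted count of all listings
    return redundant / total if total else 0.0
```

Larger clusters are more likely to be hit by a random sample, so an unweighted ratio over the discovered listings tends to overstate the duplication rate. Under the assumptions above, dividing by the inclusion probability down-weights those clusters.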
