Random sampling of rows in a parallel processing database system

作者: Ambuj Shatdal

DOI:

关键词:

摘要: A method, apparatus, and article of manufacture for random sampling rows stored in a table, wherein the table has plurality partitions. row count is determined each partitions total number from table. proportional allocation sample size computed based on rows. set retrieved contributes its to Preferably, computer system parallel processing database system, units manages partition some above steps can be performed by units.

参考文章(3)
Vivek Narasayya, Surajit Chaudhuri, What-if index analysis utility for database systems ,(1998)
Ralph Edward Sipple, James Michael Plasek, Statistical database query using random sampling of records ,(1996)