Statistical database query using random sampling of records

Authors: Ralph Edward Sipple, James Michael Plasek

Abstract: A system and method for expediting database queries by using random sampling. Data associated with a database attribute is partitioned into multiple data classes using a database query language grouping command. Each of the data classes is randomly sampled on an individual basis to obtain a corresponding number of class samples, each of which is stored in a separate sample table. The database query is then applied to the class samples.
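
The flow described in the abstract can be illustrated with a minimal sketch, assuming an SQLite database accessed through Python's standard `sqlite3` module. The function name `build_sample_table`, the `orders` table, the `fraction` parameter, and the choice to collect all class samples in a single `<table>_sample` table are assumptions made for illustration; the record above does not specify them, and this is not the patented implementation.

```python
import random
import sqlite3


def build_sample_table(conn, table, attr, fraction, seed=0):
    """Sample each class of `attr` separately and store the rows in <table>_sample.

    Hypothetical helper: `table` and `attr` are trusted identifiers here,
    so they are interpolated directly into the SQL text.
    """
    rng = random.Random(seed)
    sample = f"{table}_sample"
    conn.execute(f"DROP TABLE IF EXISTS {sample}")
    # Create an empty table with the same columns as the base table.
    conn.execute(f"CREATE TABLE {sample} AS SELECT * FROM {table} WHERE 0")

    # The grouping command partitions the data into classes, one per attribute value.
    classes = conn.execute(
        f"SELECT {attr}, COUNT(*) FROM {table} GROUP BY {attr}"
    ).fetchall()

    for value, count in classes:
        # Row ids of one class; IS is a NULL-safe comparison in SQLite.
        rowids = [r[0] for r in conn.execute(
            f"SELECT rowid FROM {table} WHERE {attr} IS ? ORDER BY rowid", (value,)
        )]
        # Sample this class on its own, keeping at least one row per class.
        k = min(len(rowids), max(1, round(count * fraction)))
        for rowid in rng.sample(rowids, k):
            conn.execute(
                f"INSERT INTO {sample} SELECT * FROM {table} WHERE rowid = ?",
                (rowid,),
            )
    conn.commit()
    return sample


# Usage: 100 "east" rows and 50 "west" rows, sampled at 10% per class.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
rows = [("east", float(i)) for i in range(100)]
rows += [("west", float(i)) for i in range(50)]
conn.executemany("INSERT INTO orders VALUES (?, ?)", rows)

sample = build_sample_table(conn, "orders", "region", fraction=0.1)

# The original aggregate query is applied to the sample table instead of the base table.
for region, n, avg in conn.execute(
    f"SELECT region, COUNT(*), AVG(amount) FROM {sample} GROUP BY region"
):
    print(region, n, avg)
```

Sampling each class individually guarantees that small classes still appear in the sample, which a single uniform sample over the whole table would not. The per-class averages printed here are estimates whose accuracy depends on the sampling fraction.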

References (8)
Peter Phaal, Neil Howard McKee. Accessing data held in large databases (1995).
Frank Olken, Doron Rotem. Simple Random Sampling from Relational Databases. Very Large Data Bases, pp. 160-169 (1986).
Gennady Antoshenkov. Random Sampling from Pseudo-Ranked B+ Trees. Very Large Data Bases, pp. 375-382 (1992).
F. Olken, D. Rotem. Maintenance of materialized views of sampling queries. International Conference on Data Engineering, pp. 632-641 (1992). DOI: 10.1109/ICDE.1992.213145
Yibei Ling, Wei Sun. An evaluation of sampling-based size estimation methods for selections in database systems. International Conference on Data Engineering, pp. 532-539 (1995). DOI: 10.1109/ICDE.1995.380360
Wen-Chi Hou, Gultekin Ozsoyoglu, Baldeo K. Taneja. Processing aggregate relational queries with hard time constraints. International Conference on Management of Data, vol. 18, pp. 68-77 (1989). DOI: 10.1145/66926.66933
L. Craig Murray. Composite random sampling (1991).