Sampling for database systems

作者: Rajeev Motwani , Vivek Narasayya , Surajit Chaudhuri

DOI:

关键词:

摘要: A database server supports weighted and unweighted sampling of records or tuples in accordance with desired semantics such as replacement (WR), without (WoR), independent coin flips (CF) semantics, for example. The may perform sequentially not only to sample non-materialized records, those produced a stream by pipeline query tree example, but also whether materialized not, single pass. over join two relations requiring the computation full materialization both and/or indexes on attribute values relations.

参考文章(70)
Yossi Matias, Andrew Witkowski, Phillip B. Gibbons, Viswanath Poosala, Incremental maintenance of an approximate histogram in a database system ,(1997)
Yori Takahashi, Kazutomo Ushijima, Kazuo Masai, Itaru Nishizawa, Shinji Fujiwara, Random sampling method for use in a database processing system and a database processing system based thereon ,(1998)
Phillip B. Gibbons, Sridhar Ramaswamy, Viswanath Poosala, Swarup Acharya, Join synopsis-based approximate query answering ,(2002)
Douglass R. Cutting, Jan. O. Pedersen, John W. Tukey, David Karger, Scatter-gather: a cluster-based method and apparatus for browsing large document collections ,(1991)
Peter Phaal, Neil Howard McKee, Accessing data held in large databases ,(1995)