Authors: Wen He, Yongjoo Park, Idris Hanafi, Jacob Yatvitskiy, Barzan Mozafari
Keywords:
Abstract: We demonstrate VerdictDB, the first platform-independent approximate query processing (AQP) system. Unlike existing AQP systems that are tightly integrated into a specific database, VerdictDB operates at the driver level, acting as middleware between users and off-the-shelf database systems. In other words, VerdictDB requires no modifications to the database internals; it simply relies on rewriting incoming queries such that the standard execution of the rewritten queries under relational semantics yields approximate answers to the original queries. VerdictDB exploits a novel technique for error estimation called variational subsampling, which is amenable to efficient computation via SQL. In this demonstration, we showcase VerdictDB's performance benefits (up to two orders of magnitude) compared to queries issued directly to existing query engines. We also illustrate that the approximate answers returned by VerdictDB are nearly identical to the exact answers. We use Apache Spark SQL and Amazon Redshift as two examples of modern distributed query platforms. We allow the audience to explore VerdictDB using a web-based interface (e.g., Hue or Zeppelin) to issue queries and visualize their answers. VerdictDB is currently open-sourced and available under the Apache License (V2).
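To make the rewriting idea concrete, the following is a minimal sketch of the general approach, not VerdictDB's actual rewriting rules or API. It assumes a hypothetical `orders` table, a pre-built uniform sample `orders_sample` whose rows each carry a subsample id `sid`, and a hand-written rewritten query. The error estimate is a simplified subsampling-style calculation (the spread of per-subsample estimates, rescaled by subsample size); it only approximates the flavor of variational subsampling described in the abstract. SQLite stands in for the backend purely so the example is self-contained.

```python
import math
import random
import sqlite3

SAMPLE_RATIO = 0.1     # hypothetical sampling ratio (assumption)
NUM_SUBSAMPLES = 20    # hypothetical number of subsamples (assumption)


def build_demo_db(num_rows=10_000, seed=0):
    """Create an in-memory table and a uniform sample tagged with subsample ids."""
    rng = random.Random(seed)
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, price REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [(i, float(i % 100)) for i in range(num_rows)],
    )
    # Uniform random sample; each sampled row is assigned to exactly one subsample.
    sample_ids = rng.sample(range(num_rows), int(num_rows * SAMPLE_RATIO))
    conn.execute("CREATE TABLE orders_sample (id INTEGER, price REAL, sid INTEGER)")
    conn.executemany(
        "INSERT INTO orders_sample VALUES (?, ?, ?)",
        [(i, float(i % 100), rng.randrange(NUM_SUBSAMPLES)) for i in sample_ids],
    )
    return conn


# The query a user would issue against the full table.
EXACT_SQL = "SELECT SUM(price) FROM orders"

# Hand-written stand-in for a rewritten query: plain SQL over the sample that
# returns one partial aggregate per subsample.
REWRITTEN_SQL = """
SELECT sid, COUNT(*) AS n_s, AVG(price) AS avg_price
FROM orders_sample
GROUP BY sid
"""


def approximate_sum(conn, total_rows):
    """Return (estimate, standard_error) for SUM(price) using only the sample."""
    rows = conn.execute(REWRITTEN_SQL).fetchall()
    n = sum(n_s for _, n_s, _ in rows)
    # Point estimate: table size times the overall sample mean.
    sample_mean = sum(n_s * avg for _, n_s, avg in rows) / n
    estimate = total_rows * sample_mean
    # Each subsample gives a noisier estimate; rescaling its squared deviation
    # by n_s / n converts subsample-level spread to full-sample spread.
    scaled_sq_devs = [
        (n_s / n) * (total_rows * avg - estimate) ** 2 for _, n_s, avg in rows
    ]
    std_err = math.sqrt(sum(scaled_sq_devs) / len(scaled_sq_devs))
    return estimate, std_err


if __name__ == "__main__":
    conn = build_demo_db()
    exact = conn.execute(EXACT_SQL).fetchone()[0]
    estimate, std_err = approximate_sum(conn, total_rows=10_000)
    print(f"exact={exact:.0f} approx={estimate:.0f} +/- {std_err:.0f}")
```

The point of the sketch is that both the approximate answer and its error estimate come from ordinary SQL (a `GROUP BY` over the sample) plus a small amount of arithmetic, so nothing in the database engine itself needs to change.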