On the Complexity of Evaluating Order Queries with the Crowd.

作者: Tova Milo , Sudeepa Roy , Benoît Groz

DOI:

关键词: TupleCrowdsourcingInformation retrievalComputer scienceData manipulation languageField (computer science)Information technologyPairwise comparisonProcess (engineering)Set (psychology)Theoretical computer science

摘要: One of the foremost challenges for information technology over last few years has been to explore, understand, and extract useful from large amounts data. Some particular tasks such as annotating data or matching entities have outsourced human workers many years. But seen rise a new research field called crowdsourcing that aims at delegating wide range workers, building formal frameworks, improving efficiency these processes. The database community thus suggesting algorithms process traditional manipulation operators with crowd, joins filtering. This is even more when comparing underlying “tuples” subjective decision – e.g., they are photos, text, simply noisy different variations interpretations can presumably be done better faster by humans than machines. problems considered in this article aim retrieve subset preferred items set pairwise comparison operations crowd. most obvious example finding maximum (called max). We also consider two natural generalizations max problem:

参考文章(40)
Christoph Lofi, Kinda El Maarry, Wolf-Tilo Balke, Skyline Queries over Incomplete Data - Error Models for Focused Crowd-Sourcing international conference on conceptual modeling. pp. 298- 312 ,(2013) , 10.1007/978-3-642-41924-9_25
Hector Garcia-Molina, Neoklis Polyzotis, Luca de Alfaro, James Davis, Vassilis Polychronopoulos, Human-Powered Top-k Lists international workshop on the web and databases. pp. 25- 30 ,(2013)
Ashish Gupta, Neoklis Polyzotis, Jennifer Widom, Aditya Parameswaran, Stephen Boyd, Hector Garcia-Molina, Optimal crowd-powered rating and filtering algorithms Proceedings of the VLDB Endowment. ,vol. 7, pp. 685- 696 ,(2014) , 10.14778/2732939.2732942
Antti Ukkonen, Hannes Heikinheimo, The Crowd-Median Algorithm. national conference on artificial intelligence. ,(2013)
Susan Davidson, Sanjeev Khanna, Tova Milo, Sudeepa Roy, Top-k and Clustering with Noisy Comparisons ACM Transactions on Database Systems. ,vol. 39, pp. 1- 39 ,(2014) , 10.1145/2684066
David G. Kirkpatrick, Raimund Seidel, Output-size sensitive algorithms for finding maximal vectors symposium on computational geometry. pp. 89- 96 ,(1985) , 10.1145/323233.323246
Peyman Afshani, Pankaj K. Agarwal, Lars Arge, Kasper Green Larsen, Jeff M. Phillips, Approximate) Uncertain Skylines Theory of Computing Systems \/ Mathematical Systems Theory. ,vol. 52, pp. 342- 366 ,(2013) , 10.1007/S00224-012-9382-7
Nicholas Pippenger, Sorting and selecting in rounds SIAM Journal on Computing. ,vol. 16, pp. 1032- 1038 ,(1987) , 10.1137/0216066
Yuan Ma, An O(nlogn)-size fault-tolerant sorting network (extended abstract) symposium on the theory of computing. pp. 266- 275 ,(1996) , 10.1145/237814.237876