Optimal crowd-powered rating and filtering algorithms

作者: Ashish Gupta , Neoklis Polyzotis , Jennifer Widom , Aditya Parameswaran , Stephen Boyd

DOI: 10.14778/2732939.2732942

关键词:

摘要: We focus on crowd-powered filtering, i.e., filtering a large set of items using humans. Filtering is one the most commonly used building blocks in crowdsourcing applications and systems. While solutions for exist, they make range implicit assumptions restrictions, ultimately rendering them not powerful enough real-world applications. describe two approaches to discard these restrictions: one, that carefully generalizes prior work, leading an optimal, but often-times intractable solution, another, provides novel way reasoning about strategies, sometimes suboptimal, efficiently computable solution (that asymptotically close optimal). demonstrate our techniques lead significant reductions error up 30% fixed cost over work application: peer evaluation online courses.

参考文章(40)
Hector Garcia-Molina, Jennifer Widom, Aditya G. Parameswaran, Ming Han Teh, DataSift: An Expressive and Accurate Crowd-Powered Search Toolkit national conference on artificial intelligence. ,(2013)
Mausam Mausam, Daniel S. Weld, Christopher H. Lin, Dynamically switching between synergistic workflows for crowdsourcing national conference on artificial intelligence. pp. 87- 93 ,(2012)
Robert C. Miller, Samuel R. Madden, Eugene Wu, Adam Marcus, David R. Karger, Crowdsourced Databases: Query Processing with People conference on innovative data systems research. pp. 211- 214 ,(2011)
Geoffrey M. Voelker, Chris Kanich, Marti Motoyama, Damon McCoy, Kirill Levchenko, Stefan Savage, Re: CAPTCHAs: understanding CAPTCHA-solving services in an economic context usenix security symposium. pp. 28- 28 ,(2010)
Beth Trushkowsky, Tim Kraska, Purnamrita Sarkar, Michael J. Franklin, Getting It All from the Crowd arXiv: Databases. ,(2012)
Eytan Bakshy, Jake M. Hofman, Winter A. Mason, Duncan J. Watts, Everyone's an influencer: quantifying influence on twitter web search and data mining. pp. 65- 74 ,(2011) , 10.1145/1935826.1935845
Rion Snow, Brendan O'Connor, Daniel Jurafsky, Andrew Y. Ng, Cheap and fast---but is it good? Proceedings of the Conference on Empirical Methods in Natural Language Processing - EMNLP '08. pp. 254- 263 ,(2008) , 10.3115/1613715.1613751
Stephen Guo, Aditya Parameswaran, Hector Garcia-Molina, So who won? Proceedings of the 2012 international conference on Management of Data - SIGMOD '12. pp. 385- 396 ,(2012) , 10.1145/2213836.2213880
Hyunjung Park, Richard Pang, Aditya Parameswaran, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer Widom, An overview of the deco system: data model and query language; query processing and optimization international conference on management of data. ,vol. 41, pp. 22- 27 ,(2013) , 10.1145/2430456.2430462
Vikas C. Raykar, Shipeng Yu, Linda H. Zhao, Anna Jerebko, Charles Florin, Gerardo Hermosillo Valadez, Luca Bogoni, Linda Moy, Supervised learning from multiple experts Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 889- 896 ,(2009) , 10.1145/1553374.1553488