The Crowd-Median Algorithm.

作者: Antti Ukkonen , Hannes Heikinheimo

DOI:

关键词:

摘要: The power of human computation is founded on the capabilities humans to process qualitative information in a manner that hard reproduce with computer. However, all machine learning algorithms rely mathematical operations, such as sums, averages, least squares etc. are less suitable for computation. This paper an effort combine these two aspects data processing. We consider problem computing centroid set, key component many data-analysis applications clustering, using very simple intelligence task (HIT). In this workers must choose outlier from set three items. After presenting number triplets workers, item chosen times selected centroid. provide proof determined by procedure equal mean univariate normal distribution. Furthermore, demonstration viability our method, we implement based variant k-means clustering algorithm. present experiments where proposed method used find "average" image collection, and cluster images semantic categories.

参考文章(16)
Aude Oliva, Antonio Torralba, Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope International Journal of Computer Vision. ,vol. 42, pp. 145- 175 ,(2001) , 10.1023/A:1011139631724
Aditya Ganesh Parameswaran, Hyunjung Park, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer Widom, Deco Proceedings of the 21st ACM international conference on Information and knowledge management - CIKM '12. pp. 1203- 1212 ,(2012) , 10.1145/2396761.2398421
Hyunjung Park, Hector Garcia-Molina, Richard Pang, Neoklis Polyzotis, Aditya Parameswaran, Jennifer Widom, Deco Proceedings of the VLDB Endowment. ,vol. 5, pp. 1990- 1993 ,(2012) , 10.14778/2367502.2367555
Aditya G. Parameswaran, Hector Garcia-Molina, Hyunjung Park, Neoklis Polyzotis, Aditya Ramesh, Jennifer Widom, CrowdScreen Proceedings of the 2012 international conference on Management of Data - SIGMOD '12. pp. 361- 372 ,(2012) , 10.1145/2213836.2213878
Aniket Kittur, Boris Smus, Susheel Khamkar, Robert E. Kraut, CrowdForge Proceedings of the 24th annual ACM symposium on User interface software and technology - UIST '11. pp. 43- 52 ,(2011) , 10.1145/2047196.2047202
Petros Venetis, Hector Garcia-Molina, Kerui Huang, Neoklis Polyzotis, Max algorithms in crowdsourcing environments the web conference. pp. 989- 998 ,(2012) , 10.1145/2187836.2187969
B. Trushkowsky, T. Kraska, M. J. Franklin, P. Sarkar, Crowdsourced enumeration queries international conference on data engineering. pp. 673- 684 ,(2013) , 10.1109/ICDE.2013.6544865
Laurens van der Maaten, Kilian Weinberger, Stochastic triplet embedding international workshop on machine learning for signal processing. pp. 1- 6 ,(2012) , 10.1109/MLSP.2012.6349720
Aditya Parameswaran, Anish Das Sarma, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer Widom, Human-assisted graph search Proceedings of the VLDB Endowment. ,vol. 4, pp. 267- 278 ,(2011) , 10.14778/1952376.1952377
Lydia B. Chilton, Greg Little, Darren Edge, Daniel S. Weld, James A. Landay, Cascade: crowdsourcing taxonomy creation human factors in computing systems. pp. 1999- 2008 ,(2013) , 10.1145/2470654.2466265