Joint people, event, and location recognition in personal photo collections using cross-domain context

作者: Dahua Lin , Ashish Kapoor , Gang Hua , Simon Baker

DOI: 10.1007/978-3-642-15549-9_18

关键词:

摘要: We present a framework for vision-assisted tagging of personal photo collections using context. Whereas previous efforts mainly focus on people, we develop unified approach to jointly tag across multiple domains (specifically events, and locations). The heart our is generic probabilistic model context that couples the through set cross-domain relations. Each relation models how likely instances in two are co-occur. Based this model, derive an algorithm simultaneously estimates relations infers unknown tags semi-supervised manner. conducted experiments well-known datasets obtained significant performance improvements both people location recognition. also demonstrated ability infer event labels with missing timestamps (i.e. no features).

参考文章(27)
Charles Sutton, Andrew McCallum, An Introduction to Conditional Random Fields for Relational Learning MIT Press. ,(2007)
Marc Davis, Michael Smith, John Canny, Nathan Good, Simon King, Rajkumar Janakiraman, Towards context-aware face recognition Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA '05. pp. 483- 486 ,(2005) , 10.1145/1101149.1101257
Florian Schroff, C. Lawrence Zitnick, Simon Baker, Clustering videos by location british machine vision conference. pp. 1- 11 ,(2010) , 10.5244/C.23.48
Jingyu Cui, Fang Wen, Rong Xiao, Yuandong Tian, Xiaoou Tang, EasyAlbum: an interactive photo annotation system based on face clustering and re-ranking human factors in computing systems. pp. 367- 376 ,(2007) , 10.1145/1240624.1240684
Marc Davis, Michael Smith, Fred Stentiford, Adetokunbo Bamidele, John Canny, Nathan Good, Simon King, Rajkumar Janakiraman, Using Context and Similarity for Face and Location Identification electronic imaging. ,vol. 6061, ,(2006) , 10.1117/12.650981
Richard H. Byrd, Peihuang Lu, Jorge Nocedal, Ciyou Zhu, A Limited Memory Algorithm for Bound Constrained Optimization SIAM Journal on Scientific Computing. ,vol. 16, pp. 1190- 1208 ,(1995) , 10.1137/0916069
Andrew Rabinovich, Andrea Vedaldi, Carolina Galleguillos, Eric Wiewiora, Serge Belongie, Objects in Context international conference on computer vision. pp. 1- 8 ,(2007) , 10.1109/ICCV.2007.4408986
Andrew C. Gallagher, Tsuhan Chen, Using Context to Recognize People in Consumer Images IPSJ Transactions on Computer Vision and Applications. ,vol. 1, pp. 115- 126 ,(2009) , 10.2197/IPSJTCVA.1.115
Andrew C. Gallagher, Tsuhan Chen, Using a Markov Network to Recognize People in Consumer Images international conference on image processing. ,vol. 4, pp. 489- 492 ,(2007) , 10.1109/ICIP.2007.4380061
Li-Jia Li, Richard Socher, Li Fei-Fei, Towards total scene understanding: Classification, annotation and segmentation in an automatic framework computer vision and pattern recognition. pp. 2036- 2043 ,(2009) , 10.1109/CVPR.2009.5206718