Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection

作者: Yang Cong , Junsong Yuan , Jiebo Luo

DOI: 10.1109/TMM.2011.2166951

关键词:

摘要: The rapid growth of consumer videos requires an effective and efficient content summarization method to provide a user-friendly way manage browse the huge amount video data. Compared with most previous methods that focus on sports news videos, personal is more challenging because its unconstrained lack any pre-imposed structures. We formulate as novel dictionary selection problem using sparsity consistency, where key frames selected such original can be best reconstructed from this representative dictionary. An global optimization algorithm introduced solve model convergence rates O(1/K2) (where K iteration counter), in contrast traditional sub-gradient descent O(1/√K). Our provides scalable solution for both frame extraction skim generation, one select arbitrary number represent videos. Experiments human labeled benchmark dataset comparisons state-of-the-art demonstrate advantages our algorithm.

参考文章(31)
Aude Oliva, Antonio Torralba, Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope International Journal of Computer Vision. ,vol. 42, pp. 145- 175 ,(2001) , 10.1023/A:1011139631724
B. Li, I. Sezan, Semantic sports video analysis: approaches and new applications international conference on image processing. ,vol. 1, pp. 17- 20 ,(2003) , 10.1109/ICIP.2003.1246887
Z. Rasheed, M. Shah, Detection and representation of scenes in videos IEEE Transactions on Multimedia. ,vol. 7, pp. 1097- 1105 ,(2005) , 10.1109/TMM.2005.858392
Markus A. Stricker, Markus Orengo, Similarity of color images Storage and Retrieval for Image and Video Databases. ,vol. 2420, pp. 381- 392 ,(1995) , 10.1117/12.205308
Tong Zhang, Intelligent keyframe extraction for video printing Internet multimedia management systems. Conference. ,vol. 5601, pp. 25- 35 ,(2004) , 10.1117/12.572474
Shuiwang Ji, Jieping Ye, An accelerated gradient method for trace norm minimization Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 457- 464 ,(2009) , 10.1145/1553374.1553434
Yu-Fei Ma, Lie Lu, Hong-Jiang Zhang, Mingjing Li, A user attention model for video summarization acm multimedia. pp. 533- 542 ,(2002) , 10.1145/641007.641116
Yang Cong, Junsong Yuan, Ji Liu, Sparse reconstruction cost for abnormal event detection computer vision and pattern recognition. pp. 3449- 3456 ,(2011) , 10.1109/CVPR.2011.5995434
Bin Yu, Wei-Ying Ma, Klara Nahrstedt, Hong-Jiang Zhang, Video summarization based on user log enhanced link analysis Proceedings of the eleventh ACM international conference on Multimedia - MULTIMEDIA '03. pp. 382- 391 ,(2003) , 10.1145/957013.957095
Zhuang Li, P. Ishwar, J. Konrad, Video Condensation by Ribbon Carving IEEE Transactions on Image Processing. ,vol. 18, pp. 2572- 2583 ,(2009) , 10.1109/TIP.2009.2026677