Combining Audio-Based and Video-Based Shot Classification Systems for News Videos Segmentation

作者: M. De Santo , G. Percannella , C. Sansone , M. Vento

DOI: 10.1007/11494683_40

关键词:

摘要: In this paper we propose an innovative combination strategy for a system using video and audio stream of news to automatically segment it into stories. our approach, the segmentation is performed in two steps: first, shots are classified by combining three different anchor shot detection algorithms information only. Then, classification improved novel method based on features extracted from track. Experimental results demonstrate that combined use allows perform better than approaches only terms both story segmentation.

参考文章(14)
M. De Santo, G. Percannella, C. Sansone, M. Vento, A Multi-expert Approach for Shot Classification in News Videos international conference on image analysis and recognition. ,vol. 3211, pp. 564- 571 ,(2004) , 10.1007/978-3-540-30125-7_70
Lekha Chaisorn, Tat-Seng Chua, Chin-Hui Lee, A Multi-Modal Approach to Story Segmentation for News Video World Wide Web. ,vol. 6, pp. 187- 208 ,(2003) , 10.1023/A:1023622605600
Image Analysis and Recognition Lecture Notes in Computer Science. ,vol. 5627, ,(2004) , 10.1007/978-3-319-11755-3
C. SANSONE, F. TORTORELLA, M. VENTO, A CLASSIFICATION RELIABILITY DRIVEN REJECT RULE FOR MULTI-EXPERT SYSTEMS International Journal of Pattern Recognition and Artificial Intelligence. ,vol. 15, pp. 885- 904 ,(2001) , 10.1142/S0218001401001210
M. De Santo, G. Percannella, C. Sansone, M. Vento, Combining experts for anchorperson shot detection in news videos Pattern Analysis and Applications. ,vol. 7, pp. 447- 460 ,(2004) , 10.1007/S10044-004-0227-0
Alan Hanjalic, Reginald L. Lagendijk, Jan Biemond, Semiautomatic news analysis, indexing, and classification system based on topic preselection Storage and Retrieval for Image and Video Databases. ,vol. 3656, pp. 86- 97 ,(1998) , 10.1117/12.333829
M. Bertini, A. Del Bimbo, P. Pala, Content-based indexing and retrieval of TV news Pattern Recognition Letters. ,vol. 22, pp. 503- 516 ,(2001) , 10.1016/S0167-8655(00)00113-6
U. Gargi, R. Kasturi, S.H. Strayer, Performance characterization of video-shot-change detection methods IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 10, pp. 1- 13 ,(2000) , 10.1109/76.825852
Y.S. Huang, C.Y. Suen, A method of combining multiple experts for the recognition of unconstrained handwritten numerals IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 17, pp. 90- 94 ,(1995) , 10.1109/34.368145
L.P. Cordella, P. Foggia, C. Sansone, M. Vento, A real-time text-independent speaker identification system international conference on image analysis and processing. pp. 632- 637 ,(2003) , 10.1109/ICIAP.2003.1234121