Using web sources for improving video categorization

作者: José M. Perea-Ortega , Arturo Montejo-Ráez , M. Teresa Martín-Valdivia , L. Alfonso Ureña-López

DOI: 10.1007/S10844-010-0123-6

关键词:

摘要: In this paper, several experiments about video categorization using a supervised learning approach are presented. To end, the VideoCLEF 2008 evaluation forum has been chosen as experimental framework. After an analysis of corpus, it was found that transcriptions not best source information in order to identify thematic streams. Therefore, two web-based corpora have generated aim adding more informational sources by integrating documents from Wikipedia articles and Google searches. A number test data accomplished. Several machine algorithms proved validate effect corpus on final results: Naive Bayes, K-nearest-neighbors (KNN), Support Vectors Machine (SVM) j48 decision tree. The results obtained show web can be useful for generating classification models data.

参考文章(20)
M. C. Díaz-Galiano, M. Á. García-Cumbreras, M. T. Martín-Valdivia, A. Montejo-Ráez, L. Alfonso Ureña-López, Using information gain to improve the ImageCLEF 2006 collection cross language evaluation forum. pp. 711- 714 ,(2006) , 10.1007/978-3-540-74999-8_89
M. T. Martín-Valdivia, M. A. García-Cumbreras, M. C. Díaz-Galiano, L. A. Ureña-López, A. Montejo-Raez, The university of jaén at ImageCLEF 2005: adhoc and medical tasks cross language evaluation forum. pp. 612- 621 ,(2005) , 10.1007/11878773_68
Arturo Montejo Ráez, Luis Alfonso Ureña López, Binary classifiers versus AdaBoost for labeling of digital documents Procesamiento Del Lenguaje Natural. ,vol. 37, pp. 319- 326 ,(2006)
Henning Müller, Jayashree Kalpathy-Cramer, Charles E. Kahn, William Hatt, Steven Bedrick, William Hersh, Overview of the ImageCLEFmed 2008 medical image retrieval task cross language evaluation forum. ,vol. 1174, pp. 512- 522 ,(2008) , 10.1007/978-3-642-04447-2_63
David Bargeron, Anoop Gupta, Jonathan Grudin, Elizabeth Sanocki, Annotations for streaming video on the Web: system design and usage studies the web conference. ,vol. 31, pp. 1139- 1153 ,(1999) , 10.1016/S1389-1286(99)00058-4
Timo Volkmer, John R. Smith, Apostol (Paul) Natsev, A web-based system for collaborative annotation of large image and video collections Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA '05. pp. 892- 901 ,(2005) , 10.1145/1101149.1101341
David D. Lewis, Evaluating text categorization human language technology. pp. 312- 318 ,(1991) , 10.3115/112405.112471
Jia Li, Shih-Fu Chang, Michael Lesk, Rainer Lienhart, Jiebo Luo, Arnold W. M. Smeulders, New challenges in multimedia research for the increasingly connected and fast growing digital society Proceedings of the international workshop on Workshop on multimedia information retrieval - MIR '07. pp. 3- 10 ,(2007) , 10.1145/1290082.1290086
Alan F. Smeaton, Paul Over, Wessel Kraaij, Evaluation campaigns and TRECVid multimedia information retrieval. pp. 321- 330 ,(2006) , 10.1145/1178677.1178722
Manuel Carlos Díaz-Galiano, José M. Perea-Ortega, Arturo Montejo-Ráez, María Teresa Martín-Valdivia, Luis Alfonso Ureña López, SINAI at VideoCLEF 2008 CLEF (Working Notes). ,(2008)