Auto-summarization of audio-video presentations

作者: Liwei He , Elizabeth Sanocki , Anoop Gupta , Jonathan Grudin

DOI: 10.1145/319463.319691

关键词:

摘要: As streaming audio-video technology becomes widespread, there is a dramatic increase in the amount of multimedia content available on net. Users face new challenge: How to examine large amounts quickly. One technique that can enable quick overview video summaries; is, shorter version assembled by picking important segments from original.We evaluate three techniques for automatic creation summaries online presentations. These exploit information audio signal (e.g., pitch and pause information), knowledge slide transition points presentation, about access patterns previous users. We report user study compares automatically generated are 20%-25% length full presentations author summaries. learn computer-generated summaries, although less than authors' They initially find coherent, but quickly grow accustomed them.

参考文章(24)
M.A. Smith, T. Kanade, Video skimming and characterization through the combination of image and language understanding techniques computer vision and pattern recognition. pp. 775- 781 ,(1997) , 10.1109/CVPR.1997.609414
Yoshinobu Tonomura, Shinji Abe, Content oriented visual interface using video icons for visual database systems Journal of Visual Languages and Computing. ,vol. 1, pp. 183- 198 ,(1990) , 10.1016/S1045-926X(05)80015-1
C.K. Gan, R.W. Donaldson, Adaptive silence deletion for speech storage and voice mail applications IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 36, pp. 924- 927 ,(1988) , 10.1109/29.1605
Barry Arons, SpeechSkimmer ACM Transactions on Computer-Human Interaction. ,vol. 4, pp. 3- 38 ,(1997) , 10.1145/244754.244758
Gary W. Heiman, Raphael J. Leo, Glenn Leighbody, Kathleen Bowler, Word intelligibility decrements and the comprehension of time-compressed speech Perception & Psychophysics. ,vol. 40, pp. 407- 411 ,(1986) , 10.3758/BF03208200
Dulce Ponceleon, Savitha Srinivasan, Arnon Amir, Dragutin Petkovic, Dan Diklic, Key to effective video retrieval: effective cataloging and browsing acm multimedia. pp. 99- 107 ,(1998) , 10.1145/290747.290760
Andrew Merlino, Daryl Morey, Mark Maybury, Broadcast news navigation using story segmentation acm multimedia. pp. 381- 391 ,(1997) , 10.1145/266180.266390
HongJiang Zhang, Chien Yong Low, Stephen W Smoliar, Jian Hua Wu, Video parsing, retrieval and browsing: an integrated and content-based solution acm multimedia. pp. 15- 24 ,(1995) , 10.1145/217279.215068
Nosa Omoigui, Liwei He, Anoop Gupta, Jonathan Grudin, Elizabeth Sanocki, Time-compression: systems concerns, usage, and benefits human factors in computing systems. pp. 136- 143 ,(1999) , 10.1145/302979.303017
Jonathan Foote, John Boreczhy, Andreas Girgensohn, Lynn Wilcox, An intelligent media browser using automatic multimodal analysis acm multimedia. pp. 375- 380 ,(1998) , 10.1145/290747.290804