Auto-summarization of audio-video presentations

Authors: Liwei He, Elizabeth Sanocki, Anoop Gupta, Jonathan Grudin

DOI: 10.1145/319463.319691

Abstract: As streaming audio-video technology becomes widespread, there is a dramatic increase in the amount of multimedia content available on the net. Users face a new challenge: how to examine large amounts of multimedia content quickly. One technique that can enable a quick overview of multimedia is video summaries; that is, a shorter version assembled by picking important segments from the original. We evaluate three techniques for automatic creation of summaries for online audio-video presentations. These techniques exploit information in the audio signal (e.g., pitch and pause information), knowledge of slide transition points in the presentation, and information about access patterns of previous users. We report a user study that compares automatically generated summaries that are 20%-25% the length of full presentations to author-generated summaries. Users learn from the computer-generated summaries, although less than from authors' summaries. They initially find computer-generated summaries less coherent, but quickly grow accustomed to them.
