Video Skimming for Quick Browsing based on Audio and Image Characterization

作者: Michael A Smith , Takeo Kanade

DOI:

关键词: EntertainmentVideo trackingHost (network)Agency (sociology)World Wide WebVideo processingComputer scienceSpace (commercial competition)Video captureStructure (mathematical logic)Multimedia

摘要: Digital video is rapidly becoming an important source for information, entertainment and a host of multimedia applications. With the size these collections growing to thousands hours, technology needed effectively browse segments in short time without losing content video. We propose method extract significant audio information create “skim” which represents synopsis original. The extraction such as specific objects, keywords relevant structure, made possible through integration techniques image language understanding. resulting skim much smaller, retains essential original segment. This research sponsored by National Science Foundation under grant no. IRI9411299, Space Aeronautics Administration, Advanced Research Projects Agency. views conclusions contained this document are those authors should not be interpreted necessarily representing official policies or endorsements, either expressed implied, United States Government.

参考文章(15)
Michael Loren Mauldin, None, Information retrieval by text skimming Carnegie Mellon University. ,(1989)
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)
Scott Stevens, Michael Christen, Howard Wactlar, Informedia: improving access to digital video Interactions. ,vol. 1, pp. 67- 71 ,(1994) , 10.1145/194283.194311
Hongjiang Zhang, Chien Yong Low, Stephen W. Smoliar, Video parsing and browsing using compressed data Multimedia Tools and Applications. ,vol. 1, pp. 89- 111 ,(1995) , 10.1007/BF01261227
Y. Tonomura, A. Akutsu, Y. Taniguchi, G. Suzuki, Structured Video Computing IEEE Multimedia. ,vol. 1, pp. 34- 349 ,(1994) , 10.1109/MMUL.1994.318984
HongJiang Zhang, Atreyi Kankanhalli, Stephen W. Smoliar, Automatic partitioning of full-motion video Multimedia Systems. ,vol. 1, pp. 10- 28 ,(1993) , 10.1007/BF01210504
Farshid Arman, Arding Hsu, Ming-Yee Chiu, Image processing on encoded video sequences Multimedia Systems. ,vol. 1, pp. 211- 219 ,(1994) , 10.1007/BF01268945
Leo Degen, Richard Mander, Gitta Salomon, Working with audio Proceedings of the SIGCHI conference on Human factors in computing systems - CHI '92. pp. 413- 418 ,(1992) , 10.1145/142750.142877
Arun Hampapur, Ramesh Jain, Terry E Weymouth, Production model based digital video segmentation Multimedia Tools and Applications. ,vol. 1, pp. 9- 46 ,(1995) , 10.1007/BF01261224