MPEG-4: Audio/video and synthetic graphics/audio for mixed media

作者: Peter K. Doenges , Tolga K. Capin , Fabio Lavagetto , Joern Ostermann , Igor S. Pandzic

DOI: 10.1016/S0923-5965(97)00007-6

关键词:

摘要: Abstract MPEG-4 addresses coding of digital hybrids natural and synthetic, aural visual (A/V) information. The objective this synthetic/natural hybrid (SNHC) is to facilitate content-based manipulation, interoperability, wider user access in the delivery animated mixed media. SNHC will support non-real-time passive media delivery, as well more interactive, real-time applications. Integrated spatial-temporal sought for audio, video, 2D/3D computer graphics standardized A/V objects. Targets standardization include mesh-segmented video coding, compression geometry, synchronization between objects, multiplexing streamed integration types. Composition, interactivity, scripting objects can thus be supported client terminals, content production servers, also effectively enabling terminals servers. Such exhibit high efficiency transmission storage, plus scalability, combinations transient dynamic data persistent downloaded data. This approach lower bandwidth media, offer tradeoffs quality versus update specific foster varied distribution methods that exploit spatial temporal coherence over buses networks. responds trends at home work move beyond paradigm audio/video a experience flexible which combine with synthetic audio.

参考文章(28)
Eric David Petajan, Automatic lipreading to enhance speech recognition (speech reading) University of Illinois at Urbana-Champaign. ,(1984)
Models and Techniques in Computer Animation Springer Verlag, Tokyo New York Berlin Heidelberg. ,(2014) , 10.1007/978-4-431-66911-1
Frederic Ira Parke, A parametric model for human faces. The University of Utah. ,(1974)
Norman I. Badler, Cary B. Phillips, Bonnie Lynn Webber, Simulating humans: computer graphics animation and control Simulating humans: computer graphics animation and control. pp. 270- 270 ,(1993) , 10.1093/OSO/9780195073591.001.0001
Michael M. Cohen, Dominic W. Massaro, Modeling Coarticulation in Synthetic Visual Speech Models and Techniques in Computer Animation. pp. 139- 156 ,(1993) , 10.1007/978-4-431-66911-1_13
N Magnenat Thalmann, T Çapin, I Pandzic, D Thalmann, Participant, User-Guided and Autonomous Actors in the Virtual Life Network VLNET Proc. ICAT/VRTS 95. pp. 3- 11 ,(1995)
E. Petajan, B. Bischoff, D. Bodoff, N. M. Brooke, An improved automatic lipreading system to enhance speech recognition Proceedings of the SIGCHI conference on Human factors in computing systems - CHI '88. pp. 19- 25 ,(1988) , 10.1145/57167.57170
Michael R. Macedonia, Michael J. Zyda, David R. Pratt, Paul T. Barham, Steven Zeswitz, Npsnet: A network software architecture for largescale virtual environments Presence: Teleoperators & Virtual Environments. ,vol. 3, pp. 265- 287 ,(1994) , 10.1162/PRES.1994.3.4.265
Gabriel Taubin, Jarek Rossignac, Geometric compression through topological surgery ACM Transactions on Graphics. ,vol. 17, pp. 84- 115 ,(1998) , 10.1145/274363.274365
Frederic I. Parke, A model for human faces that allows speech synchronized animation Computers & Graphics. ,vol. 1, pp. 3- 4 ,(1975) , 10.1016/0097-8493(75)90024-2