Content-prioritised video coding for British Sign Language communication.

Author: Laura Joy Muir

DOI:

Keywords:

Abstract: Video communication of British Sign Language (BSL) is important for remote interpersonal communication and for the equal provision of services for deaf people. However, the use of video telephony and video conferencing applications for BSL communication is limited by inadequate video quality. BSL is a highly structured, linguistically complete, natural language system that expresses vocabulary and grammar visually and spatially, using a complex combination of facial expressions (such as eyebrow movements, eye blinks and mouth/lip shapes), hand gestures, body movements and finger-spelling that change in space and time. Accurate, natural BSL communication places specific demands on visual media applications, which must compress video image data for efficient transmission. Current video compression schemes apply methods to reduce statistical redundancy and perceptual irrelevance in video image data, based on a general model of Human Visual System (HVS) sensitivities. This thesis presents novel video image coding methods developed to achieve the conflicting requirements of high image quality and efficient coding. Novel methods of prioritising visually important video image content for optimised video coding are developed to exploit the HVS spatial and temporal response mechanisms of BSL users (determined by Eye Movement Tracking) and the characteristics of BSL video image content. The methods implement an accurate model of HVS foveation, applied in the spatial and temporal domains, at the pre-processing stage of a current standard-based system (H.264). Comparison of the performance of the developed and standard coding systems, using methods developed for video quality evaluation in this thesis, demonstrates improved perceived quality at low bit rates. BSL users, broadcasters and service providers benefit from the perception of high quality video over a range of available transmission bandwidths. The research community benefits from a new approach to video coding optimisation and a better understanding of the communication needs of BSL users.
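As a rough illustration of what a spatial foveation model looks like at a pre-processing stage, the sketch below computes how much spatial detail is worth keeping at each pixel as a function of its angular distance from a fixation point. It is not the thesis's model: the formula and the constants `ALPHA`, `E2` and `CT0` are values commonly quoted for the Geisler and Perry contrast-threshold model and are assumed here, and the function names, the `pixels_per_degree` parameter and the small-angle conversion from pixels to degrees are all hypothetical choices made for the example.

```python
import math

# Assumed constants (commonly quoted for the Geisler-Perry model, not taken from the thesis).
ALPHA = 0.106     # spatial-frequency decay constant
E2 = 2.3          # half-resolution eccentricity, in degrees
CT0 = 1.0 / 64.0  # minimum contrast threshold


def eccentricity_deg(x, y, fix_x, fix_y, pixels_per_degree):
    """Approximate angular distance (degrees) of pixel (x, y) from the fixation point.

    Uses a small-angle approximation: pixel distance divided by pixels per degree.
    """
    return math.hypot(x - fix_x, y - fix_y) / pixels_per_degree


def cutoff_frequency(ecc_deg):
    """Highest spatial frequency (cycles/degree) assumed visible at this eccentricity."""
    return E2 * math.log(1.0 / CT0) / (ALPHA * (ecc_deg + E2))


def relative_bandwidth(x, y, fix_x, fix_y, pixels_per_degree):
    """Fraction (0..1] of the display's spatial bandwidth worth keeping at (x, y).

    The display cannot show more than its Nyquist limit (pixels_per_degree / 2),
    so the result is clamped to 1.0 near the fixation point.
    """
    nyquist = pixels_per_degree / 2.0
    ecc = eccentricity_deg(x, y, fix_x, fix_y, pixels_per_degree)
    return min(1.0, cutoff_frequency(ecc) / nyquist)


if __name__ == "__main__":
    # Hypothetical setup: fixation at the centre of a 352x288 frame, 30 pixels per degree.
    for px in (176, 236, 351):
        print(px, round(relative_bandwidth(px, 144, 176, 144, 30.0), 3))
```

A pre-filter could use such a map to low-pass regions far from the signer's face (where the eye-tracking results would place the fixation point) more strongly before the frame is passed to a standard encoder, so that fewer bits are spent on detail the viewer is assumed not to resolve.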

References (48)
Egon Guba, Manfred Knemeyer, Willavene Wolf, Sybil de Groot, Ralph Van Atta, Larry Light, Eye movements and TV viewing in children. AV Communication Review, vol. 12, pp. 386-401 (1964). DOI: 10.1007/BF02768694
Jens Riegelsberger, Martina Angela Sasse, John D. McCarthy, Eye-Catcher or Blind Spot? The Effect of Photographs of Faces on eCommerce Sites. I3E '02 Proceedings of the IFIP Conference on Towards The Knowledge Society: E-Commerce, E-Business, E-Government, pp. 383-398 (2002)
Abdul H. Sadka, Compressed Video Communications (2002)
Arthur P. Ginsburg, William R. Hendee, Quantification of Visual Capability. Springer, New York, NY, pp. 52-72 (1993). DOI: 10.1007/978-1-4612-1836-4_3
Robert J. K. Jacob, New Human-Computer Interaction Techniques. Springer Berlin Heidelberg, pp. 131-138 (1994). DOI: 10.1007/978-3-642-85104-9_15
Gillian M. Wilson, M. Angela Sasse, Do Users Always Know What’s Good For Them? Utilising Physiological Responses to Assess Media Quality. People and Computers XIV - Usability or Else!, pp. 327-339 (2000). DOI: 10.1007/978-1-4471-0515-2_22
Charles A. Kelsey, Detection of Visual Information. Springer New York, pp. 30-51 (1993). DOI: 10.1007/978-1-4757-6769-8_2
Andrew B. Watson, Digital Images and Human Vision. MIT Press (1993)
Shin'ya Nishida, Timothy Ledgeway, Mark Edwards, Dual multiple-scale processing for motion in the human visual system. Vision Research, vol. 37, pp. 2685-2698 (1997). DOI: 10.1016/S0042-6989(97)00092-8