Face analysis for the synthesis of photo-realistic talking heads

作者: H.P. Graf , E. Cosatto , T. Ezzat

DOI: 10.1109/AFGR.2000.840633

关键词:

摘要: This paper describes techniques for extracting bitmaps of facial parts from videos a talking person. The goal is to synthesize photo-realistic heads high quality that show picture-perfect appearance and realistic head movements with good lip-sound synchronization. For the synthesis head, are combined form whole then sequences such images integrated audio text-to-speech synthesizer. seamless integration into an animation, their shape visual must be known accuracy. recognition system has find not only locations features, but also able determine head's orientation recognize expressions. Our face proceeds in multiple steps, each increased precision. Using motion, color information, position location main features determined first. Then smaller areas searched matched filters, order identify specific From this information 3D calculated. Facial cut image and, using orientation, warped 'normalized' scale.

参考文章(10)
Keith Waters, Frederic I. Parke, Computer Facial Animation ,(1996)
E. Cosatto, H.P. Graf, Sample-based synthesis of photo-realistic talking heads Proceedings Computer Animation '98 (Cat. No.98EX169). pp. 103- 110 ,(1998) , 10.1109/CA.1998.681914
M.S. El-Nasr, T.R. Ioerger, J. Yen, D.H. House, F.I. Parke, Emotionally expressive agents Proceedings Computer Animation 1999. pp. 48- 57 ,(1999) , 10.1109/CA.1999.781198
Denis Oberkampf, Daniel F. DeMenthon, Larry S. Davis, Iterative Pose Estimation Using Coplanar Feature Points Computer Vision and Image Understanding. ,vol. 63, pp. 495- 511 ,(1996) , 10.1006/CVIU.1996.0037
Frederic Pighin, Jamie Hecker, Dani Lischinski, Richard Szeliski, David H. Salesin, Synthesizing realistic facial expressions from photographs ACM SIGGRAPH 2006 Courses on - SIGGRAPH '06. pp. 75- 84 ,(2006) , 10.1145/1185657.1185859
T. Ezzat, T. Poggio, MikeTalk: a talking facial display based on morphing visemes Proceedings Computer Animation '98 (Cat. No.98EX169). pp. 96- 102 ,(1998) , 10.1109/CA.1998.681913
Christoph Bregler, Michele Covell, Malcolm Slaney, Video Rewrite: driving visual speech with audio international conference on computer graphics and interactive techniques. pp. 353- 360 ,(1997) , 10.1145/258734.258880
J. Ostermann, Animation of synthetic faces in MPEG-4 Proceedings Computer Animation '98 (Cat. No.98EX169). pp. 49- 55 ,(1998) , 10.1109/CA.1998.681907
Brian Guenter, Cindy Grimm, Daniel Wood, Henrique Malvar, Fredric Pighin, None, Making faces ACM SIGGRAPH 2006 Courses on - SIGGRAPH '06. pp. 55- 66 ,(2006) , 10.1145/1185657.1185858
J. L. Barron, D. J. Fleet, S. S. Beauchemin, Performance of optical flow techniques International Journal of Computer Vision. ,vol. 12, pp. 43- 77 ,(1994) , 10.1007/BF01420984