Parametric method for tracking and analysing lip movements

作者: D. Shah , S. Marshall

DOI: 10.1007/978-1-4471-1597-7_25

关键词:

摘要: Telecommunication systems cover a wide range of techniques dealing from speaker recognition to speech recognition. Each component such system has an effect on its overall performance. The discussed in this paper is the visual feature extractor. This extractor deals with localising, tracking and extracting vital measurements. method used based Bayesian algorithms which are able model lip contour image intensities around mouth area. A further use method, inferring phonologically information can provide complementary data acoustic waveform enable improvement performance process at transmission stage.

参考文章(8)
Eric David Petajan, Automatic lipreading to enhance speech recognition (speech reading) University of Illinois at Urbana-Champaign. ,(1984)
Ulf Grenander, Daniel MacRae Keenan, Towards automated image understanding Journal of Applied Statistics. ,vol. 20, pp. 89- 103 ,(1989) , 10.1080/02664769300000060
Kathleen E. Finn, Allen A. Montgomery, Automatic optically-based recognition of speech Pattern Recognition Letters. ,vol. 8, pp. 159- 164 ,(1988) , 10.1016/0167-8655(88)90094-3
HARRY MCGURK, JOHN MACDONALD, Hearing lips and seeing voices Nature. ,vol. 264, pp. 746- 748 ,(1976) , 10.1038/264746A0
Michael Stoker, Fact, fiction and fraud Nature. ,vol. 264, pp. 126- 127 ,(1976) , 10.1038/264126B0
Alan L. Yuille, Peter W. Hallinan, David S. Cohen, Feature extraction from faces using deformable templates International Journal of Computer Vision. ,vol. 8, pp. 99- 111 ,(1992) , 10.1007/BF00127169
D. Shah, An image/speech relational database and its application IEE Colloquium on Integrated Audio-Visual Processing for Recognition, Synthesis and Communication. pp. 6- 6 ,(1996) , 10.1049/IC:19961150