SPEAKER LOCALIZATION EXPLOITING SPATIAL-TEMPORAL INFORMATION

作者: Sharon Gannot , Tsvi Gregory Dvorkind

DOI:

关键词:

摘要: Determining the spatial position of a speaker finds growing interest in video conference scenario where automated camera steering and tracking are required. Speaker localization can be achieved with dual step approach. In preliminary stage microphone array is used to extract time difference arrival (TDOA) speech signal. These readings then by second for actual localization. Since trajectory must smooth, estimates close positions might improve current estimate. However, many methods, although exploiting information obtained dierent pairs, do not exploit this temporal information. contribution we present two schemes, which The first well known extended Kalman filter (EKF). recursive form Gauss method, denote Recursive (RG). Experimental study supports potential proposed methods.

参考文章(6)
T. Dvorkind, S. Gannot, Speaker localization in a reverberant environment convention of electrical and electronics engineers in israel. pp. 7- 9 ,(2002) , 10.1109/EEEI.2002.1178291
J. Smith, J. Abel, Closed-form least-squares source location estimation from range-difference measurements IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 35, pp. 1661- 1669 ,(1987) , 10.1109/TASSP.1987.1165089
C. Knapp, G. Carter, The generalized correlation method for estimation of time delay IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 24, pp. 320- 327 ,(1976) , 10.1109/TASSP.1976.1162830
M.S. Brandstein, J.E. Adcock, H.F. Silverman, A closed-form location estimator for use with room environment microphone arrays IEEE Transactions on Speech and Audio Processing. ,vol. 5, pp. 45- 50 ,(1997) , 10.1109/89.554268
Yiteng Huang, J. Benesty, G.W. Elko, R.M. Mersereati, Real-time passive source localization: a practical linear-correction least-squares approach IEEE Transactions on Speech and Audio Processing. ,vol. 9, pp. 943- 956 ,(2001) , 10.1109/89.966097