Multichannel sound source dereverberation and separation for arbitrary number of sources based on Bayesian nonparametrics

Authors: Takuma Otsuka, Katsuhiko Ishiguro, Takuya Yoshioka, Hiroshi Sawada, Hiroshi G. Okuno

DOI: 10.1109/TASLP.2014.2363790

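Abstract: Multichannel signal processing using a microphone array provides fundamental functions for coping with multi-source situations, such as sound source localization and separation, that are needed to extract the auditory information for each source. Auditory uncertainties about the degree of reverberation and the number of sources are known to degrade performance or limit the practical application of microphone array processing. Such uncertainties must therefore be overcome to realize general and robust microphone array processing. These uncertainty issues have been partly addressed: existing methods focus on either the source number uncertainty or the reverberation issue, where joint separation and dereverberation has been achieved only for overdetermined conditions. This paper presents an all-round method that achieves source separation and dereverberation for an arbitrary number of sources, including underdetermined conditions. Our method uses Bayesian nonparametrics that realize an infinitely extensible modeling flexibility so as to bypass the model selection in the separation and dereverberation problem, which is caused by the source number uncertainty. Evaluation using a dereverberation and separation task with various numbers of sources, including underdetermined conditions, demonstrates that (1) our method is applicable to the separation and dereverberation of underdetermined mixtures, and (2) the source extraction performance is comparable to that of a state-of-the-art method suitable only for overdetermined conditions.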

References (34)
David J. Aldous, Exchangeability and related topics. Lecture Notes in Mathematics, vol. 1117, pp. 1-198 (1985). DOI: 10.1007/BFB0099421
Herbert Buchner, Walter Kellermann, TRINICON for Dereverberation of Speech and Audio Signals. Speech Dereverberation, pp. 311-385 (2010). DOI: 10.1007/978-1-84996-056-4_10
Pierre Comon, Christian Jutten, Handbook of Blind Source Separation: Independent Component Analysis and Applications. Academic Press, pp. 831- (2010).
Shoko Araki, Tomohiro Nakatani, Hiroshi Sawada, Shoji Makino, Stereo Source Separation and Source Counting with MAP Estimation with Dirichlet Prior Considering Spatial Aliasing Problem. International Conference on Independent Component Analysis and Signal Separation, vol. 5441, pp. 742-750 (2009). DOI: 10.1007/978-3-642-00599-2_93
Hiroshi Sawada, Shoko Araki, Shoji Makino, MLSP 2007 Data Analysis Competition: Frequency-Domain Blind Source Separation for Convolutive Mixtures of Speech/Audio Signals. International Workshop on Machine Learning for Signal Processing, pp. 45-50 (2007). DOI: 10.1109/MLSP.2007.4414280
David J. Aldous, Illdar A. Ibragimov, Jean Jacod, École d'été de probabilités de Saint-Flour XIII - 1983. Springer Berlin Heidelberg (1985). DOI: 10.1007/BFB0099420
Intae Lee, Taesu Kim, Te-Won Lee, Fast fixed-point independent vector analysis algorithms for convolutive blind source separation. Signal Processing, vol. 87, pp. 1859-1871 (2007). DOI: 10.1016/J.SIGPRO.2007.01.010
O. Yilmaz, S. Rickard, Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, vol. 52, pp. 1830-1847 (2004). DOI: 10.1109/TSP.2004.828896
M. Togami, Y. Kawaguchi, R. Takeda, Y. Obuchi, N. Nukaga, Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function. IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, pp. 1369-1380 (2013). DOI: 10.1109/TASL.2013.2250960
Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi, Hiroshi G. Okuno, Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization. IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, pp. 69-84 (2011). DOI: 10.1109/TASL.2010.2045183