Robust and Responsive Acoustic Pairing of Devices Using Decorrelating Time-Frequency Modelling

作者: Pablo Perez Zarazaga , Tom Buackstrom , Stephan Sigg

DOI: 10.23919/EUSIPCO.2019.8903125

关键词:

摘要: Voice user interfaces have increased in popularity, as they enable natural interaction with different applications using one’s voice. To improve their usability and audio quality, several devices could interact to provide a unified voice interface. However, cooperating sharing voice-related information, privacy may be at risk. Therefore, access management rules that preserve are important. State-of-the-art methods for acoustic pairing of fingerprinting based on the time-frequency representation signal error-correction. We propose use such authorise which acoustically close. aim obtain fingerprints ambient adapted requirements interfaces. Our experiments show responsiveness robustness is improved by combining overlapping windows decorrelating transforms.

参考文章(14)
Ton Kalker, Jaap Haitsma, A Highly Robust Audio Fingerprinting System. international symposium/conference on music information retrieval. ,(2002)
Guillaume Fuchs, Christian R. Helmrich, Goran Markovic, Matthias Neusinger, Emmanuel Ravelli, Takehiro Moriya, Low delay LPC and MDCT-based audio coding in the EVS codec international conference on acoustics, speech, and signal processing. pp. 5723- 5727 ,(2015) , 10.1109/ICASSP.2015.7179068
Moataz El Ayadi, Mohamed S. Kamel, Fakhri Karray, Survey on speech emotion recognition: Features, classification schemes, and databases Pattern Recognition. ,vol. 44, pp. 572- 587 ,(2011) , 10.1016/J.PATCOG.2010.09.020
Dominik Schürmann, S. Sigg, Secure Communication Based on Ambient Audio IEEE Transactions on Mobile Computing. ,vol. 12, pp. 358- 370 ,(2013) , 10.1109/TMC.2011.271
Wai-Tian Tan, Mary Baker, Bowon Lee, Ramin Samadani, The sound of silence international conference on embedded networked sensor systems. pp. 19- ,(2013) , 10.1145/2517351.2517362
Vijay Chandrasekhar, Matt Sharifi, David A. Ross, SURVEY AND EVALUATION OF AUDIO FINGERPRINTING SCHEMES FOR MOBILE QUERY-BY-EXAMPLE APPLICATIONS international symposium/conference on music information retrieval. pp. 801- 806 ,(2011)
Tom Bäckström, Florin Ghido, Johannes Fischer, Blind Recovery of Perceptual Models in Distributed Speech and Audio Coding. conference of the international speech communication association. pp. 2483- 2487 ,(2016) , 10.21437/INTERSPEECH.2016-27
Jacob Benesty, Mohan M. Sondhi, Yiteng Huang, Steven Greenberg, Springer Handbook of Speech Processing Journal of the Acoustical Society of America. ,vol. 126, pp. 2130- 2130 ,(2007) , 10.1121/1.3203918
Michael James Callaghan, Victor Bogdan Putinelu, Jeremy Ball, Jorge Caballero Salillas, Thibault Vannier, Augusto Gomez Eguíluz, Niall McShane, Practical Use of Virtual Assistants and Voice User Interfaces in Engineering Laboratories international conference on remote engineering and virtual instrumentation. pp. 660- 671 ,(2018) , 10.1007/978-3-319-64352-6_62
Phil Bartie, William Mackaness, Oliver Lemon, Tiphaine Dalmas, Srini Janarthanam, Robin L. Hill, Anna Dickinson, Xingkun Liu, A dialogue based mobile virtual assistant for tourists: The SpaceBook Project Computers, Environment and Urban Systems. ,vol. 67, pp. 110- 123 ,(2018) , 10.1016/J.COMPENVURBSYS.2017.09.010