On Distant Speech Recognition for Home Automation

Authors: Michel Vacher, Benjamin Lecouteux, François Portet

DOI: 10.1007/978-3-319-16226-3_7

Keywords: Engineering, Activities of daily living, Set (psychology), Home automation, Augmentative and alternative communication, Speech recognition, Sentence, Voice activity detection, Decoding methods, Voice command device

Abstract: In the framework of Ambient Assisted Living, home automation may be a solution for helping elderly people living alone at home. This study is part of the Sweet-Home project, which aims at developing a new home automation system based on voice command to improve support and well-being for people in loss of autonomy. The goal of the study is vocal order recognition, with a focus on two aspects: distant speech recognition and sentence spotting. Several ASR techniques were evaluated on a realistic corpus acquired in a 4-room flat equipped with microphones set in the ceiling. This distant speech French corpus was recorded with 21 speakers who acted out scenarios of activities of daily living. Techniques acting at the decoding stage, such as our novel approach called Driven Decoding Algorithm (DDA), gave better speech recognition results than the baseline and other approaches. This approach, which uses the best SNR channels and a priori knowledge (voice commands and distress sentences), has demonstrated an increase in recognition rate without introducing false alarms. Generally speaking, a short overview then allows us to outline the research challenges that speech technologies must take up for Ambient Assisted Living and Augmentative and Alternative Communication, and the current research avenues in this domain.
