An adaptive neuro-fuzzy inference system for the qualitative study of perceptual prominence in linguistics

Authors: Autilia Vitiello, Giovanni Acampora, Francesco Cutugno, Petra Wagner, Antonio Origlia

DOI: 10.1109/FUZZ-IEEE.2017.8015716

Abstract: This paper explores the application of fuzzy logic inference systems as an instrument for linguistic analysis in the domain of prosodic prominence. Understanding how acoustic features interact to make a linguistic unit be perceived as more relevant than the surrounding ones is generally needed to study the cognitive processes involved in speech understanding. It also has technological applications in the field of speech recognition and synthesis. We present a first experiment to show how fuzzy inference systems, being characterised by their capability to provide detailed insight into the models obtained through supervised learning, can help investigate the complex relationships among the acoustic features linked to prominence perception.
