Speech/non-speech detection in Malay language spontaneous speech

作者: M. Izzad , Nursuriati Jamil , Zainab Abu Bakar

DOI: 10.1109/COMMANTEL.2013.6482394

关键词:

摘要: The goal of this work is to discriminate speech and non-speech segments in Malay language spontaneous as speech/non-speech detection important many processing applications. Inaccurate sentence boundaries are a major cause errors automatic recognition preprocessing stage that the signal into periods invaluable improving accuracy. We proposed combination three audio features energy, zero crossing rate (ZCR) fundamental frequency (F0) for each feature has unique properties differentiate segments. Experiments conducted on one-hour consisting more than 20,000 An accuracy evaluation reveals method achieved 97.8% rate. Non-speech will further be used candidates boundary our next experiment.

参考文章(15)
Francis Kubala, Amit Srivastava, Sentence boundary detection in arabic speech. conference of the international speech communication association. ,(2003)
R.G. Bachu, S. Kopparthi, B. Adapa, B.D. Barkana, Voiced/Unvoiced Decision for Speech Signals Based on Zero-Crossing Rate and Energy atcs. pp. 279- 282 ,(2010) , 10.1007/978-90-481-3660-5_47
L. R. Rabiner, M. R. Sambur, Algorithm for determining the endpoints of isolated utterances The Journal of the Acoustical Society of America. ,vol. 56, pp. S31- S31 ,(1974) , 10.1121/1.1914118
L. R. Rabiner, M. R. Sambur, An Algorithm for Determining the Endpoints of Isolated Utterances Bell System Technical Journal. ,vol. 54, pp. 297- 315 ,(1975) , 10.1002/J.1538-7305.1975.TB02840.X
Mojtaba Radmard, Mahdi Hadavi, Mohammad Mahdi Nayebi, A New Method of Voiced/Unvoiced Classification Based on Clustering Journal of Signal and Information Processing. ,vol. 02, pp. 336- 347 ,(2011) , 10.4236/JSIP.2011.24048
Alain de Cheveigné, Hideki Kawahara, YIN, a fundamental frequency estimator for speech and music The Journal of the Acoustical Society of America. ,vol. 111, pp. 1917- 1930 ,(2002) , 10.1121/1.1458024
Yang Liu, Andreas Stolcke, Elizabeth Shriberg, Mary Harper, Using Conditional Random Fields for Sentence Boundary Detection in Speech meeting of the association for computational linguistics. pp. 451- 458 ,(2005) , 10.3115/1219840.1219896
Huiqun Deng, Douglas O'Shaughnessy, Voiced-Unvoiced-Silence Speech Sound Classification Based on Unsupervised Learning international conference on multimedia and expo. pp. 176- 179 ,(2007) , 10.1109/ICME.2007.4284615
A.P. Lobo, P.C. Loizou, Voiced/unvoiced speech discrimination in noise using Gabor atomic decomposition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 820- 823 ,(2003) , 10.1109/ICASSP.2003.1198907
Noraini Seman, Zainab Abu Bakar, Nordin Abu Bakar, An evaluation of endpoint detection measures for malay speech recognition of an isolated words 2010 International Symposium on Information Technology. ,vol. 3, pp. 1628- 1635 ,(2010) , 10.1109/ITSIM.2010.5561618