High-rate data embedding in unvoiced speech.

作者: Gernot Kubin , Konrad Hofbauer

DOI:

关键词:

摘要: We propose a blind speech watermarking algorithm which allows high-rate embedding of digital side information into signals. exploit the fact that well-known LPC vocoder works very well for unvoiced speech. Using an auto-correlation based pitch tracking algorithm, voiced/unvoiced segmentation is carried out. In segments, linear prediction residual replaced by data sequence. This substitution does not cause perceptual degradation as long residual’s power matched. The signal resynthesised using unmodified filter coefficients. watermark decoded analysis received and extracted from sign residual. nearly imperceptible provides channel capacity up to 2000 bit/s in 8 kHzsampled signal.

参考文章(15)
Gernot Kubin, Martin Hagmuller, Horst Hering, Andreas Kropfl, Speech watermarking for air traffic control european signal processing conference. pp. 1653- 1656 ,(2004)
M. Celik, G. Sharma, A. Murat Tekalp, Pitch and duration modification for speech watermarking international conference on acoustics, speech, and signal processing. ,vol. 2, pp. 17- 20 ,(2005) , 10.1109/ICASSP.2005.1415330
Qiang Cheng, J. Sorensen, Spread spectrum signaling for speech watermarking international conference on acoustics, speech, and signal processing. ,vol. 3, pp. 1337- 1340 ,(2001) , 10.1109/ICASSP.2001.941175
Toshiyuki Sakai, Naohisa Komatsu, Digital watermarking based on process of speech production conference on security steganography and watermarking of multimedia contents. ,vol. 5306, pp. 127- 138 ,(2002) , 10.1117/12.526284
S. Sakaguchi, T. Arai, Y. Murahara, The effect of polarity inversion of speech on human perception and data hiding as an application international conference on acoustics, speech, and signal processing. ,vol. 2, pp. 917- 920 ,(2000) , 10.1109/ICASSP.2000.859110
L. Girin, S. Marchand, Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 633- 636 ,(2004) , 10.1109/ICASSP.2004.1326065
B. Chen, G.W. Wornell, Quantization index modulation: a class of provably good methods for digital watermarking and information embedding international symposium on information theory. ,vol. 47, pp. 1423- 1443 ,(2000) , 10.1109/18.923725
G. Kubin, B.S. Atal, W.B. Kleijn, Performance of noise excitation for unvoiced speech ieee workshop on speech coding for telecommunications. pp. 35- 36 ,(1993) , 10.1109/SCFT.1993.762326
Peter Jax, Peter Vary, Bernd Geiser, Artificial bandwidth extension of speech supported by watermark-transmitted side information. conference of the international speech communication association. pp. 1497- 1500 ,(2005)