An incremental technique for real-time bioacoustic signal segmentation

作者: Juan Gabriel Colonna , Marco Cristo , Mario Salvatierra , Eduardo Freire Nakamura

DOI: 10.1016/J.ESWA.2015.05.030

关键词: Unsupervised learningPrecision and recallSegmentationWireless sensor networkScale-space segmentationBinary classificationComputer scienceZero-crossing rateSpeech recognition

摘要: An incremental transformation of ZCR and energy without using temporal windows.With our method is possible to save memory transmission costs.Solution process large amounts data by resource-constrained devices as WSN. A bioacoustical animal recognition system composed two parts: (1) the segmenter, responsible for detecting syllables (animal vocalization) in audio; (2) classifier, which determines species/animal whose belong to. In this work, we first present a novel technique automatic segmentation anuran calls real time; then, assess performance whole system. The proposed performs an unsupervised binary classification time series (audio) that incrementally computes exponentially-weighted features (Energy Zero Crossing Rate). proposal, classical sliding windows are replaced with counters give higher weights new data, allowing us distinguish between syllable ambient noise (considered silences). Compared sliding-window approaches, associated cost proposal lower, processing speed higher. Our evaluation component considers three metrics: Matthews Correlation Coefficient point-to-point comparison; WinPR quantify precision boundaries; (3) AEER event-to-event counting. experiments were carried out dataset 896 seven different species anurans. To evaluate system, derived four equations helps understand impact recall has on task. Finally, show segmentation/recognition improvement 37%, while reducing communication. Therefore, results suggest suitable systems, such Wireless Sensor Networks (WSNs).

参考文章(37)
João Gama, Pedro Pereira Rodrigues, Learning from Data Streams Encyclopedia of Data Warehousing and Mining. pp. 1137- 1141 ,(2007) , 10.4018/978-1-60566-010-3.CH176
David Martin Ward Powers, None, Evaluation: from Precision, Recall and F-measure to ROC, Informedness, Markedness and Correlation arXiv: Learning. ,vol. 2, pp. 37- 63 ,(2011)
Ghazal Jaber, An approach for online learning in the presence of concept changes Université Paris Sud - Paris XI. ,(2013)
Shafiullah Khan, Al-Sakib Khan Pathan, Nabil Ali Alrajeh, None, Wireless Sensor Networks: Current Status and Future Trends ,(2018)
Jie Xie, Michael Towsey, Anthony Truskinger, Philip Eichinski, Jinglan Zhang, Paul Roe, Acoustic classification of Australian anurans using syllable features international conference on intelligent sensors sensor networks and information processing. pp. 1- 6 ,(2015) , 10.1109/ISSNIP.2015.7106924
Mark D. Plumbley, Emmanouil Benetos, Mathieu Lagrange, Dimitrios Giannoulis, Mathias Rossignol, Dan Stowell, A database and challenge for acoustic scene classification and event detection european signal processing conference. pp. 1- 5 ,(2013)
N. Garcia, E. Macias-Toro, J.F. Vargas-Bonilla, J.M. Daza, J.D. Lopez, Segmentation of bio-signals in field recordings using fundamental frequency detection 3rd IEEE International Work-Conference on Bioinspired Intelligence. pp. 86- 92 ,(2014) , 10.1109/IWOBI.2014.6913944
Thiago L.F. Evangelista, Thales M. Priolli, Carlos N. Silla, Bruno A. Angelico, Celso A.A. Kaestner, Automatic Segmentation of Audio Signals for Bird Species Identification international symposium on multimedia. pp. 223- 228 ,(2014) , 10.1109/ISM.2014.46
Seppo Fagerlund, Bird species recognition using support vector machines EURASIP Journal on Advances in Signal Processing. ,vol. 2007, pp. 64- 64 ,(2007) , 10.1155/2007/38637