Learning action descriptors for recognition

作者: M.J. Marin-Jimenez , N. Perez de la Blanca , M.A. Mendoza , M. Lucena , J.M. Fuertes

DOI: 10.1109/WIAMIS.2009.5031418

关键词:

摘要: This paper evaluates different Restricted Boltzmann Machines models in unsupervised, semi-supervised and supervised frameworks using information from human actions. After feeding these multilayer with low level features, we infer high-level discriminating features that highly improve the classification performance. approach eliminates difficult process of selecting good mid-level feature descriptors, changing selection extraction by a learning stage. Two main contributions are presented. First, new sequence-descriptor accumulated histograms optical flow (aHOF) is Second, comparative results experiments shown. The show RBM architectures provide very our task present properties for learning.

参考文章(16)
Geoffrey E. Hinton, Ruslan Salakhutdinov, Learning a Nonlinear Embedding by Preserving Class Neighbourhood Structure international conference on artificial intelligence and statistics. pp. 412- 419 ,(2007)
Gunnar Farnebäck, Two-frame motion estimation based on polynomial expansion scandinavian conference on image analysis. ,vol. 2749, pp. 363- 370 ,(2003) , 10.1007/3-540-45103-X_50
Hugo Larochelle, Yoshua Bengio, Classification using discriminative restricted Boltzmann machines Proceedings of the 25th international conference on Machine learning - ICML '08. pp. 536- 543 ,(2008) , 10.1145/1390156.1390224
Harmonium Models for Video Classification Statistical Analysis and Data Mining. ,vol. 1, pp. 23- 37 ,(2008) , 10.1002/SAM.V1:1
C. Schuldt, I. Laptev, B. Caputo, Recognizing human actions: a local SVM approach international conference on pattern recognition. ,vol. 3, pp. 32- 36 ,(2004) , 10.1109/ICPR.2004.747
Geoffrey E Hinton, Ruslan R Salakhutdinov, Reducing the Dimensionality of Data with Neural Networks Science. ,vol. 313, pp. 504- 507 ,(2006) , 10.1126/SCIENCE.1127647
E. Shechtman, M. Irani, Space-Time Behavior-Based Correlation-OR-How to Tell If Two Underlying Motion Fields Are Similar Without Computing Them? IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 29, pp. 2045- 2056 ,(2007) , 10.1109/TPAMI.2007.1119
Geoffrey E. Hinton, Training products of experts by minimizing contrastive divergence Neural Computation. ,vol. 14, pp. 1771- 1800 ,(2002) , 10.1162/089976602760128018
Nobuyuki Otsu, A Threshold Selection Method from Gray-Level Histograms IEEE Transactions on Systems, Man, and Cybernetics. ,vol. 9, pp. 62- 66 ,(1979) , 10.1109/TSMC.1979.4310076
Alireza Fathi, Greg Mori, Action recognition by learning mid-level motion features computer vision and pattern recognition. pp. 1- 8 ,(2008) , 10.1109/CVPR.2008.4587735