uulmMAD – A Human Action Recognition Dataset for Ground-Truth Evaluation and Investigation of View Invariances

作者: Michael Glodek , Georg Layher , Felix Heilemann , Florian Gawrilowicz , Günther Palm

DOI: 10.1007/978-3-319-14899-1_8

关键词: Benchmark (computing)Focus (optics)Ground truthBaseline (configuration management)Pattern recognitionComputer scienceAction (philosophy)Artificial intelligencePerspective (graphical)Noise (video)Pattern recognition (psychology)

摘要: In recent time, human action recognition has gained increasing attention in pattern recognition. However, many datasets the literature focus on a limited number of target-oriented properties. Within this work, we present novel dataset, named uulmMAD, which been created to benchmark state-of-the-art architectures addressing multiple properties, e.g. high-resolutions cameras, perspective changes, realistic cluttered background and noise, overlap classes, different execution speeds, variability subjects their clothing, availability pose ground-truth. The uulmMAD was recorded using three synchronized high-resolution cameras an inertial motion capturing system. Each subject performed fourteen actions at least times front green screen. Selected four variants were recorded, i.e. normal, pausing, fast deceleration. data post-processed order separate from background. Furthermore, camera have mapped onto each other 3D-avatars generated further extend dataset. avatars also used emulate self-occlusion when time-of-flight camera. analyze architecture provide first baseline results. results emphasize unique characteristics dataset will be made publicity available upon publication paper.

参考文章(28)
Michael Glodek, Stephan Reuter, Martin Schels, Klaus Dietmayer, Friedhelm Schwenker, Kalman Filter Based Classifier Fusion for Affective State Recognition multiple classifier systems. pp. 85- 94 ,(2013) , 10.1007/978-3-642-38067-9_8
Per Slycke, Daniel Roetenberg, Henk Luinge, Xsens MVN: Full 6DOF Human Motion Tracking Using Miniature Inertial Sensors ,(2009)
Du Tran, Alexander Sorokin, Human Activity Recognition with Metric Learning Lecture Notes in Computer Science. pp. 548- 561 ,(2008) , 10.1007/978-3-540-88682-2_42
Georg Layher, Martin A. Giese, Heiko Neumann, Learning Representations of Animated Motion Sequences—A Neural Model Topics in Cognitive Science. ,vol. 6, pp. 170- 182 ,(2014) , 10.1111/TOPS.12075
Stefan Scherer, Michael Glodek, Friedhelm Schwenker, Nick Campbell, Günther Palm, Spotting laughter in natural multiparty conversations ACM Transactions on Interactive Intelligent Systems. ,vol. 2, pp. 1- 31 ,(2012) , 10.1145/2133366.2133370
Maria-Jose Escobar, Guillaume S. Masson, Thierry Vieville, Pierre Kornprobst, Action Recognition Using a Bio-Inspired Feedforward Spiking Network International Journal of Computer Vision. ,vol. 82, pp. 284- 301 ,(2009) , 10.1007/S11263-008-0201-1
Alvy Ray Smith, James F. Blinn, Blue screen matting international conference on computer graphics and interactive techniques. pp. 259- 268 ,(1996) , 10.1145/237170.237263
Kishore K. Reddy, Mubarak Shah, Recognizing 50 human action categories of web videos Machine Vision and Applications. ,vol. 24, pp. 971- 981 ,(2013) , 10.1007/S00138-012-0450-4
J.K. Aggarwal, M.S. Ryoo, Human activity analysis: A review ACM Computing Surveys. ,vol. 43, pp. 16- ,(2011) , 10.1145/1922649.1922653