CNN-Based Multistage Gated Average Fusion (MGAF) for Human Action Recognition Using Depth and Inertial Sensors

作者: Zeeshan Ahmad , Naimul Khan

DOI: 10.1109/JSEN.2020.3028561

关键词:

摘要: Convolutional Neural Network (CNN) provides leverage to extract and fuse features from all layers of its architecture. However, extracting fusing intermediate different CNN structure is still uninvestigated for Human Action Recognition (HAR) using depth inertial sensors. To get maximum benefit accessing the CNN’s layers, in this paper, we propose novel Multistage Gated Average Fusion (MGAF) network which extracts fuses our computationally efficient (GAF) network, a decisive integral element MGAF. At input proposed MGAF, transform sensor data into images called sequential front view (SFI) signal (SI) respectively. These SFI are formed information generated by data. employed feature maps both modalities. GAF extracted effectively while preserving dimensionality fused as well. The MGAF has structural extensibility can be unfolded more than two Experiments on three publicly available multimodal HAR datasets demonstrate that outperforms previous state-of-the-art fusion methods depth-inertial terms recognition accuracy being much efficient. We increase an average 1.5% reducing computational cost approximately 50% over state-of-art.

参考文章(59)
Thomas Plötz, Nils Y. Hammerla, Patrick Olivier, Feature learning for activity recognition in ubiquitous computing international joint conference on artificial intelligence. pp. 1729- 1734 ,(2011) , 10.5591/978-1-57735-516-8/IJCAI11-290
Mohammad Farhad Bulbul, Yunsheng Jiang, Jinwen Ma, DMMs-Based Multiple Features Fusion for Human Action Recognition International Journal of Multimedia Data Engineering and Management. ,vol. 6, pp. 23- 39 ,(2015) , 10.4018/IJMDEM.2015100102
Allen Y. Yang, Roozbeh Jafari, S. Shankar Sastry, Ruzena Bajcsy, Distributed recognition of human actions using wearable motion sensor networks ambient intelligence. ,vol. 1, pp. 103- 115 ,(2009) , 10.3233/AIS-2009-0016
Mehdi Alirezanejad, Vahid Saffari, Saeed Amirgholipour, Aboosaleh Mohammad Sharifi, None, Effect of Locations of Using High Boost Filtering on the Watermark Recovery in Spatial Domain Watermarking Indian journal of science and technology. ,vol. 7, pp. 517- 524 ,(2014) , 10.17485/IJST/2014/V7I4/48643
Zhuowei Cai, Limin Wang, Xiaojiang Peng, Yu Qiao, Multi-view Super Vector for Action Recognition computer vision and pattern recognition. pp. 596- 603 ,(2014) , 10.1109/CVPR.2014.83
Soumitra Samanta, Bhabatosh Chanda, Space-Time Facet Model for Human Activity Classification IEEE Transactions on Multimedia. ,vol. 16, pp. 1525- 1535 ,(2014) , 10.1109/TMM.2014.2326734
Noridayu Manshor, Alfian Abdul Halin, Mandava Rajeswari, Dhanesh Ramachandram, Feature selection via dimensionality reduction for object class recognition international conference on instrumentation, communications, information technology, and biomedical engineering. pp. 223- 227 ,(2011) , 10.1109/ICICI-BME.2011.6108645
Xiaodong Yang, Chenyang Zhang, YingLi Tian, Recognizing actions using depth motion maps-based histograms of oriented gradients Proceedings of the 20th ACM international conference on Multimedia - MM '12. pp. 1057- 1060 ,(2012) , 10.1145/2393347.2396382
Kui Liu, Chen Chen, Roozbeh Jafari, Nasser Kehtarnavaz, Fusion of Inertial and Depth Sensor Data for Robust Hand Gesture Recognition IEEE Sensors Journal. ,vol. 14, pp. 1898- 1903 ,(2014) , 10.1109/JSEN.2014.2306094