Improved Generalization of Heading Direction Estimation for Aerial Filming Using Semi-supervised Regression

作者: Sebastian Scherer , Yanfu Zhang , Wenshan Wang , Rogerio Bonatti , Aayush Ahuja

DOI:

关键词:

摘要: In the task of Autonomous aerial filming a moving actor (e.g. person or vehicle), it is crucial to have good heading direction estimation for from visual input. However, models obtained in other similar tasks, such as pedestrian collision risk analysis and human-robot interaction, are very difficult generalize task, because difference data distributions. Towards improving generalization with less amount labeled data, this paper presents semi-supervised algorithm problem. We utilize temporal continuity unsupervised signal regularize model achieve better ability. This applied both training testing phases, which increases performance by large margin. show that leveraging unlabeled sequences, required can be significantly reduced. also discuss several important details on balancing loss, making combinations. Experimental results our approach robustly outputs different types actor. The aesthetic value video improved task.

参考文章(33)
Amir Roshan Zamir, Khurram Soomro, Mubarak Shah, UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild arXiv: Computer Vision and Pattern Recognition. ,(2012)
Xiaolong Wang, Abhinav Gupta, Unsupervised Learning of Visual Representations Using Videos 2015 IEEE International Conference on Computer Vision (ICCV). pp. 2794- 2802 ,(2015) , 10.1109/ICCV.2015.320
Harri Valpola, Tapani Raiko, Antti Rasmus, Mikko Honkala, Mathias Berglund, Semi-supervised learning with Ladder networks neural information processing systems. ,vol. 28, pp. 3546- 3554 ,(2015)
Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun, Unsupervised Learning of Spatiotemporally Coherent Metrics 2015 IEEE International Conference on Computer Vision (ICCV). pp. 4086- 4093 ,(2015) , 10.1109/ICCV.2015.465
Wu Liu, Yongdong Zhang, Sheng Tang, Jinhui Tang, Richang Hong, Jintao Li, Accurate Estimation of Human Body Orientation From RGB-D Sensors IEEE Transactions on Systems, Man, and Cybernetics. ,vol. 43, pp. 1442- 1452 ,(2013) , 10.1109/TCYB.2013.2272636
Fabian Flohr, Madalin Dumitru-Guzu, Julian F. P. Kooij, Dariu M. Gavrila, A Probabilistic Framework for Joint Pedestrian Head and Body Orientation Estimation IEEE Transactions on Intelligent Transportation Systems. ,vol. 16, pp. 1872- 1882 ,(2015) , 10.1109/TITS.2014.2379441
Felipe Patiño Vista, Deok-Jin Lee, Kil To Chong, Design of an EKF-CI based sensor fusion for robust heading estimation of marine vehicle International Journal of Precision Engineering and Manufacturing. ,vol. 16, pp. 403- 407 ,(2015) , 10.1007/S12541-015-0054-9
Catalin Ionescu, Dragos Papava, Vlad Olaru, Cristian Sminchisescu, Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 36, pp. 1325- 1339 ,(2014) , 10.1109/TPAMI.2013.248
David Yarowsky, Unsupervised word sense disambiguation rivaling supervised methods Proceedings of the 33rd annual meeting on Association for Computational Linguistics -. pp. 189- 196 ,(1995) , 10.3115/981658.981684
Shenghuo Zhu, Will Zou, Andrew Y. Ng, Kai Yu, Deep Learning of Invariant Features via Simulated Fixations in Video neural information processing systems. ,vol. 25, pp. 3203- 3211 ,(2012)