Convolutional Neural Networks for joint object detection and pose estimation: A comparative study

作者: Renaud Marlet , Mathieu Aubry , Francisco Massa

DOI:

关键词:

摘要: In this paper we study the application of convolutional neural networks for jointly detecting objects depicted in still images and estimating their 3D pose. We identify different feature representations oriented objects, energies that lead a network to learn representations. The choice representation is crucial since pose an object has natural, continuous structure while its category discrete variable. evaluate approaches on joint detection estimation task Pascal3D+ benchmark using Average Viewpoint Precision. show classification approach discretized viewpoints achieves state-of-the-art performance estimation, significantly outperforms existing baselines benchmark.

参考文章(5)
Ronan Collobert, Clément Farabet, Koray Kavukcuoglu, Torch7: A Matlab-like Environment for Machine Learning neural information processing systems. ,(2011)
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, Li Fei-Fei, ImageNet: A large-scale hierarchical image database computer vision and pattern recognition. pp. 248- 255 ,(2009) , 10.1109/CVPR.2009.5206848
Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition Proceedings of the IEEE. ,vol. 86, pp. 2278- 2324 ,(1998) , 10.1109/5.726791
Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, Lawrence D Jackel, None, Backpropagation applied to handwritten zip code recognition Neural Computation. ,vol. 1, pp. 541- 551 ,(1989) , 10.1162/NECO.1989.1.4.541
Pierre Sermanet, Yann LeCun, David Eigen, Rob Fergus, Michael Mathieu, Xiang Zhang, OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks international conference on learning representations. ,(2014)