Transferring Dense Pose to Proximal Animal Classes.

Authors: Andrea Vedaldi, Natalia Neverova, Vasil Khalidov, Maureen S. McCarthy, Artsiom Sanakoyeu

DOI:

Keywords:

Abstract: Recent contributions have demonstrated that it is possible to recognize the pose of humans densely and accurately given a large dataset of poses annotated in detail. In principle, the same approach could be extended to any animal class, but the effort required for collecting new annotations for each case makes this strategy impractical, despite important applications in natural conservation, science and business. We show that, at least for proximal animal classes such as chimpanzees, it is possible to transfer the knowledge existing in dense pose recognition for humans, as well as in more general object detectors and segmenters, to the problem of dense pose recognition in other classes. We do this by (1) establishing a DensePose model for the new animal which is also geometrically aligned to humans, (2) introducing a multi-head R-CNN architecture that facilitates the transfer of multiple recognition tasks between classes, (3) finding which combination of known classes can be transferred most effectively to the new animal, and (4) using self-calibrated uncertainty heads to generate pseudo-labels graded by quality for training a model for this class. We also introduce two benchmark datasets labelled in the manner of DensePose for the class chimpanzee and use them to evaluate our approach, showing excellent transfer learning performance.
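The pseudo-labelling idea in step (4) can be illustrated with a minimal sketch. It is not the authors' implementation: the `Prediction` record, the `grade_pseudo_labels` function, and the `keep_fraction` and `num_grades` parameters are hypothetical names introduced here, and the sketch assumes each prediction already carries a scalar uncertainty where lower means more confident. It only shows how predictions might be ranked by uncertainty, filtered, and split into quality grades.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Prediction:
    """Hypothetical record for one model prediction on an unlabelled sample."""
    sample_id: str
    label: int          # predicted label, e.g. a body-part index
    uncertainty: float  # predicted uncertainty; lower means more confident


def grade_pseudo_labels(
    preds: List[Prediction],
    keep_fraction: float = 0.5,
    num_grades: int = 2,
) -> Dict[int, List[Prediction]]:
    """Keep the most confident fraction of predictions and split the
    survivors into quality grades, where grade 0 is the most confident."""
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    if num_grades < 1:
        raise ValueError("num_grades must be at least 1")

    # Rank from most to least confident.
    ranked = sorted(preds, key=lambda p: p.uncertainty)
    kept = ranked[: int(len(ranked) * keep_fraction)]

    grades: Dict[int, List[Prediction]] = {g: [] for g in range(num_grades)}
    for rank, pred in enumerate(kept):
        # Map the rank onto equally sized grade buckets.
        grades[rank * num_grades // len(kept)].append(pred)
    return grades


if __name__ == "__main__":
    demo = [
        Prediction("a", 3, 0.9),
        Prediction("b", 1, 0.1),
        Prediction("c", 2, 0.5),
        Prediction("d", 1, 0.3),
    ]
    for grade, items in grade_pseudo_labels(demo).items():
        print(grade, [p.sample_id for p in items])
```

In a training loop, the grades could be used to weight or schedule the pseudo-labelled samples. How the paper actually uses graded pseudo-labels is not specified in this abstract.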
