Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition

作者: Jia Deng , J. Krause , A. C. Berg , Li Fei-Fei

DOI: 10.1109/CVPR.2012.6248086

关键词:

摘要: As visual recognition scales up to ever larger numbers of categories, maintaining high accuracy is increasingly difficult. In this work, we study the problem optimizing accuracy-specificity trade-offs in large scale recognition, motivated by observation that object categories form a semantic hierarchy consisting many levels abstraction. A classifier can select appropriate level, trading off specificity for case uncertainty. By trade-off, obtain classifiers try be as specific possible while guaranteeing an arbitrarily accuracy. We formulate maximizing information gain ensuring fixed, small error rate with hierarchy. propose Dual Accuracy Reward Trade-off Search (DARTS) algorithm and prove that, under practical conditions, it converges optimal solution. Experiments demonstrate effectiveness our on datasets ranging from 65 over 10,000 categories.

参考文章(31)
Hamed Masnadi-Shirazi, Nuno Vasconcelos, Risk minimization, probability elicitation, and cost-sensitive SVMs international conference on machine learning. pp. 759- 766 ,(2010)
Florent Perronnin, Jorge Sánchez, Thomas Mensink, Improving the fisher kernel for large-scale image classification european conference on computer vision. ,vol. 6314, pp. 143- 156 ,(2010) , 10.1007/978-3-642-15561-1_11
Marcin Marszałek, Cordelia Schmid, Constructing Category Hierarchies for Visual Recognition european conference on computer vision. ,vol. 5305, pp. 479- 491 ,(2008) , 10.1007/978-3-540-88693-8_35
Rob Fergus, Hector Bernal, Yair Weiss, Antonio Torralba, None, Semantic label sharing for learning with many categories european conference on computer vision. pp. 762- 775 ,(2010) , 10.1007/978-3-642-15549-9_55
Jia Deng, Alexander C. Berg, Kai Li, Li Fei-Fei, What does classifying more than 10,000 image categories tell us? european conference on computer vision. pp. 71- 84 ,(2010) , 10.1007/978-3-642-15555-0_6
Ofer Dekel, Joseph Keshet, Yoram Singer, Large margin hierarchical classification Twenty-first international conference on Machine learning - ICML '04. pp. 27- ,(2004) , 10.1145/1015330.1015374
Jianxiong Xiao, James Hays, Krista A. Ehinger, Aude Oliva, Antonio Torralba, SUN database: Large-scale scene recognition from abbey to zoo computer vision and pattern recognition. pp. 3485- 3492 ,(2010) , 10.1109/CVPR.2010.5539970
Tianshi Gao, Daphne Koller, Discriminative learning of relaxed hierarchy for large-scale visual recognition international conference on computer vision. pp. 2072- 2079 ,(2011) , 10.1109/ICCV.2011.6126481
Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, Thomas Huang, Yihong Gong, Locality-constrained Linear Coding for image classification computer vision and pattern recognition. pp. 3360- 3367 ,(2010) , 10.1109/CVPR.2010.5540018
Jorge Sanchez, Florent Perronnin, High-dimensional signature compression for large-scale image classification CVPR 2011. pp. 1665- 1672 ,(2011) , 10.1109/CVPR.2011.5995504