Finding Pictures of Objects in Large Collections of Images

作者: David A. Forsyth , Jitendra Malik , Margaret M. Fleck , Hayit Greenspan , Thomas Leung

DOI: 10.1007/3-540-61750-7_36

关键词:

摘要: Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures notions of that are strongly oriented the presence abstractly defined objects. Computer programs implement these queries automatically desirable, but hard build because conventional object recognition techniques computer vision cannot recognize general objects in contexts. This paper describes our approach recognition, which structured around sequence increasingly specialized grouping activities assemble coherent regions can be shown satisfy stringent constraints. The constraints satisfied provide form classification quite view distinguished by: far richer involvement early visual primitives, including color and texture; hierarchical learning strategies process; ability deal with rather uncontrolled configurations We illustrate properties four case-studies: one demonstrating use texture descriptors; showing how trees described by fusing geometric properties; scenery concepts grouped features; this yields program tell, accurately, whether picture contains naked people or not.

参考文章(51)
Barry Taylor, Tense and continuity Linguistics and Philosophy. ,vol. 1, pp. 199- 220 ,(1977) , 10.1007/BF00351103
Stan Sclaroff, World Wide Web Image Search Engines Boston University Computer Science Department. ,(1995)
Thomas Peter Minka, An image database browser that learns from user interaction Massachusetts Institute of Technology. ,(1996)
Carol Lee Tenny, Grammaticalizing aspect and affectedness Massachusetts Institute of Technology. ,(1987)
Jitendra Malik, Ruth Rosenholtz, Recovering surface curvature and orientation from texture distortion: a least squares algorithm and sensitivity analysis european conference on computer vision. pp. 353- 364 ,(1994) , 10.1007/3-540-57956-7_39
Margaret M. Fleck, David A. Forsyth, Chris Bregler, Finding Naked People european conference on computer vision. pp. 593- 602 ,(1996) , 10.1007/3-540-61123-1_173
Thomas Leung, Jitendra Malik, Detecting, localizing and grouping repeated scene elements from an image european conference on computer vision. pp. 546- 555 ,(1996) , 10.1007/BFB0015565
C.A. Rothwell, A. Zisserman, J.L. Mundy, D.A. Forsyth, Efficient model library access by projectively invariant indexing functions computer vision and pattern recognition. pp. 109- 114 ,(1992) , 10.1109/CVPR.1992.223219
Gilles Burel, Dominique Carel, Detection and localization of faces on digital images Pattern Recognition Letters. ,vol. 15, pp. 963- 967 ,(1994) , 10.1016/0167-8655(94)90027-2
Jonathan H. Connell, Michael Brady, Generating and generalizing models of visual objects Artificial Intelligence. ,vol. 31, pp. 159- 183 ,(1987) , 10.1016/0004-3702(87)90018-X