Control structures for incorporating picture-specific context in image interpretation

Authors: Rohini K. Srihari, Rajiv Chopra

Keywords: Interpretation (philosophy), Context (language use), Natural language understanding, Domain (software engineering), Information retrieval, Control (linguistics), Object (computer science), Computer science, Identification (information), Task (project management), Natural language processing, Artificial intelligence

Abstract: This paper describes an efficient control mechanism for incorporating picture-specific context in the task of image interpretation. Although other knowledge-based vision systems use general domain context to reduce the computational burden of interpretation, to our knowledge, this is the first effort exploring collateral information. We assume that constraints on the picture are generated from a natural language understanding module which processes descriptive text accompanying the pictures. We have developed a unified framework for exploiting these constraints in both the object location and identification (labeling) stages. In particular, we describe a technique for constrained search in context-based vision. Finally, we demonstrate the effectiveness of this approach in PICTION, a system that uses captions to label human faces in newspaper photographs.

References (15)
David Sher, Sargur N. Srihari, Venu Govindaraju. A computational model for face location based on cognitive principles. National Conference on Artificial Intelligence, pp. 350-355 (1992)
Rohini K. Srihari, Debra T. Burhans. Visual semantics: extracting visual information from text accompanying pictures. National Conference on Artificial Intelligence, pp. 793-798 (1994)
Peter Gilman Selfridge. Reasoning about success and failure in aerial image understanding. The University of Rochester (1982)
Rohini Srihari, Venu Govindaraju, Rajiv Chopra, Mahesh Venkataraman, Debra T. Burhans. Use of Collateral Text in Image Interpretation (1994)
Thomas David Garvey. Perceptual strategies for purposive vision (1975)
Thomas M. Strat. Natural Object Recognition (1992)
J. A. Feldman, D. H. Ballard, C. M. Brown. An approach to knowledge-directed image analysis. International Joint Conference on Artificial Intelligence, pp. 664-670 (1977)
Oscar Firschein, Martin A. Fischler. Readings in computer vision: issues, problems, principles, and paradigms. Morgan Kaufmann Publishers Inc. (1987)
Roger Mohr, Gérald Masini. Good old discrete relaxation. European Conference on Artificial Intelligence, pp. 651-656 (1988)
H. Niemann, G.F. Sagerer, S. Schroder, F. Kummert. ERNEST: a semantic network system for pattern understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, pp. 883-905 (1990). DOI: 10.1109/34.57683