
Computational models for integrating linguistic and visual information: A survey

Research output: Contribution to journal › Review article › peer-review

35 Scopus citations

Abstract

This paper surveys research in developing computational models for integrating linguistic and visual information. It begins with a discussion of systems that have actually been implemented and continues with computationally motivated theories of human cognition. Since existing research spans several disciplines (e.g., natural language understanding, computer vision, knowledge representation), as well as several application areas, an important contribution of this paper is to categorize existing research based on inputs and objectives. Finally, some key issues related to integrating information from two such diverse sources are outlined and related to existing research. Throughout, the key issue addressed is the correspondence problem, namely how to associate visual events with words and vice versa.

Original language: English
Pages (from-to): 349-369
Number of pages: 21
Journal: Artificial Intelligence Review
Volume: 8
Issue number: 5-6
State: Published - Sep 1994

Keywords

  • computer vision
  • diagram understanding
  • multimedia
  • natural language understanding
  • spatial reasoning
