Abstract
This paper surveys research on computational models for integrating linguistic and visual information. It begins with a discussion of systems that have actually been implemented and continues with computationally motivated theories of human cognition. Since existing research spans several disciplines (e.g., natural language understanding, computer vision, knowledge representation) as well as several application areas, an important contribution of this paper is to categorize existing research by inputs and objectives. Finally, some key issues in integrating information from two such diverse sources are outlined and related to existing research. Throughout, the key issue addressed is the correspondence problem, namely how to associate visual events with words and vice versa.
| Original language | English |
|---|---|
| Pages (from-to) | 349-369 |
| Number of pages | 21 |
| Journal | Artificial Intelligence Review |
| Volume | 8 |
| Issue number | 5-6 |
| State | Published - Sep 1994 |
Keywords
- computer vision
- diagram understanding
- multimedia
- natural language understanding
- spatial reasoning
Title
Computational models for integrating linguistic and visual information: A survey