TY - GEN
T1 - Use of multimedia input in automated image annotation and content-based retrieval
AU - Srihari, Rohini K.
PY - 1995
Y1 - 1995
N2 - This research explores the interaction of linguistic and photographic information in an integrated text/image database. By utilizing linguistic descriptions of a picture (speech and text input) coordinated with pointing references to the picture, we extract information useful in two aspects: image interpretation and image retrieval. In the image interpretation phase, objects and regions mentioned in the text are identified; the annotated image is stored in a database for future use. We incorporate techniques from our previous research on photo understanding using accompanying text: a system, PICTION, which identifies human faces in a newspaper photograph based on the caption. In the image retrieval phase, images matching natural language queries are presented to a user in a ranked order. This phase combines the output of (1) the image interpretation/annotation phase, (2) statistical text retrieval methods, and (3) image retrieval methods (e.g., color indexing). The system allows both point and click querying on a given image as well as intelligent querying across the entire text/image database.
AB - This research explores the interaction of linguistic and photographic information in an integrated text/image database. By utilizing linguistic descriptions of a picture (speech and text input) coordinated with pointing references to the picture, we extract information useful in two aspects: image interpretation and image retrieval. In the image interpretation phase, objects and regions mentioned in the text are identified; the annotated image is stored in a database for future use. We incorporate techniques from our previous research on photo understanding using accompanying text: a system, PICTION, which identifies human faces in a newspaper photograph based on the caption. In the image retrieval phase, images matching natural language queries are presented to a user in a ranked order. This phase combines the output of (1) the image interpretation/annotation phase, (2) statistical text retrieval methods, and (3) image retrieval methods (e.g., color indexing). The system allows both point and click querying on a given image as well as intelligent querying across the entire text/image database.
UR - https://www.scopus.com/pages/publications/0029457325
M3 - Conference contribution
AN - SCOPUS:0029457325
SN - 081941767X
SN - 9780819417671
T3 - Proceedings of SPIE - The International Society for Optical Engineering
SP - 249
EP - 260
BT - Proceedings of SPIE - The International Society for Optical Engineering
A2 - Niblack, Wayne
A2 - Jain, Ramesh C.
T2 - Storage and Retrieval for Image and Video Databases III
Y2 - 9 February 1995 through 10 February 1995
ER -