Skip to main navigation Skip to search Skip to main content

Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval

  • Daekeun You
  • , Sameer Antani
  • , Dina Demner-Fushman
  • , Md Mahmudur Rahman
  • , Venu Govindaraju
  • , George R. Thoma
  • SUNY Buffalo
  • National Institutes of Health

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Biomedical images are often referenced for clinical decision support (CDS), educational purposes, and research. They appear in specialized databases or in biomedical publications and are not meaningfully retrievable using primarily textbased retrieval systems. The task of automatically finding the images in an article that are most useful for the purpose of determining relevance to a clinical situation is quite challenging. An approach is to automatically annotate images extracted from scientific publications with respect to their usefulness for CDS. As an important step toward achieving the goal, we proposed figure image analysis for localizing pointers (arrows, symbols) to extract regions of interest (ROI) that can then be used to obtain meaningful local image content. Content-based image retrieval (CBIR) techniques can then associate local image ROIs with identified biomedical concepts in figure captions for improved hybrid (text and image) retrieval of biomedical articles. In this work we present methods that make robust our previous Markov random field (MRF)-based approach for pointer recognition and ROI extraction. These include use of Active Shape Models (ASM) to overcome problems in recognizing distorted pointer shapes and a region segmentation method for ROI extraction. We measure the performance of our methods on two criteria: (i) effectiveness in recognizing pointers in images, and (ii) improved document retrieval through use of extracted ROIs. Evaluation on three test sets shows 87% accuracy in the first criterion. Further, the quality of document retrieval using local visual features and text is shown to be better than using visual features alone.

Original languageEnglish
Title of host publicationProceedings of SPIE-IS and T Electronic Imaging - Document Recognition and Retrieval XVIII
DOIs
StatePublished - 2011
EventDocument Recognition and Retrieval XVIII - San Francisco, CA, United States
Duration: Jan 26 2011Jan 27 2011

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume7874
ISSN (Print)0277-786X

Conference

ConferenceDocument Recognition and Retrieval XVIII
Country/TerritoryUnited States
CitySan Francisco, CA
Period01/26/1101/27/11

Keywords

  • Active Shape Model
  • biomedical article retrieval
  • Biomedical image analysis
  • content-based image retrieval
  • image overlay extraction
  • pointer symbol extraction

Fingerprint

Dive into the research topics of 'Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval'. Together they form a unique fingerprint.

Cite this