
Visual semantics for reducing false positives in video search

  • Janya Inc.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

This research explores the interaction of textual and visual information in video indexing and searching. Much of the recent work has focused on machine learning techniques that learn from both text and image/video features, e.g. the text surrounding a photograph on a web page. This is useful in similarity search (i.e. searching by example), but has drawbacks when more semantic search is desired, e.g. find video clips of Obama meeting with ordinary citizens. By extracting key visual semantics from the audio/text accompanying video, we are able to enhance the precision and granularity of video search. Visual semantics relates to identifying and correlating linguistic triggers with visual properties of accompanying video/images. Significant progress has been made in text-based information extraction, which can be brought to bear for video search. In this paper, we focus on linguistic triggers related to a special class of events referred to as nominal events. We describe how proper detection and interpretation of such events can prevent false positives in video searches.

Original language: English
Title of host publication: Multimedia Information Extraction - Papers from the AAAI Fall Symposium, Technical Report
Publisher: American Association for Artificial Intelligence
Pages: 31-35
Number of pages: 5
ISBN (Print): 9781577353973
State: Published - 2008
Event: 2008 AAAI Fall Symposium - Arlington, VA, United States
Duration: Nov 7 2008 to Nov 9 2008

Publication series

Name: AAAI Fall Symposium - Technical Report
Volume: FS-08-05

Conference

Conference: 2008 AAAI Fall Symposium
Country/Territory: United States
City: Arlington, VA
Period: 11/7/08 to 11/9/08
