TY - GEN
T1 - Visual semantics for reducing false positives in video search
AU - Srihari, Rohini K.
AU - Novischi, Adrian
PY - 2008
Y1 - 2008
N2 - This research explores the interaction of textual and visual information in video indexing and searching. Much of the recent work has focused on machine learning techniques that learn from both text and image/video features, e.g. the text surrounding a photograph on a web page. This is useful in similarity search (i.e. searching by example), but has drawbacks when more semantic search is desired, e.g. find video clips of Obama meeting with ordinary citizens. By extracting key visual semantics from the audio/text accompanying video, we are able to enhance the precision and granularity of video search. Visual semantics relates to identifying and correlating linguistic triggers with visual properties of accompanying video/images. Significant progress has been made in text-based information extraction, which can be brought to bear for video search. In this paper, we focus on linguistic triggers related to a special class of events referred to as nominal events. We describe how proper detection and interpretation of such events can prevent false positives in video searches.
AB - This research explores the interaction of textual and visual information in video indexing and searching. Much of the recent work has focused on machine learning techniques that learn from both text and image/video features, e.g. the text surrounding a photograph on a web page. This is useful in similarity search (i.e. searching by example), but has drawbacks when more semantic search is desired, e.g. find video clips of Obama meeting with ordinary citizens. By extracting key visual semantics from the audio/text accompanying video, we are able to enhance the precision and granularity of video search. Visual semantics relates to identifying and correlating linguistic triggers with visual properties of accompanying video/images. Significant progress has been made in text-based information extraction, which can be brought to bear for video search. In this paper, we focus on linguistic triggers related to a special class of events referred to as nominal events. We describe how proper detection and interpretation of such events can prevent false positives in video searches.
UR - https://www.scopus.com/pages/publications/65649133597
M3 - Conference contribution
AN - SCOPUS:65649133597
SN - 9781577353973
T3 - AAAI Fall Symposium - Technical Report
SP - 31
EP - 35
BT - Multimedia Information Extraction - Papers from the AAAI Fall Symposium, Technical Report
PB - American Association for Artificial Intelligence
T2 - 2008 AAAI Fall Symposium
Y2 - 7 November 2008 through 9 November 2008
ER -