Skip to main navigation Skip to search Skip to main content

Scalable forest hashing for fast similarity search

  • Nanyang Technological University

Research output: Contribution to journalConference articlepeer-review

11 Scopus citations

Abstract

Indexing images and videos using binary hash bits has shown promising results for fast similarity search. Existing datadriven hashing methods learn compact hash codes from the data, but usually with the cost of generating unbalanced hash buckets, thus affecting the search efficiency. We propose a novel data-driven hashing method called forest hashing, which utilizes multiple tree structures to perform data hashing. By leveraging the index structure of trees, we can significantly improve the hashing efficacy by generating balanced hash buckets. Moreover, forest hashing naturally supports scalable coding where more trees can improve the coding quality with a longer code. Last but not the least, our forest hashing can be easily extended for semantic search by integrating semi-supervised label information. Experiments on two benchmark datasets show favorable results compared with the state-of-the-art hashing methods.

Original languageEnglish
Article number6890219
JournalProceedings - IEEE International Conference on Multimedia and Expo
Volume2014-September
Issue numberSeptmber
DOIs
StatePublished - Sep 3 2014
Event2014 IEEE International Conference on Multimedia and Expo, ICME 2014 - Chengdu, China
Duration: Jul 14 2014Jul 18 2014

Keywords

  • Approximated Nearest Neighbor Search
  • Random Projection Trees
  • Scalable Hashing
  • Sematic Search

Fingerprint

Dive into the research topics of 'Scalable forest hashing for fast similarity search'. Together they form a unique fingerprint.

Cite this