Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Grassmann Hashing For Approximate Nearest Neighbor Search In High Dimensional Space
 
conference paper

Grassmann Hashing For Approximate Nearest Neighbor Search In High Dimensional Space

Wang, Xinchao  
•
Li, Zhu
•
Zhang, Lei
Show more
2011
2011 Ieee International Conference On Multimedia And Expo (Icme)
IEEE International Conference on Multimedia and Expo (ICME)

Locality-Sensitive Hashing (LSH) approximates nearest neighbors in high dimensions by projecting original data into low-dimensional subspaces. The basic idea is to hash data samples to ensure that the probability of collision is much higher for samples that are close to each other than for those that are far apart. However, by applying k random hashing functions on original data, LSH fails to find the most discriminant hashing-subspaces, so the nearest neighbor approximation is inefficient. To alleviate this problem, we propose the Grassmann Hashing (GRASH) for approximate nearest neighbor search in high dimensions. GRASH frrst introduces a set of subspace candidates from Linear Discriminant Analysis (LDA). Then it applies Grassmann metric to select the optimal subspaces for hashing. Finally, it generates hashing codes based on non-uniform bucket size design motivated by Lloyd-Max quantization. The proposed GRASH model enjoys a number of merits: 1) GRASH introduces the Grassmann metric to measure the similarity between different hashing subspaces, so the hashing function can better capture the data diversity; 2) GRASH obtains the subspace candidates from LDA, so it incorporates the discriminant information into the hashing functions; 3) GRASH extends LSH's 1-d hashing subspaces to m-d, i.e. it is a multidimensional extension of hashing approximation; 4) motivated by Lloyd-Max quantization, GRASH applies non-uniform size bucket to generate hashing codes, so the distortion can be minimized. Experimental results on a number of datasets confirm the validity of our proposed model.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICME.2011.6012027
Web of Science ID

WOS:000304354700029

Author(s)
Wang, Xinchao  
Li, Zhu
Zhang, Lei
Yuan, Junsong
Date Issued

2011

Publisher

Ieee

Publisher place

New York

Published in
2011 Ieee International Conference On Multimedia And Expo (Icme)
ISBN of the book

978-1-61284-349-0

Total of pages

6

Subjects

hashing

•

subspace learning

•

grassmann manifold

•

optimization

Editorial or Peer reviewed

REVIEWED

Written at

OTHER

EPFL units
ISIM  
Event name
IEEE International Conference on Multimedia and Expo (ICME)
Available on Infoscience
February 27, 2013
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/89833
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés