Product Quantization for Nearest Neighbor Search

Publication: Article, 01 Jan 2011, France

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 33, pages 117-128 (ISSN: 0162-8828)

Authors: Jégou, Hervé; Douze, Matthijs; Schmid, Cordelia;

doi: 10.1109/tpami.2010.57

pmid: 21088323

Abstract

This paper introduces a product quantization-based approach for approximate nearest neighbor search. The idea is to decompose the space into a Cartesian product of low-dimensional subspaces and to quantize each subspace separately. A vector is represented by a short code composed of its subspace quantization indices. The Euclidean distance between two vectors can be efficiently estimated from their codes. An asymmetric version increases precision, as it computes the approximate distance between a vector and a code. Experimental results show that our approach searches for nearest neighbors efficiently, in particular in combination with an inverted file system. Results for SIFT and GIST image descriptors show excellent search accuracy, outperforming three state-of-the-art approaches. The scalability of our approach is validated on a data set of two billion vectors.
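The encoding and asymmetric distance estimation described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`kmeans`, `pq_train`, `pq_encode`, `pq_decode`, `adc_distances`), the toy Lloyd's k-means, and the parameter values (`m` subspaces, `k` centroids per subspace, random Gaussian data) are all assumptions chosen for illustration, and the inverted file system is left out.

```python
import numpy as np

def kmeans(x, k, iters=10, seed=0):
    """Plain Lloyd's k-means (illustrative); returns a (k, d) centroid array."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distances from every point to every centroid: shape (n, k).
        d2 = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = d2.argmin(axis=1)
        for j in range(k):
            members = x[assign == j]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    return centroids

def pq_train(x, m, k):
    """Learn one codebook per subspace; returns shape (m, k, d // m)."""
    n, d = x.shape
    assert d % m == 0, "dimension must be divisible by the number of subspaces"
    sub = x.reshape(n, m, d // m)
    return np.stack([kmeans(sub[:, j, :], k, seed=j) for j in range(m)])

def pq_encode(x, codebooks):
    """Map each vector to m centroid indices, one per subspace."""
    m, k, ds = codebooks.shape
    assert k <= 256, "uint8 codes hold at most 256 centroids per subspace"
    sub = x.reshape(len(x), m, ds)
    # Squared distances to every centroid of every subspace: shape (n, m, k).
    d2 = ((sub[:, :, None, :] - codebooks[None, :, :, :]) ** 2).sum(axis=3)
    return d2.argmin(axis=2).astype(np.uint8)

def pq_decode(codes, codebooks):
    """Rebuild approximate vectors by concatenating the selected centroids."""
    m = codebooks.shape[0]
    return np.concatenate([codebooks[j, codes[:, j]] for j in range(m)], axis=1)

def adc_distances(query, codes, codebooks):
    """Asymmetric estimate: uncompressed query against encoded database vectors."""
    m, k, ds = codebooks.shape
    # Lookup table of squared distances from each query sub-vector
    # to each centroid of the matching subspace: shape (m, k).
    table = ((query.reshape(m, 1, ds) - codebooks) ** 2).sum(axis=2)
    # Sum the m table entries selected by each code: shape (n,).
    return table[np.arange(m), codes].sum(axis=1)

# Toy usage with assumed sizes: 16-dimensional vectors, 4 subspaces, 16 centroids each.
rng = np.random.default_rng(0)
base = rng.standard_normal((2000, 16)).astype(np.float32)
codebooks = pq_train(base[:1000], m=4, k=16)
codes = pq_encode(base, codebooks)          # 4 bytes per database vector
query = rng.standard_normal(16).astype(np.float32)
approx = adc_distances(query, codes, codebooks)
nearest = int(approx.argmin())              # index of the estimated nearest neighbor
```

In this sketch the asymmetric estimate equals the exact squared Euclidean distance between the query and the reconstructed (decoded) database vector, so only the database side carries quantization error. A symmetric variant would quantize the query as well before the table lookup.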

Country

France

Keywords

Models, Statistical, 000, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Information Storage and Retrieval, 004, Pattern Recognition, Automated, Representations, data structures, Artificial Intelligence, Sorting and searching, Image Interpretation, Computer-Assisted, Image Processing, Computer-Assisted, Cluster Analysis, Computer vision, transforms, Algorithms

Impact by BIP!

- Selected citations: 2K. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Top 0.01%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Top 0.01%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 0.1%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.