Polysemous Codes

European Conference on Computer Vision 2016 (ECCV)
October 10, 2016

Abstract

This paper considers the problem of approximate nearest neighbor search in the compressed domain. We introduce polysemous codes, which offer both the distance estimation quality of product quantization and the efficient comparison of binary codes with Hamming distance. Their design is inspired by algorithms formerly introduced in the 90's to construct channel-optimized vector quantizers. At search time, this dual interpretation accelerates the search. Most of the indexed vectors are filtered out with Hamming distance, letting only a fraction of the vectors to be ranked with an asymmetric distance estimator. The method is complementary with a coarse partitioning of the feature space such as the inverted multi-index. This is shown by our experiments performed on several benchmarks. In particular, our approach outperforms the state of the art by a large margin on the BIGANN dataset comprising one billion vectors. Last but not least, our approach allows the approximate computation of the kNN graph associated with the Yahoo Flickr Creative Commons 100M, described by CNN image descriptors, in less than 8 hours on a single machine.

Related Blog Posts

Learning to Segment
by Piotr DollarAug 25, 2016
fastText
by Armand Joulin, Edouard Grave, Piotr Bojanowski, Tomas MikolovAug 18, 2016
Facebook Researchers Focus on the Most Challenging Machine Learning Questions at ICML 2016
by Jason Weston, Leon Bottou, Joaquin Quinonero Candela, Hussein Mehanna, Pierre Andrews, Aditya Kalro, Alexander Sidorov, Ronan Collobert, Armand Joulin, Laurens van der Maaten, David Grangier, Tomas Mikolov, Antoine Bordes, Rob Fergus, Lars Backstrom, Ross GirshickJun 19, 2016
Facebook AI Research at ICLR 2016
by Ari EntinMay 2, 2016

Related Publications

Learning to Refine Object Segments
by Pedro O. Pinheiro, Tsung-Yi Lin, Ronan Collobert, Piotr DollarECCVOct 10, 2016
Polysemous Codes
by Matthijs Douze, Hervé Jégou, Florent PerronninEuropean Conference on Computer Vision 2016 (ECCV)Oct 10, 2016
A MultiPath Network for Object Detection
by Sergey Zagoruyko, Adam Lerer, Tsung-Yi Lin, Pedro O. Pinheiro, Sam Gross, Soumith Chintala, Piotr DollarBMVCSep 18, 2016
Joint Learning of Speaker and Phonetic Similarities with Siamese Networks
by Neil Zeghidour, Gabriel Synnaeve, Nicolas Usunier, Emmanuel DupouxInterspeech 2016Sep 8, 2016

Join Us

Do you want to help more than a billion people all over the world connect and share?

View Open Positions

Code

Learn about our open source tools and technologies, our challenging scaling experiences, and more.

Go to Facebook Code