IVFPQ + HNSW for Billion-scale Similarity Search

The best indexing approach for billion-sized vector datasets

Peggy Chang
Towards Data Science
17 min readAug 29, 2022

--

Article cover for “IVFPQ + HNSW for Billion-scale Similarity Search — The best indexing approach for billion-sized vector datasets”. Author: Peggy Chang
Photo by Paul Talbot on Unsplash

We learned about IVFPQ in the previous article, where the inverted file index (IVF) is combined with product quantization (PQ) to create an effective method for large-scale similarity search.

--

--