Product Quantization (PQ) is a lossy compression technique for high-dimensional vectors that dramatically reduces memory footprint by splitting each vector into subvectors, independently quantizing each subspace using learned codebooks, and representing the original vector as a short concatenated code. This process transforms a continuous vector space into a product of quantized subspaces, enabling the storage of billions of vectors in RAM by replacing full-precision floats with compact integer codes. The core trade-off is between compression ratio, reconstruction error, and search accuracy, making PQ fundamental for scalable vector database infrastructure and dense vector indices.
