Approximate Nearest Neighbor (ANN) search is a class of algorithms that efficiently finds vectors semantically similar to a query vector in a high-dimensional space, trading a small, controlled amount of accuracy for orders-of-magnitude improvements in speed and memory usage compared to exact search. It is the core computational engine behind semantic search in vector stores, enabling rapid retrieval from large-scale embedding databases that underpin Retrieval-Augmented Generation (RAG) and agentic memory systems.
