A consistency level is a configurable setting in a distributed vector database that determines how many replica nodes must acknowledge a read or write operation before it is considered successful, directly controlling the trade-off between data accuracy and system latency.
In a replicated system, data is copied across multiple nodes for fault tolerance. When a client writes a new vector, the database must decide when to confirm that write to the application. A strong consistency level like ALL requires acknowledgment from every replica before returning success, guaranteeing that any subsequent read will see the latest data but incurring high latency. A weaker level like ONE returns success after just one replica acknowledges, offering low latency but risking that a read from a different replica might return stale data. This configuration is a core part of the CAP theorem trade-off, allowing engineers to tune the system based on whether consistency, availability, or partition tolerance is the priority for a given workload.