Vector sharding is a horizontal partitioning strategy that distributes a collection of high-dimensional vectors across multiple physical nodes or storage disks in a database cluster. It is a core technique for achieving scalability in vector databases, allowing the system to handle datasets that exceed the memory or storage capacity of a single machine. Sharding is performed using a shard key, which determines the assignment of each vector to a specific partition, often based on metadata attributes, vector IDs, or the results of a clustering algorithm. This distribution enables parallel query execution, as each shard can process a portion of a similarity search independently.




