Research Article: Query-Aware ANN Sharding for Vector Search
Abstract
We present a query-aware shard assigner that co-optimizes recall and tail latency for enterprise vector search.
Cite this article
Jones, O. & Torres, N. (2022). Research Article: Query-Aware ANN Sharding for Vector Search. Research Explorations in Global Knowledge & Technology (REGKT), 1 (10). Retrieved from https://regkt.com/article.php?id=686&slug=query-aware-ann-sharding-vector-search