Newsletters




Qdrant Launches Tiered Multitenancy for Vector Search


Qdrant, the open-source vector search engine used by enterprises and AI-native teams, announced Tiered Multitenancy—as part of the v1.16 release—a new capability that helps organizations isolate heavy-traffic tenants, improve performance, and scale vector search workloads more efficiently.

According to the company, Tiered Multitenancy stores all tenants in one shared collection and allows operators to promote any large or latency-sensitive tenant to a dedicated shard when needed.

Promotion happens without downtime, without reindexing, and without requiring any changes to client applications. Shared and dedicated paths operate within the same collection, keeping operations simple while ensuring predictable performance for high-demand tenants.

“Customers want strong tenant isolation without the operational burden of maintaining dozens of separate indexes,” said Andre Zayarni, CEO at Qdrant. “Tiered Multitenancy offers that balance. Teams can scale their largest tenants independently while keeping the rest of the system simple and unified.”

Qdrant combines payload-based filtering and custom sharding inside a single architecture. Tenants begin in a shared fallback shard. When a tenant grows or requires dedicated resources, operators can promote it through a single API call that uses a filtered streaming transfer mechanism. Throughout the transfer, Qdrant automatically routes, reads, and writes to the correct shard and maintains consistency guarantees so applications remain fully operational.

This approach removes the complex client-side routing logic common in multi-tenant systems, the company said.

Some competing platforms require a separate index for every tenant, while others cannot support cross-tenant search at all.

Qdrant is one of the first vector search engines to provide both tenant isolation and global search inside the same collection. This makes it possible to support hybrid workloads such as agents that access tenant-specific memory while also querying a global knowledge base.

Multi-tenant RAG platforms, coding agents, and enterprise copilots frequently host customers with very different performance profiles. Tiered Multitenancy gives teams the ability to isolate large customers, scale compute only where necessary, and maintain a global index for cross-tenant retrieval. It also reduces operational complexity by consolidating multi-index architectures into a single collection, the company said.

For more information about this news, visit https://qdrant.tech.


Sponsors