Enter your document count and embedding dimensions to get RAM, storage, and monthly cost estimates across pgvector, Pinecone, Weaviate, Qdrant and Chroma.
Try the Calculator. Free · No login required · Results in seconds
Each chunk typically 200–500 words
Storage Needed: 4.0 GB (with HNSW index)
Recommended RAM: 6.1 GB (for in-memory index)
Estimated CPU: 2 cores (for 100,000 queries/mo)
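The headline numbers above come down to simple arithmetic: raw storage is vectors × dimensions × 4 bytes (float32), the HNSW graph index adds roughly 20–40% on top, and RAM should hold the whole index with some headroom. A minimal sketch of that math; the 30% index overhead and 1.25× RAM headroom are illustrative assumptions, not any provider's published formula:

```python
def estimate_footprint(num_vectors: int, dims: int,
                       index_overhead: float = 0.30,
                       ram_headroom: float = 1.25) -> dict:
    """Rough storage/RAM estimate for a float32 vector index.

    index_overhead: assumed extra space for the HNSW graph (20-40% is typical).
    ram_headroom: assumed multiplier so the index sits comfortably in memory.
    """
    raw_bytes = num_vectors * dims * 4              # float32 = 4 bytes per dim
    storage_bytes = raw_bytes * (1 + index_overhead)
    ram_bytes = storage_bytes * ram_headroom
    gb = 1024 ** 3
    return {
        "storage_gb": round(storage_bytes / gb, 1),
        "ram_gb": round(ram_bytes / gb, 1),
    }

# e.g. 500,000 chunks embedded at 1536 dimensions
print(estimate_footprint(500_000, 1536))
# → {'storage_gb': 3.7, 'ram_gb': 4.6}
```

CPU sizing is harder to reduce to a formula, since it depends on query concurrency and filtering complexity rather than raw vector count.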
Recommended: Qdrant Cloud
Mid-scale workload — Qdrant offers the best price/performance balance at this vector count.
Estimated: $9.06/month
Weaviate Cloud
managed · $0.8571/mo
Managed Weaviate with hybrid search (vector + BM25).
Best for: Hybrid search, graph traversal, rich filtering needs
1GB storage + 1M queries/month (sandbox)
Pinecone (Serverless)
managed · $3.14/mo
Fully managed, serverless vector DB. Pay per use.
Best for: Production scale, variable traffic, no ops overhead
2GB storage + 100K queries/month
Qdrant Cloud
managed · $9.06/mo
High-performance vector DB with rich filtering. Very affordable.
Best for: High-volume queries, rich payload filtering, cost-sensitive teams
1GB free cluster
Chroma (self-hosted)
self-hosted · $20.20/mo
Python-native open-source vector DB. Great for prototyping.
Best for: Prototyping, Python stacks, RAG proof-of-concepts
Free to self-host
pgvector (PostgreSQL)
self-hosted · $50.47/mo
Open-source extension for PostgreSQL. Best for <5M vectors.
Best for: Under 5M vectors, existing Postgres users, budget-conscious
Free on own server
* Costs are estimates based on published pricing as of March 2026. Initial write costs are not included. Verify with provider before production use.
Digiqt designs and deploys production RAG systems with the right vector database for your scale and budget.
Do I always need a dedicated vector database? Not always. For corpora under 100,000 chunks, pgvector on a standard PostgreSQL instance handles similarity search efficiently. Dedicated vector databases like Pinecone and Qdrant pull ahead at millions of vectors or high query rates.
What is HNSW? Hierarchical Navigable Small World (HNSW) is the algorithm most vector databases use for approximate nearest-neighbor search. It trades extra memory (typically 20–40% on top of raw vector storage, for the graph index) for dramatically faster query times than brute-force search.
How many embedding dimensions do I need? More dimensions generally mean better semantic accuracy but higher storage and compute cost. OpenAI's text-embedding-3-small (1536 dims) balances accuracy and cost well for most use cases. For budget-sensitive applications, 768-dim embeddings (e.g. Google's models or BERT) offer good quality at half the storage.
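Because raw storage scales linearly with dimensions, halving the dimension count halves the footprint. A quick sketch of the comparison; the 2-million-chunk corpus is an illustrative figure, not from the calculator:

```python
BYTES_PER_DIM = 4  # float32

def raw_storage_gb(num_vectors: int, dims: int) -> float:
    """Raw (index-free) storage for float32 embeddings, in GB."""
    return num_vectors * dims * BYTES_PER_DIM / 1024**3

corpus = 2_000_000  # illustrative chunk count
for name, dims in [("text-embedding-3-small (1536d)", 1536),
                   ("768-dim model", 768)]:
    print(f"{name}: {raw_storage_gb(corpus, dims):.2f} GB raw")
```

The same linear scaling applies to query-time compute, since every distance calculation touches every dimension.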
Can pgvector handle production workloads? Yes. pgvector is used in production by many companies for datasets of up to 10–50 million vectors. It runs inside standard PostgreSQL, benefits from all Postgres tooling, and requires no additional infrastructure. Supabase and Neon both offer pgvector as a managed service.
When should I migrate off pgvector? Consider migrating when query latency exceeds 200 ms, you have over 10 million vectors, you need multi-tenancy at scale, or you require features like hybrid search or payload filtering that pgvector doesn't support natively.
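Those migration triggers can be expressed as a simple checklist. The thresholds below are the ones quoted above; the snapshot fields are hypothetical names for metrics you would pull from your own monitoring:

```python
from dataclasses import dataclass

@dataclass
class WorkloadSnapshot:
    # hypothetical metrics gathered from monitoring
    p95_latency_ms: float
    num_vectors: int
    needs_multitenancy: bool = False
    needs_hybrid_search: bool = False

def should_leave_pgvector(w: WorkloadSnapshot) -> list[str]:
    """Return the reasons (if any) to consider a dedicated vector DB."""
    reasons = []
    if w.p95_latency_ms > 200:
        reasons.append("query latency exceeds 200 ms")
    if w.num_vectors > 10_000_000:
        reasons.append("more than 10M vectors")
    if w.needs_multitenancy:
        reasons.append("multi-tenancy at scale")
    if w.needs_hybrid_search:
        reasons.append("needs hybrid search or advanced payload filtering")
    return reasons

print(should_leave_pgvector(WorkloadSnapshot(250, 12_000_000)))
# → ['query latency exceeds 200 ms', 'more than 10M vectors']
```

An empty list means pgvector is still a reasonable fit; any non-empty result is a prompt to benchmark a dedicated engine, not an automatic migration order.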
Ahmedabad
B-714, K P Epitome, near Dav International School, Makarba, Ahmedabad, Gujarat 380051
+91 99747 29554
Mumbai
C-20, G Block, WeWork, Enam Sambhav, Bandra-Kurla Complex, Mumbai, Maharashtra 400051
+91 99747 29554
Stockholm
Bäverbäcksgränd 10, 124 62 Bandhagen, Stockholm, Sweden
+46 72789 9039

Malaysia
Level 23-1, Premier Suite One Mont Kiara, No 1, Jalan Kiara, Mont Kiara, 50480 Kuala Lumpur