Pramod.AI
Deep Dive

Vector Databases

The memory layer of AI - search by meaning, not keywords

Traditional databases search by exact match - 'find rows where city = Tokyo.' Vector databases search by meaning - 'find documents similar to this concept.' This semantic search is what powers RAG, recommendation engines, and AI memory.

The core idea: convert any data (text, images, audio) into high-dimensional vectors (embeddings), then find nearest neighbors in that space. Two sentences with similar meaning will have similar vectors, even if they share no words.
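The nearest-neighbor idea can be sketched with toy vectors. The 4-dimensional embeddings below are hand-picked for illustration (a real model outputs hundreds of learned dimensions), but they show the key property: two paraphrases score high on cosine similarity while an unrelated sentence scores low, despite sharing no keywords.

```python
import numpy as np

# Toy 4-dimensional "embeddings" -- hand-picked for illustration,
# not produced by a real model (which would use 768-3072 dims).
vecs = {
    "The cat sat on the mat":     np.array([0.9, 0.1, 0.0, 0.2]),
    "A feline rested on the rug": np.array([0.8, 0.2, 0.1, 0.3]),
    "Quarterly earnings rose 5%": np.array([0.0, 0.9, 0.8, 0.1]),
}

def cosine(a, b):
    """Cosine similarity: 1.0 = same direction, 0.0 = orthogonal."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

a, b, c = vecs.values()
print(cosine(a, b))  # high: same meaning, zero shared keywords
print(cosine(a, c))  # low: unrelated topic
```

A keyword search would find no overlap between the first two sentences at all; the vectors capture the paraphrase.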

[Diagram: documents flow through an embedding model into a 2D vector space with Science, Sports, and Tech clusters; a query lands in the space and retrieves top-k results ("Intro to GPT models...", "How LLMs tokenize...", "Transformer archi..."). Keyword search = exact match; semantic search = meaning match.]

How It Works

1. Embedding Generation

An embedding model converts data into dense vectors (typically 768-3072 dimensions). Each dimension captures a facet of meaning. Similar items cluster together.
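A minimal stand-in for an embedding model, assuming only numpy: a fixed random projection maps a bag-of-words onto a dense unit vector. A real learned model would place synonyms near each other, which this toy cannot do — it only demonstrates the shape of the operation: text in, normalized dense vector out.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB = ["cat", "feline", "mat", "rug", "earnings", "rose"]
DIM = 8  # real embedding models output 768-3072 dimensions

# Fixed random projection: word counts -> dense vector.
# A trained model learns this mapping so that meaning, not spelling,
# determines where a text lands; this toy only mixes word counts.
proj = rng.standard_normal((len(VOCAB), DIM))

def embed(text: str) -> np.ndarray:
    counts = np.array([text.lower().split().count(w) for w in VOCAB], float)
    v = counts @ proj                  # dense DIM-dimensional vector
    n = np.linalg.norm(v)
    return v / n if n else v           # unit-normalize for cosine search

print(embed("the cat sat on the mat").shape)  # (8,)
```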

2. Indexing

Raw vector search is O(n) - too slow for millions of vectors. Index structures like HNSW (Hierarchical Navigable Small World) enable approximate nearest neighbor (ANN) search in milliseconds.
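The O(n) baseline is easy to see in code: exact search scores every stored vector per query. The sketch below (synthetic random vectors, assumed unit-normalized) is what an HNSW index replaces — libraries such as hnswlib or faiss answer the same query in sub-linear time by walking a layered proximity graph, trading exactness for speed.

```python
import numpy as np

rng = np.random.default_rng(1)
db = rng.standard_normal((10_000, 64))           # 10k vectors, 64 dims
db /= np.linalg.norm(db, axis=1, keepdims=True)  # unit-normalize

def brute_force_top_k(query, k=5):
    # Exact search: scores every vector -- O(n) per query.
    # An ANN index (e.g. HNSW) avoids touching most of `db`.
    scores = db @ query                # cosine similarity on unit vectors
    return np.argsort(-scores)[:k]

# A query near stored vector 42 should return 42 as the top hit.
q = db[42] + 0.01 * rng.standard_normal(64)
print(brute_force_top_k(q))
```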

3. Similarity Search

Given a query vector, find the k most similar vectors using cosine similarity, dot product, or Euclidean distance. ANN methods trade a small loss in accuracy for large gains in speed.
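The three metrics are closely related; in particular, on unit-normalized vectors the identity ||a-b||² = 2 - 2·cos(a,b) means Euclidean and cosine ranking agree, and dot product equals cosine. A small check, with arbitrary example vectors:

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([2.0, 1.0, 4.0])

dot = a @ b                                            # 16.0
cos = dot / (np.linalg.norm(a) * np.linalg.norm(b))
l2  = np.linalg.norm(a - b)                            # sqrt(3)

# On unit vectors: ||a-b||^2 == 2 - 2*cos(a,b),
# so nearest-by-Euclidean and nearest-by-cosine give the same ranking.
an, bn = a / np.linalg.norm(a), b / np.linalg.norm(b)
assert abs(np.linalg.norm(an - bn) ** 2 - (2 - 2 * (an @ bn))) < 1e-9
print(dot, cos, l2)
```

This is why many vector databases normalize vectors at insert time and then use the cheaper dot product internally.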

4. Hybrid Search

Combine vector similarity with keyword filtering, metadata matching, and re-ranking. Best results come from mixing semantic and lexical search.
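A minimal sketch of that combination, with made-up documents and 2-D vectors: filter by metadata first, then blend a semantic score with a lexical (keyword-overlap) score under a tunable weight. Production systems use BM25 rather than raw overlap for the lexical side, but the shape is the same.

```python
import numpy as np

docs = [
    {"id": 1, "text": "intro to transformer models", "vec": np.array([0.9, 0.1]), "lang": "en"},
    {"id": 2, "text": "transformer oil maintenance",  "vec": np.array([0.1, 0.9]), "lang": "en"},
    {"id": 3, "text": "attention is all you need",    "vec": np.array([0.8, 0.2]), "lang": "de"},
]

def hybrid_search(q_vec, q_terms, lang, alpha=0.7):
    # 1) metadata filter, 2) blend semantic + lexical scores.
    results = []
    for d in (d for d in docs if d["lang"] == lang):
        sem = float(q_vec @ d["vec"]) / (np.linalg.norm(q_vec) * np.linalg.norm(d["vec"]))
        lex = len(q_terms & set(d["text"].split())) / len(q_terms)
        results.append((alpha * sem + (1 - alpha) * lex, d["id"]))
    return [i for _, i in sorted(results, reverse=True)]

print(hybrid_search(np.array([1.0, 0.0]), {"transformer", "models"}, "en"))
```

Note how the metadata filter removes doc 3 before any scoring, and the lexical term keeps the keyword-only match (doc 2) from disappearing entirely.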

5. Retrieval & Ranking

Retrieved results are re-ranked by a cross-encoder model for higher precision. This two-stage approach (retrieve broadly, rank precisely) balances speed and quality.
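The two-stage pattern, sketched under one big assumption: the real cross-encoder (a model that scores query and document jointly) is stood in for by simple token overlap, so only the pipeline shape is faithful here. Stage 1 is assumed already done — `candidates` are the broad top-N hits from the vector index.

```python
def cross_encoder_score(query: str, doc: str) -> float:
    # Stand-in for a real cross-encoder model: Jaccard token overlap.
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / len(q | d)

def retrieve_then_rerank(query, candidates, k=2):
    # Stage 2: re-score the broad candidate set precisely, keep top-k.
    ranked = sorted(candidates,
                    key=lambda d: cross_encoder_score(query, d),
                    reverse=True)
    return ranked[:k]

hits = ["how llms tokenize text", "intro to gpt models", "transformer architecture"]
print(retrieve_then_rerank("gpt models intro", hits))
```

The split matters because cross-encoders are too slow to score millions of documents, but cheap enough for the few hundred the index returns.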

Key Components

Pinecone

Managed vector DB, serverless, enterprise-grade, simple API

pgvector

PostgreSQL extension - add vector search to your existing database

Weaviate

Open-source, hybrid search, built-in vectorization modules

Qdrant

Rust-based, high-performance, rich filtering, open-source

ChromaDB

Developer-friendly, in-process, great for prototyping RAG apps

Milvus

Cloud-native, GPU-accelerated, scales to billions of vectors

Who's Building With This

Perplexity

Vector search over web-scale indices for real-time AI search

Spotify

Embedding-based music recommendations - find songs you'll love

Pinterest

Visual search - find similar pins using image embeddings

Shopify

Semantic product search - find what customers mean, not just what they type

Key Takeaway

Vector databases are the bridge between AI models and real-world data. They enable search by meaning rather than keywords - turning every database into a knowledge base that AI can reason over.

References & Further Reading

  1. Malkov & Yashunin, "Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs" (HNSW)
  2. Pinecone Documentation
  3. pgvector: Open-source vector similarity search for Postgres
  4. Weaviate Documentation
