Question 1

What is Chroma DB?

Accepted Answer

Chroma is an open-source, AI-native embedding database for storing, indexing, and retrieving high-dimensional vectors that LLM applications use to look up relevant context at query time. The project is released under Apache 2.0, free for commercial use, with more than 26,000 GitHub stars and over 11 million downloads per month as of 2025. The server-mode binary listens on TCP port 8000 by default.

The open-source Chroma server is the self-hosted path the company also sells as Chroma Cloud. On a VPS the operator runs it from PyPI as chroma run --path /chroma_db_path or from the chromadb/chroma Docker image, with the persistent directory on local SSD and the HNSW indices loaded into RAM at start.

Question 2

Is Chroma DB free?

Accepted Answer

The open-source single-node version is free under the Apache 2.0 license, with no limits on collection count, document count, or commercial use. Chroma Cloud is a separate paid managed service from the same company, with a free tier up to roughly one million embeddings, and usage-based pricing on storage at around two cents per gigabyte per month above that. Self-hosting on a VPS keeps both the data and the cost predictable for any team running a private retrieval pipeline.

Question 3

What is Chroma DB used for?

Accepted Answer

The dominant use case is retrieval-augmented generation, where a chatbot grounds its answers in a private document corpus through nearest-neighbor lookup. Semantic search over legal documents, scientific papers, support tickets, or internal wikis follows the same pattern at a different scale. Other common uses include long-term memory for AI agents, small to mid-scale recommendation systems, and code search over an embedded corpus of source repositories. Each pattern stores embeddings in collections and queries them by similarity to a query vector.

Question 4

How does Chroma DB work?

Accepted Answer

Documents are converted into numerical embeddings by an embedding model, stored in a Chroma collection alongside metadata, and indexed in a Hierarchical Navigable Small World graph. At query time the question is embedded the same way and the database returns the nearest vectors by L2, cosine, or inner-product distance. Each collection has its own embedding function and its own HNSW index, with metadata filters such as $eq, $gt, and $in applied at the same step as the vector search itself.

Question 5

Does Chroma DB run locally?

Accepted Answer

Chroma supports three client modes for local and remote use. EphemeralClient stores data in memory only for tests, PersistentClient writes to a local directory for development and small deployments, and HttpClient connects to a separately running Chroma server. The server mode is the common production path, started with chroma run --path /your/db/path or from the chromadb/chroma Docker image. The same Python or JavaScript code talks to all three modes, so an application can move from local development to a remote VPS without code changes.

Question 6

How much RAM does Chroma DB need?

Accepted Answer

The minimum recommended RAM is 2 GB on the host. For production sizing, the official formula is N (millions of vectors) equals R (system RAM in GB) times 0.245, with one-thousand-and-twenty-four-dimensional embeddings, three metadata records, and a small document per embedding. That works out to roughly four gigabytes of RAM per one million vectors at 1024 dimensions, with the HNSW index resident in memory at all times during operation. Smaller embeddings like the 384-dim default need proportionally less RAM headroom.

Question 7

Can Chroma DB run in Docker?

Accepted Answer

The official image is published as chromadb/chroma on Docker Hub for self-hosters. The common command is docker run -p 8000:8000 -v /path/on/host:/chroma/chroma chromadb/chroma, which exposes port 8000 and mounts a host directory as the persistent volume. On a VPS with root access, the operator can install Docker Engine, run the image as a long-lived service, and back up the volume by snapshotting the host directory. The same image is the one used by most self-hosted Chroma tutorials in 2025.

Question 8

What is the default embedding function in Chroma?

Accepted Answer

The default embedding function is Sentence Transformers all-MiniLM-L6-v2 at 384 dimensions, run locally through ONNX Runtime. No external API key is required to get started, which is one of the reasons people pick Chroma for prototyping over services that need cloud credentials on the first install. Alternative providers are wired in by name in the client config, including OpenAI, Cohere, Google PaLM and Gemini, Hugging Face, Jina, Voyage, and Ollama, plus custom embedding functions written against a simple Python interface.

Question 9

What distance metrics does Chroma support?

Accepted Answer

Chroma supports three HNSW distance metrics. L2 squared Euclidean is the default, with cosine and inner product as the alternatives. The metric is set when the collection is created and is fixed for that collection, since the HNSW index is built around it. L2 and inner product are sensitive to vector magnitude, so embeddings are commonly normalized before being added when those metrics are used. Cosine distance handles the magnitude issue internally and is the safe pick for general semantic search workloads.

Question 10

How does Chroma DB store data?

Accepted Answer

A chroma.sqlite3 file at the persistent directory holds system metadata for tenants, databases, collections, and segments. Each collection gets a UUID-named subdirectory containing its HNSW index files alongside the index metadata. Backups happen at the filesystem level, by snapshotting or copying the persistent directory with the server briefly stopped for a consistent point-in-time copy. The older DuckDB backend was removed in version 0.4.0 back in 2023 in favor of SQLite as the unified store for both local and client-server deployments.

Chroma VPS Docker Hosting

Why Run Chroma on GreenGeeks

Strong CPU Throughput for Queries

RAM Headroom for HNSW Indices

Fast SSD for SQLite and Index

24/7 Uptime for a Private RAG API

Self-Managed VPS Plans

VPS 4GB

VPS 8GB

VPS 16GB

VPS 32GB

What is Chroma?

What You Can Build with Chroma

The Key Features of Chroma

Frequently Asked Questions

Launch your Chroma DB on a VPS