chromadb

Name: chromadb
Author: mkurman/zorai

$ npx mdskill add mkurman/zorai/chromadb

Store and retrieve document embeddings for RAG tasks.

Enables semantic search and metadata filtering on text data.
Depends on sentence-transformers for automatic embedding generation.
Selects results using vector similarity and optional filters.
Returns ranked document snippets directly to the agent.

SKILL.md

.github/skills/chromadb

---
name: chromadb
description: "Chroma — AI-native embedding database. In-process, lightweight vector store with automatic embedding, metadata filtering, and full-text search. Simplest path from prototype to production RAG."
tags: [chromadb, vector-database, embeddings, rag, semantic-search, python, zorai]
---
## Overview

Chroma is an AI-native embedding database optimized for RAG workflows. It is lightweight and runs in-process, so no separate server is required. It provides automatic embedding via sentence-transformers, metadata filtering, and semantic search, which keeps the path from prototype to production RAG short.
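
To make "semantic search with metadata filtering" concrete, here is a toy, pure-Python sketch of the idea. It is not Chroma's implementation: the `toy_query` helper, the two-dimensional vectors, and the record layout are all hypothetical, chosen only to show filtering by metadata first and then ranking the survivors by cosine similarity.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def toy_query(records, query_vector, n_results=3, where=None):
    """Hypothetical helper: filter on exact metadata matches, then rank by similarity."""
    where = where or {}
    candidates = [
        r for r in records
        if all(r["metadata"].get(key) == value for key, value in where.items())
    ]
    ranked = sorted(
        candidates,
        key=lambda r: cosine_similarity(r["vector"], query_vector),
        reverse=True,  # most similar first
    )
    return [r["document"] for r in ranked[:n_results]]


# Hand-written toy vectors; a real system would get these from an embedding model.
records = [
    {"document": "Paris is the capital of France.", "vector": [0.9, 0.1],
     "metadata": {"country": "France"}},
    {"document": "Berlin is the capital of Germany.", "vector": [0.8, 0.2],
     "metadata": {"country": "Germany"}},
    {"document": "Lyon is a city in France.", "vector": [0.1, 0.9],
     "metadata": {"country": "France"}},
]

top = toy_query(records, [1.0, 0.0], n_results=2, where={"country": "France"})
print(top)
```

With the filter applied, the Germany record is excluded before ranking, so only the two France records can be returned, most similar first.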

## Installation

```bash
uv pip install chromadb
```

## Basic Usage

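```python
import chromadb

# Persist the database to a local directory; no separate server is needed.
client = chromadb.PersistentClient(path="./chroma_data")

# get_or_create_collection keeps the script re-runnable: create_collection
# raises an error if "documents" already exists from an earlier run.
collection = client.get_or_create_collection(name="documents")

# Add documents with metadata (embeddings are generated automatically)
collection.add(
    documents=["Paris is the capital of France.", "Berlin is the capital of Germany."],
    metadatas=[{"country": "France"}, {"country": "Germany"}],
    ids=["doc1", "doc2"],
)

# Query with a metadata filter. n_results is an upper bound; the filter
# leaves only one matching document here.
results = collection.query(
    query_texts=["What is the capital of France?"],
    n_results=3,
    where={"country": "France"},
)

# Results are grouped per query text; [0] selects the hits for the first query.
print(results["documents"][0])
```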

## References
- [Chroma docs](https://docs.trychroma.com/)
- [Chroma GitHub](https://github.com/chroma-core/chroma)
