weaviate

$npx mdskill add mkurman/zorai/weaviate

Execute hybrid search and multi-modal retrieval from vector databases.

  • Retrieves documents using vector similarity and keyword matching.
  • Integrates with OpenAI, Cohere, HuggingFace, and CLIP vectorizers.
  • Decides results by combining semantic vectors with BM25 keyword scores.
  • Delivers structured object data containing properties and metadata.

SKILL.md

.github/skills/weaviateView on GitHub ↗
---
name: weaviate
description: "Weaviate — open-source vector database with built-in ML. Hybrid search (vector + keyword), generative search, graph connections, multi-modal (text + image), and automatic schema inference."
tags: [vector-database, hybrid-search, rag-retrieval, embedding-indexes, weaviate]
---
## Overview

Weaviate is an open-source vector database with built-in vectorization modules (OpenAI, Cohere, HuggingFace, Transformers, CLIP, multi-modal). Supports hybrid search (vector + BM25 keyword), generative search (RAG with LLM integration), and multi-modal data.

## Installation

```bash
docker run -p 8080:8080 semitechnologies/weaviate:latest
```

## Python Client

```python
import weaviate
import weaviate.classes as wvc

client = weaviate.connect_to_local()
collection = client.collections.create(
    name="Documents",
    vectorizer_config=wvc.config.Configure.Vectorizer.text2vec_transformers(),
)
collection.data.insert({
    "title": "Paris",
    "content": "Paris is the capital of France. It is known for the Eiffel Tower.",
})

# Hybrid search (vector + keyword)
response = collection.query.hybrid(query="French capital", limit=5)
for obj in response.objects:
    print(obj.properties)
```

## References
- [Weaviate docs](https://weaviate.io/developers/weaviate)
- [Weaviate GitHub](https://github.com/weaviate/weaviate)

More from mkurman/zorai

SkillDescription
account-management>
agile-scrum>
albumentationsFast image augmentation library (Albumentations). 70+ transforms for classification, segmentation, object detection, keypoints, and pose estimation. Optimized OpenCV-based pipeline with unified API across all CV tasks. Supports images, masks, bounding boxes, and keypoints simultaneously. Note: classic Albumentations (MIT) is no longer maintained; successor AlbumentationsX uses AGPL-3.0. For torchvision-native augmentations, use torchvision.transforms.v2.
aml-complianceAnti-Money Laundering (AML) and Know Your Customer (KYC) compliance workflow. Sanctions screening, PEP detection, transaction monitoring, suspicious activity reporting (SAR), and OFAC compliance.
anki-connectThis skill is for interacting with Anki through AnkiConnect, and should be used whenever a user asks to interact with Anki, including to read or modify decks, notes, cards, models, media, or sync operations.
approval-checkpoint-long-taskCanonical long-task pack for daemon-managed work with deliberate approval checkpoints, status summaries, rollback notes, and mobile-safe governance-aware updates.
auditing-goal-artifactsUse when reviewing recent zorai goal run outputs, closure markers, ledgers, or evidence bundles to judge whether completion is credible or to identify remaining uncertainty.
autogenAutoGen (Microsoft) — multi-agent conversation framework. Agent-to-agent chat, code generation & execution, tool use, group chat, and human-in-the-loop. Build collaborative AI systems with specialized agents.
backtraderPython backtesting framework for trading strategies. Data feeds, brokers, analyzers, and live trading support. Strategy development with commission models, slippage, and signal-based execution.
beautiful-mermaidRender Mermaid diagrams as SVG and PNG using the Beautiful Mermaid library. Use when the user asks to render a Mermaid diagram.