bentoml
$
npx mdskill add mkurman/zorai/bentomlDeploy ML models to production with OpenAPI and Kubernetes.
- Packages any framework into portable Bento units for instant serving.
- Integrates with Docker, Kubernetes, AWS, GCP, and Azure platforms.
- Generates OpenAPI/Swagger specs automatically from service definitions.
- Delivers JSON-formatted predictions via RESTful API endpoints.
SKILL.md
.github/skills/bentomlView on GitHub ↗
---
name: bentoml
description: "BentoML — model serving and deployment. Build prediction services from any ML framework with OpenAPI/Swagger. Containerize, deploy to Kubernetes, AWS, GCP, Azure. Adaptive batching and GPU support."
tags: [bentoml, model-serving, deployment, mlops, api, kubernetes, zorai]
---
## Overview
BentoML packages ML models with service definitions, dependencies, environment config, and deployment targets into a portable "Bento." Deploy to Kubernetes (Kserve, Seldon), AWS SageMaker, GCP Vertex AI, or as a standalone Docker container.
## Installation
```bash
uv pip install bentoml
```
## Service Definition
```python
import bentoml
from bentoml.io import JSON
import numpy as np
iris_clf = bentoml.sklearn.get("iris_model:latest")
@bentoml.service
class IrisClassifier:
def __init__(self):
self.model = iris_clf.to_runner()
self.model.init_local()
@bentoml.api(input=JSON(), output=JSON())
def classify(self, input_data):
result = self.model.run(np.array([input_data["features"]]))
return {"class": int(result[0]), "probabilities": result[1].tolist()}
```
## Build & Deploy
```bash
bentoml build # creates a Bento
bentoml containerize iris_classifier:latest # Docker image
docker run -p 3000:3000 iris_classifier:latest
```
## References
- [BentoML docs](https://docs.bentoml.com/)
- [BentoML GitHub](https://github.com/bentoml/BentoML)More from mkurman/zorai
- account-management>
- agile-scrum>
- albumentationsFast image augmentation library (Albumentations). 70+ transforms for classification, segmentation, object detection, keypoints, and pose estimation. Optimized OpenCV-based pipeline with unified API across all CV tasks. Supports images, masks, bounding boxes, and keypoints simultaneously. Note: classic Albumentations (MIT) is no longer maintained; successor AlbumentationsX uses AGPL-3.0. For torchvision-native augmentations, use torchvision.transforms.v2.
- aml-complianceAnti-Money Laundering (AML) and Know Your Customer (KYC) compliance workflow. Sanctions screening, PEP detection, transaction monitoring, suspicious activity reporting (SAR), and OFAC compliance.
- anki-connectThis skill is for interacting with Anki through AnkiConnect, and should be used whenever a user asks to interact with Anki, including to read or modify decks, notes, cards, models, media, or sync operations.
- approval-checkpoint-long-taskCanonical long-task pack for daemon-managed work with deliberate approval checkpoints, status summaries, rollback notes, and mobile-safe governance-aware updates.
- auditing-goal-artifactsUse when reviewing recent zorai goal run outputs, closure markers, ledgers, or evidence bundles to judge whether completion is credible or to identify remaining uncertainty.
- autogenAutoGen (Microsoft) — multi-agent conversation framework. Agent-to-agent chat, code generation & execution, tool use, group chat, and human-in-the-loop. Build collaborative AI systems with specialized agents.
- backtraderPython backtesting framework for trading strategies. Data feeds, brokers, analyzers, and live trading support. Strategy development with commission models, slippage, and signal-based execution.
- beautiful-mermaidRender Mermaid diagrams as SVG and PNG using the Beautiful Mermaid library. Use when the user asks to render a Mermaid diagram.