captum

$npx mdskill add mkurman/zorai/captum

Explain neural model decisions using integrated gradients and feature attribution.

  • Reveals hidden patterns in vision and text model predictions.
  • Depends on PyTorch models and the captum library.
  • Calculates attributions via gradient-based methods and occlusion.
  • Outputs numerical scores and visual heat maps for analysis.

SKILL.md

.github/skills/captumView on GitHub ↗
---
name: captum
description: "Captum (PyTorch) — model interpretability and feature attribution. Integrated Gradients, DeepLIFT, SmoothGrad, Occlusion, SHAP approximation, and Layer-wise Relevance Propagation. For vision and text models."
tags: [captum, explainability, feature-attribution, integrated-gradients, pytorch, interpretability, zorai]
---
## Overview

Captum (Comprehension in PyTorch) provides model interpretability for PyTorch models. Implements Integrated Gradients, Gradient SHAP, DeepLIFT, Occlusion, Feature Ablation, and Layer Conductance. Supports computer vision, NLP, and tabular models.

## Installation

```bash
uv pip install captum
```

## Integrated Gradients

```python
import torch
import torch.nn as nn
from captum.attr import IntegratedGradients

model = nn.Linear(10, 2)
input = torch.randn(1, 10)
baseline = torch.zeros(1, 10)

ig = IntegratedGradients(model)
attrs = ig.attribute(input, baseline, target=0)
print(f"Feature attributions: {attrs}")
```

## Occlusion

```python
from captum.attr import Occlusion

occ = Occlusion(model)
attrs = occ.attribute(input, target=0, sliding_window_shapes=(1,))  # 1D
print(attrs)
```

## Visualization

```python
from captum.attr import visualization as viz

_ = viz.visualize_image_attr(
    attrs.squeeze().numpy(),
    original_image=input.squeeze().numpy(),
    method="heat_map",
    sign="absolute_value",
    show_colorbar=True,
)
```

## References
- [Captum docs](https://captum.ai/docs/)
- [Captum GitHub](https://github.com/pytorch/captum)

More from mkurman/zorai

SkillDescription
account-management>
agile-scrum>
albumentationsFast image augmentation library (Albumentations). 70+ transforms for classification, segmentation, object detection, keypoints, and pose estimation. Optimized OpenCV-based pipeline with unified API across all CV tasks. Supports images, masks, bounding boxes, and keypoints simultaneously. Note: classic Albumentations (MIT) is no longer maintained; successor AlbumentationsX uses AGPL-3.0. For torchvision-native augmentations, use torchvision.transforms.v2.
aml-complianceAnti-Money Laundering (AML) and Know Your Customer (KYC) compliance workflow. Sanctions screening, PEP detection, transaction monitoring, suspicious activity reporting (SAR), and OFAC compliance.
anki-connectThis skill is for interacting with Anki through AnkiConnect, and should be used whenever a user asks to interact with Anki, including to read or modify decks, notes, cards, models, media, or sync operations.
approval-checkpoint-long-taskCanonical long-task pack for daemon-managed work with deliberate approval checkpoints, status summaries, rollback notes, and mobile-safe governance-aware updates.
auditing-goal-artifactsUse when reviewing recent zorai goal run outputs, closure markers, ledgers, or evidence bundles to judge whether completion is credible or to identify remaining uncertainty.
autogenAutoGen (Microsoft) — multi-agent conversation framework. Agent-to-agent chat, code generation & execution, tool use, group chat, and human-in-the-loop. Build collaborative AI systems with specialized agents.
backtraderPython backtesting framework for trading strategies. Data feeds, brokers, analyzers, and live trading support. Strategy development with commission models, slippage, and signal-based execution.
beautiful-mermaidRender Mermaid diagrams as SVG and PNG using the Beautiful Mermaid library. Use when the user asks to render a Mermaid diagram.