systematic-probing

Name: systematic-probing
Author: yogsoth-ai/de-anthropocentric-research-engine

$npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/systematic-probing

Anthropic-style systematic probing: exhaustive coverage of all threat surfaces with structured attack generation and execution.

SKILL.md

.github/skills/systematic-probingView on GitHub ↗

---
name: systematic-probing
description: "Strategy: AI-safety systematic probing — enumerate all threat surfaces, generate attack vectors per surface, execute probes, and aggregate findings across the full attack space."
type: strategy
used-by: [red-teaming]
tactics: [structured-attack-campaign, assumption-cascade]
---

# Systematic Probing Strategy

Anthropic-style systematic probing: exhaustive coverage of all threat surfaces with structured attack generation and execution.

## Method

1. **threat-surface-mapping** enumerates all attackable surfaces of the artifact
2. **attack-vector-generation** produces specific attacks per surface
3. Vectors prioritized by expected severity and likelihood
4. **probe-execution** executes each attack, records success/failure/partial
5. Failed probes trigger deeper investigation via follow-up vectors
6. **attack-resilience-scoring** computes coverage and resilience metrics

## Budget Table

| Parameter | S | M | L |
|---|---|---|---|
| Attack vectors | 5 | 12 | 20 |
| Probing rounds | 3 | 6 | 10 |
| Personas | 2 | 4 | 6 |
| Assumption checks | 5 | 10 | 20 |

## Orchestration

```
threat-surface-mapping → [enumerate surfaces]
→ [for each surface]:
    attack-vector-generation (generate vectors)
    → [for each vector]:
        probe-execution (execute attack)
        → (if partial success: generate follow-up vectors)
→ finding-aggregation → attack-resilience-scoring
```

## Subagents

- threat-surface-mapping (surface enumeration)
- attack-vector-generation (vector design)
- probe-execution (attack execution)
- finding-aggregation (result synthesis)
- attack-resilience-scoring (metric computation)

More from yogsoth-ai/de-anthropocentric-research-engine

Skill	Description
abductive-hypothesis-generation	Strategy: 面对异常的最佳解释推理
ablation-brainstorm	Remove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
ablation-component-mapping	Map system architecture to ablatable units for ablation studies
ablation-design	Design ablation studies to isolate component contributions in ML systems
ablation-execution	Remove components one by one from a system, record the response/impact of each removal.
abp-vulnerability-classification	Classify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
abstraction-extraction	Extract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
abstraction-ladder	Perform bisociation at multiple abstraction levels
abstraction-laddering	Move between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
abstraction-to-design	Abstract biological principle to design principle. Bridge from biology to engineering.