scaling-design

Name: scaling-design
Author: yogsoth-ai/de-anthropocentric-research-engine

$npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/scaling-design

**Question**: How does performance scale with resources?

SKILL.md

.github/skills/scaling-designView on GitHub ↗

---
name: scaling-design
description: "Design scaling experiments to characterize performance-resource relationships"
version: 1.0.0
category: experiment-execution
type: strategy
used-by: experiment-design
sops:
  - factor-identification
  - level-specification
  - metric-specification
  - sample-size-estimation
  - design-matrix-construction
tactics:
  - statistical-method-selection
  - budget-constrained-design
---

# Strategy: Scaling Design

**Question**: How does performance scale with resources?

## Methodology

- **Neural Scaling Laws** (Kaplan 2020, Hoffmann 2022): Power-law relationships between compute/data/parameters and loss.
- **Compute-Optimal Scaling** (Chinchilla): Find optimal allocation between model size and data.
- **Data Scaling**: Characterize learning curves as function of dataset size.
- **Model Scaling**: Performance vs. parameter count at fixed data.
- **Inference Scaling**: Throughput/latency vs. batch size, sequence length, model size.

## Execution Flow

1. **factor-identification** → Identify scaling axes (data, compute, parameters, time)
2. **level-specification** → Define scale points (geometric progression, typically 4-8 points)
3. **metric-specification** → Define metrics at each scale (loss, downstream task, efficiency)
4. **design-matrix-construction** → Build scaling experiment grid
5. **sample-size-estimation** → Determine replicates needed for reliable curve fitting
6. **budget-constrained-design** (tactic) → Optimize which scale points to run given budget

## Budget Gate

| Scaling Type | Scale Points | Replicates | Min Runs | Typical Cost |
|-------------|-------------|------------|----------|--------------|
| Data scaling | 4-6 | 3 | 12-18 | Low (same model, subset data) |
| Model scaling | 4-8 | 2-3 | 8-24 | High (different model sizes) |
| Compute-optimal | 6-10 per iso-FLOP | 1-2 | 12-20 | Very high |
| Inference scaling | 5-10 | 5 | 25-50 | Low (inference only) |

More from yogsoth-ai/de-anthropocentric-research-engine

Skill	Description
abductive-hypothesis-generation	Strategy: 面对异常的最佳解释推理
ablation-brainstorm	Remove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
ablation-component-mapping	Map system architecture to ablatable units for ablation studies
ablation-design	Design ablation studies to isolate component contributions in ML systems
ablation-execution	Remove components one by one from a system, record the response/impact of each removal.
abp-vulnerability-classification	Classify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
abstraction-extraction	Extract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
abstraction-ladder	Perform bisociation at multiple abstraction levels
abstraction-laddering	Move between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
abstraction-to-design	Abstract biological principle to design principle. Bridge from biology to engineering.