datapowers
datapowers provides a robust framework for data mining and machine learning workflows, ensuring statistical integrity and rigorous validation.
datapowers is a professional data mining and machine learning workflow protocol for Claude Code, Gemini CLI, and autonomous agents. It bridges software engineering rigor (TDD) with data science's statistical sovereignty β enforcing leakage prevention, three-layer validation, and evidence-based delivery as non-negotiable pipeline gates.
π Why datapowers?
Statistical Integrity > Model Metrics. datapowers is not a prompt collection β it is an Audit Protocol.
- π‘οΈ Leakage Defense β Systematic temporal and preprocessing leakage audits at the source, before any evaluation.
- βοΈ Three-Layer Validation β Physical (Schema), Logical (Business Rules), and Statistical (Distribution Drift) gates block training on bad data.
- π« Point Estimate Illusions β Mandatory confidence intervals and significance tests on all reported results.
- π§ Hypothesis-Driven β Forces falsifiable goal-setting and a baseline expectation before touching raw data.
βοΈ How It Works: Skill Loading Flow
datapowers uses a hook-based on-demand loading architecture inspired by superpowers.
Session Start
β
βΌ
hooks/session-start fires
β
βββ Reads skills/using-datapowers/SKILL.md
β
βββ Injects it as session context (EXTREMELY_IMPORTANT)
β
βΌ
Agent now knows:
β’ All 20 available skills
β’ Trigger keyword for each skill
β’ When to invoke each skill
β
βΌ
Analyst asks a question
β
βΌ
Agent matches trigger β invokes Skill tool
β
βββ Loads skills/<name>/SKILL.md (Iron Laws, Hard Gates, Procedure)
β
βββ Loads supplementary files as needed:
β’ reference docs (e.g., assertion-anti-patterns.md)
β’ pressure tests (test-pressure-1.md, ...)
β’ agent prompt templates (analyst-prompt.md, ...)
β’ utility scripts (pipeline-pollution-detection.sh)
Key design principles:
using-datapowersis always loaded β it is the routing table for all other skills.- All other skills are loaded on demand β no unnecessary context inflation.
- Supplementary files are scoped β each skill only loads what it needs.
- Session state persists in
artifacts/analysis_manifest.jsonβ the agent can resume any session without losing context.
π Skills Reference
All 20 skills, organized by phase.
Phase 0 β Entry & State
| Skill | When to Use | Key Output |
|---|---|---|
using-datapowers | Always loaded at session start | Routing table: all skills + trigger keywords |
analysis-manifest | Session start; "Where are we?"; after brainstorming | artifacts/analysis_manifest.json β single source of truth for session state |
Phase 1 β Design
| Skill | When to Use | Key Output |
|---|---|---|
brainstorming | "Analyze X", "Design spec", "Hypothesis" | docs/datapowers/specs/ design doc with β₯3 falsifiable hypotheses |
writing-analysis-plans | After brainstorming approval | docs/datapowers/plans/ plan with 15β30 min tasks, no placeholders |
Phase 2 β Data Understanding
| Skill | When to Use | Key Output |
|---|---|---|
data-profiling | New dataset; before any subagent dispatch | artifacts/data_profile.md β high-density PII-free profile |
data-exploration | First-time dataset exploration; EDA requested | docs/datapowers/eda/ report; leakage candidate flags |
data-validation | Before any model training or feature engineering | Pandera validation report; BLOCK / PROCEED verdict |
Phase 3 β Feature Engineering & Modeling
| Skill | When to Use | Key Output |
|---|---|---|
leakage-guard | Temporal dataset; before feature engineering review | BLOCKED / NEEDS_HUMAN_REVIEW / APPROVED verdict |
feature-engineering | Building or transforming features | Fitted transformers saved to artifacts/; Feature Registry updated |
test-driven-data-science | Before any model.fit() call | Three-layer assertion results; training blocked on CRITICAL failures |
model-selection | Choosing between candidate models | Baseline comparison table; Optuna HPO results (β₯50 trials) |
model-evaluation | Final test set evaluation | Bootstrap CIs; SHAP summary; one-time test set gate |
Phase 4 β Execution & Review
| Skill | When to Use | Key Output |
|---|---|---|
executing-plans | "Start tasks", "Follow plan" | Two-stage review per task: Statistical β Code Quality |
subagent-driven-analysis | Multi-task parallel analysis | Parallel analyst subagents with isolated context; review gating |
requesting-statistical-review | "Audit results", "Significance", after task completion | Statistical review verdict: APPROVED / ISSUES FOUND / BLOCKED |
debugging-pipelines | Pipeline error; unexpected model behavior; performance drop | Root cause investigation log; PSI drift report |
Phase 5 β Delivery
| Skill | When to Use | Key Output |
|---|---|---|
verification-before-delivery | "Done", "Complete", before any delivery | Artifact integrity checklist; reproducibility confirmation |
report-writing | Final stakeholder report | Reproducibility-header report with CIs and significance tests |
finishing-an-analysis-branch | Analysis complete; ready to deliver | Commit / PR / archive options; manifest-gated delivery |
Meta
| Skill | When to Use | Key Output |
|---|---|---|
writing-data-skills | "New skill", "Add skill", "Contribute skill" | New SKILL.md passing Statistical Pressure Test (3 scenarios) |
π The Core Workflow
brainstorming β writing-analysis-plans
β
βΌ
data-profiling β data-exploration β data-validation
β
βΌ
leakage-guard β feature-engineering β test-driven-data-science
β
βΌ
model-selection β model-evaluation
β
βΌ
executing-plans (with requesting-statistical-review per task)
β
βΌ
verification-before-delivery β report-writing β finishing-an-analysis-branch
At any step: analysis-manifest tracks completed stages. debugging-pipelines handles any unexpected failures. subagent-driven-analysis can parallelize independent tasks.
π οΈ Installation
Claude Code
# Install the plugin
/plugin install https://github.com/zpower426/datapowers
# Or clone and install locally
git clone https://github.com/zpower426/datapowers
cd datapowers
/plugin install .
How it works: .claude-plugin/plugin.json registers the hooks directory. On every session start, hooks/session-start fires and injects using-datapowers skill content into the session context automatically.
Gemini CLI
gemini extensions install https://github.com/zpower426/datapowers
The GEMINI.md file and gemini-extension.json manifest handle skill path registration.
OpenCode
git clone https://github.com/zpower426/datapowers ~/.opencode/plugins/datapowers
The .opencode/plugins/datapowers.js plugin injects bootstrap context via system prompt transform and auto-registers the skills directory.
Cursor / Codex
See .cursor-plugin/plugin.json and .codex/INSTALL.md for setup instructions.
βοΈ Iron Laws
| Domain | Iron Law |
|---|---|
| EDA | NO MODELING WITHOUT EXPLORATORY DATA ANALYSIS |
| Validation | NO TRAINING WITHOUT DATA QUALITY VALIDATION |
| Leakage | NO TRANSFORMERS FIT ON FULL DATASET BEFORE TRAIN/TEST SPLIT |
| Evaluation | TEST SET EVALUATED EXACTLY ONCE β AT THE END |
| Delivery | NO DELIVERY WITHOUT CONFIDENCE INTERVALS AND SIGNIFICANCE TESTS |
| Review | STATISTICAL AUDIT ALWAYS PRECEDES CODE QUALITY REVIEW |
π Star History
π€ Contributing
Contributions welcome. Read the writing-data-skills skill first β every new skill must pass the Statistical Pressure Test (balanced data, 1:100 imbalance, n < 200 small sample) before merge.
β€οΈ Acknowledgments
datapowers is inspired by and built on the shoulders of superpowers by Jesse Vincent (@obra).
The hook-based on-demand skill loading architecture, Iron Laws pattern, and Hard Gates design are all adapted from the superpowers family β extended here for the specific demands of statistical rigor in data science.