Rune

Multi-agent engineering orchestration for Claude Code, enhancing code quality and compliance.

Rune

Multi-agent engineering orchestration for Claude Code.

Built on Discipline Engineering — a proof-based architecture that treats specification compliance, not just code quality, as the primary metric.

Plan, implement, review, test, and audit your codebase using coordinated Agent Teams — each teammate with its own dedicated context window.

Claude Code Version License: MIT Agents Skills


What Is This?

Rune is a Claude Code plugin that turns a single-agent coding session into a coordinated multi-agent engineering team. It provides 152 specialized AI agents, 69 skills, and a 45-phase end-to-end pipeline that handles planning, implementation, code review, testing, and PR creation — all orchestrated through Claude Code's Agent Teams.

Compatibility: Requires Claude Code 2.1.81+ with Agent Teams support. macOS 12+ or Linux. See full requirements.


Why This Exists

Claude Code is powerful on its own — but a single agent has a single context window. As tasks grow in scope (reviewing a 50-file diff, planning a feature across multiple services, running a full implementation pipeline), one context window becomes the bottleneck:

  • Context saturation — A single agent reviewing 40 files loses focus on file 35. Rune gives each reviewer its own full context window, so the last file gets the same attention as the first.
  • Specialization over generalization — One agent trying to catch security issues, performance bugs, and naming inconsistencies simultaneously does none of them well. Rune dispatches Ward Sentinel for security, Ember Oracle for performance, and Pattern Seer for consistency — each focused on what it does best.
  • Parallelism — Sequential work on 6 implementation tasks takes 6x as long. Swarm workers claim and complete tasks independently, bounded only by file-level conflicts.
  • Separation of concerns — Planning, implementing, reviewing, and testing in one context creates confirmation bias (the same agent reviews code it just wrote). Rune enforces phase boundaries: different agents plan, build, and critique.

These problems are symptoms of a deeper issue: AI agents optimize for completion signals, not specification compliance. Rune's architecture is designed around the Discipline Engineering framework — five layers of decomposition, comprehension verification, proof-based completion, and anti-rationalization gates that ensure agents deliver what was specified, not just what compiles.

The trade-off is token cost — multi-agent workflows consume more tokens than a single session. Rune is designed for cases where quality, thoroughness, and coverage matter more than minimizing API usage.


<a name="token-warning"></a>

[!WARNING] Rune is token-intensive and time-intensive.

Each workflow spawns multiple agents, each with its own dedicated context window. This means higher token consumption and longer runtimes than single-agent usage.

WorkflowTypical DurationWhy
/rune:devise10–30 minUp to 7 agents across 7 phases (brainstorm, research, synthesize, forge, review)
/rune:appraise5–20 minUp to 8 review agents analyzing your diff in parallel — scales with LOC changed
/rune:audit10–30 minFull codebase scan — same agents, broader scope
/rune:strive10–30 minSwarm workers implementing tasks in parallel
/rune:arc1–2 hoursFull 45-phase pipeline (forge → plan review → work → gap analysis → code review → mend → test → ship → merge)
/rune:arc (complex)up to 3 hoursLarge plans with multiple review-mend convergence loops

/rune:arc is intentionally slow because it runs the entire software development lifecycle autonomously — planning enrichment, parallel implementation, multi-agent code review, automated fixes, 3-tier testing, and PR creation. Each phase spawns and tears down a separate agent team. The result is higher quality, but it takes time.

Want faster iterations? Run the steps individually instead of the full pipeline:

/rune:plan   →  /rune:work   →  /rune:review
 (10–30 min)    (10–30 min)     (5–20 min)

This gives you the same core workflow (plan → implement → review) in 25–80 minutes with manual control between steps — versus 1–3 hours for /rune:arc which adds forge enrichment, gap analysis, automated mend loops, 3-tier testing, and PR creation on top.

Claude Max ($200/month) or higher recommended. Use --dry-run where available to preview scope before committing.


Install

/plugin marketplace add https://github.com/vinhnxv/rune
/plugin install rune

Restart Claude Code after installation.

<details> <summary>Local development</summary>
claude --plugin-dir /path/to/rune
</details>

Setup

Rune requires Agent Teams. Enable it in .claude/settings.json or .claude/settings.local.json:

{
  "env": {
    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
  },
  "includedGitignorePatterns": [
    "plans/",
    "todos/",
    "tmp/",
    "reviews/",
    ".rune/",
    ".claude/CLAUDE.local.md"
  ]
}

includedGitignorePatterns lets Claude Code read Rune's output directories that are typically gitignored.

Quick Configuration (Optional)

Generate a talisman.yml tailored to your project's tech stack:

/rune:talisman init      # Auto-detect stack and generate .rune/talisman.yml
/rune:talisman audit     # Check existing config for missing/outdated sections
/rune:talisman status    # Overview of current configuration health

See the Talisman deep dive for full configuration coverage.


How It Works

Rune orchestrates multi-agent workflows where specialized AI teammates collaborate through shared task lists and file-based communication. Instead of one agent doing everything in a single context window, Rune splits work across purpose-built agents — each with its own full context window.

You ──► /rune:devise ──► Plan
                           │
         /rune:arc ◄───────┘
             │
             ├─ Forge & Validate     enrich plan, review architecture, refine
             ├─ Work                 swarm workers implement in parallel
             ├─ Gap Analysis         detect and remediate implementation gaps
             ├─ Review & Mend        multi-agent code review + auto-fix findings
             ├─ Test                 3-tier testing (unit → integration → E2E)
             ├─ Ship                 validate and create PR
             └─ Merge               rebase and merge

Workflows

Quick Start (New Users)

CommandWhat it doesAlias for
/rune:planPlan a feature or task/rune:devise
/rune:workImplement a plan with AI workers/rune:strive
/rune:reviewReview your code changes/rune:appraise
/rune:brainstormExplore ideas through dialogue/rune:brainstorm
/rune:quickQuick 3-phase pipeline (plan -> work -> review)/rune:arc-quick

/rune:tarnished — The Unified Entry Point

Don't remember which command to use? /rune:tarnished is the intelligent master command that routes natural language to the correct Rune workflow. It understands both English and Vietnamese.

# Route by keyword — passes through to the right skill
/rune:tarnished plan add user authentication
/rune:tarnished work plans/my-plan.md
/rune:tarnished review
/rune:tarnished arc plans/my-plan.md
/rune:tarnished arc-batch plans/*.md
/rune:tarnished arc-issues --label "rune:ready"

# Chain workflows — multi-step with confirmation between steps
/rune:tarnished review and fix
/rune:tarnished plan then work

# Natural language — classifies intent automatically
/rune:tarnished implement the latest plan
/rune:tarnished fix the findings from the last review

# Guidance — ask Rune anything
/rune:tarnished help
/rune:tarnished what should I do next?
/rune:tarnished khi nào nên dùng audit vs review?

When run with no arguments, /rune:tarnished scans your project state (plans, reviews, git changes) and suggests the most logical next action.

Core Commands

CommandWhat it doesAgentsDuration
/rune:deviseTurn ideas into structured plans with parallel researchup to 710–30 min
/rune:striveExecute plans with self-organizing swarm workers2-610–30 min
/rune:appraiseMulti-agent code review on your diffup to 85–20 min
/rune:auditFull codebase audit with specialized reviewersup to 810–30 min
/rune:arc-quickQuick pipeline: plan → work → review → mend (4 phases)varies25–60 min
/rune:arcEnd-to-end pipeline: plan → work → review → test → shipvaries1–3 hours
/rune:mendParallel resolution of review findings1-53–10 min
/rune:forgeDeepen a plan with topic-aware research enrichment3-125–15 min
/rune:goldmaskImpact analysis — what breaks if you change this?85–10 min
/rune:inspectPlan-vs-implementation gap audit (10 dimensions)45–10 min
/rune:elicitStructured reasoning (Tree of Thoughts, Pre-mortem, 5 Whys)02–5 min

Batch & Automation

CommandWhat it does
/rune:arc-batchRun /rune:arc across multiple plans sequentially
/rune:arc-issuesFetch GitHub issues by label, generate plans, run arc for each
/rune:arc-hierarchyExecute hierarchical parent/child plan decompositions

Utilities

CommandWhat it does
/rune:restClean up tmp/ artifacts from completed workflows
/rune:echoesManage persistent agent memory (show, prune, reset)
/rune:learnExtract CLI corrections and review recurrences from session history into Echoes
/rune:file-todosStructured file-based todo tracking with YAML frontmatter
/rune:cancel-arcGracefully stop a running arc pipeline
/rune:cancel-reviewStop an active code review
/rune:cancel-auditStop an active audit
/rune:cancel-arc-batchStop an active arc-batch loop
/rune:cancel-arc-hierarchyStop an active arc-hierarchy loop
/rune:cancel-arc-issuesStop an active arc-issues loop
/rune:cancel-codex-reviewStop an active codex review
/rune:team-delegateTask delegation dashboard
/rune:plan-reviewReview plan code samples for correctness
/rune:pr-guardianAutomated PR shepherd — lint, CI, rebase, migrations, browser test, auto-merge (cron every 5 min)

Workflow Details

<a name="devise"></a> /rune:devise — Planning

Transforms a feature idea into a structured plan through a multi-phase pipeline:

  1. Brainstorm — structured exploration with elicitation methods
  2. Research — parallel agents scan your repo, git history, echoes, and external docs
  3. Solution Arena — competing approaches evaluated on weighted dimensions
  4. Synthesize — consolidate findings into a plan document
  5. Predictive Goldmask — risk scoring for files the plan will touch
  6. Forge — topic-aware enrichment by specialist agents
  7. Review — automated verification + optional technical review
/rune:devise                  # Full pipeline
/rune:devise --quick          # Skip brainstorm + forge (faster)

Output: plans/YYYY-MM-DD-{type}-{name}-plan.md

<a name="arc"></a> /rune:arc — End-to-End Pipeline

The full pipeline from plan to merged PR, with 45 phases:

Forge → Plan Review → Refinement → Verification → Semantic Verification
  → Design Extraction → Task Decomposition → Work → Storybook Verification
  → Design Verification → UX Verification → Gap Analysis → Codex Gap Analysis
  → Gap Remediation → Goldmask Verification → Code Review (--deep)
  → Goldmask Correlation → Mend → Verify Mend → Design Iteration
  → Test → Test Coverage Critique → Pre-Ship Validation → Release Quality Check
  → Ship → Bot Review Wait → PR Comment Resolution → Merge
/rune:arc plans/my-plan.md
/rune:arc plans/my-plan.md --resume        # Resume from checkpoint
/rune:arc plans/my-plan.md --no-forge      # Skip forge enrichment
/rune:arc plans/my-plan.md --skip-freshness  # Bypass plan freshness check

Features: checkpoint-based resume, adaptive review-mend convergence loop (3 tiers: LIGHT/STANDARD/THOROUGH), diff-scoped review, co-author propagation.

How arc phases work: Arc uses Claude Code's Stop hook to drive the phase loop — when one phase finishes, the stop hook reads state from .rune/arc-phase-loop.local.md, determines the next phase, and re-injects a prompt. Each phase is literally a new Claude Code turn with its own fresh context window. This solves the context degradation problem (phase 18 gets the same quality as phase 1) but means the stop hook chain is a critical path — a bug in any hook silently breaks the pipeline. See docs/state-machine.md for the full phase graph.

<a name="strive"></a> /rune:strive — Swarm Execution

Self-organizing workers parse a plan into tasks and claim them independently:

/rune:strive plans/my-plan.md
/rune:strive plans/my-plan.md --approve    # Require human approval per task

<a name="appraise"></a> /rune:appraise — Code Review

Multi-agent review of your git diff with up to 8 specialized Ashes:

/rune:appraise                # Standard review
/rune:appraise --deep         # Multi-wave deep review (up to 18 Ashes across 3 waves)

Built-in reviewers include: Ward Sentinel (security), Pattern Seer (consistency), Flaw Hunter (logic bugs), Ember Oracle (performance), Depth Seer (missing logic), and more. Stack-aware intelligence auto-adds specialist reviewers based on your tech stack.

<a name="audit"></a> /rune:audit — Codebase Audit

Full-scope analysis of your entire codebase (not just the diff):

/rune:audit                   # Deep audit (default)
/rune:audit --standard        # Standard depth
/rune:audit --deep            # Multi-wave investigation
/rune:audit --incremental     # Stateful audit with priority scoring and coverage tracking

<a name="mend"></a> /rune:mend — Fix Findings

Parse a TOME (aggregated review findings) and dispatch parallel fixers:

/rune:mend tmp/reviews/{id}/TOME.md

<a name="forge"></a> /rune:forge — Plan Enrichment

Deepen a plan with Forge Gaze — topic-aware agent matching that selects the best specialists for each section:

/rune:forge plans/my-plan.md
/rune:forge plans/my-plan.md --exhaustive  # Lower threshold, more agents

<a name="goldmask"></a> /rune:goldmask — Impact Analysis

Three-layer analysis: Impact (what changes), Wisdom (why it was written that way), Lore (how risky the area is):

/rune:goldmask                # Analyze current diff

<a name="inspect"></a> /rune:inspect — Gap Audit

Compares a plan against its implementation across 10 quality dimensions:

/rune:inspect plans/my-plan.md
/rune:inspect plans/my-plan.md --focus "auth module"

<a name="elicit"></a> /rune:elicit — Structured Reasoning

24 curated methods for structured thinking: Tree of Thoughts, Pre-mortem Analysis, Red Team vs Blue Team, 5 Whys, ADR, and more.

/rune:elicit

Agents

152 specialized agents across 8 categories:

Review Agents (50)

Core reviewers active in every /rune:appraise and /rune:audit run. UX and design reviewers (below) are conditionally activated for frontend files. Stack specialists are additionally auto-activated based on detected tech stack:

AgentFocus
Ward SentinelSecurity (OWASP Top 10, auth, secrets)
Pattern SeerCross-cutting consistency (naming, error handling, API design)
Flaw HunterLogic bugs (null handling, race conditions, silent failures)
Ember OraclePerformance (N+1 queries, algorithmic complexity)
Depth SeerMissing logic (error handling gaps, state machine incompleteness)
Void AnalyzerIncomplete implementations (TODOs, stubs, placeholders)
Wraith FinderDead code (unused exports, orphaned files, unwired DI)
Phantom WardenPhantom implementations (spec-to-code gaps, dead specs, unenforced rules)
Tide WatcherAsync/concurrency (waterfall awaits, race conditions)
Forge KeeperData integrity (migration safety, transaction boundaries)
Trial OracleTest quality (TDD compliance, assertion quality)
Simplicity WardenOver-engineering (YAGNI violations, premature abstractions)
Rune ArchitectArchitecture (layer boundaries, SOLID, dependency direction)
Mimic DetectorCode duplication (DRY violations)
Blight SeerDesign anti-patterns (God Service, leaky abstractions)
Refactor GuardianRefactoring completeness (orphaned callers, broken imports)
Reference ValidatorImport paths and config reference correctness
Phantom CheckerDynamic references (getattr, decorators, string dispatch)
Naming Intent AnalyzerName-behavior mismatches
Type WardenType safety (mypy strict, modern Python idioms)
Doubt SeerCross-agent claim verification
Assumption SlayerPremise validation (solving the right problem?)
Reality ArbiterProduction viability (works in isolation vs. real conditions)
Entropy ProphetLong-term consequence prediction
Schema Drift DetectorSchema drift between migrations and ORM/model definitions
Agent Parity ReviewerAgent-native parity, orphan features, context starvation
Senior Engineer ReviewerPersona-based senior engineer review, production thinking
Cross-Shard SentinelCross-shard consistency for Inscription Sharding (naming drift, pattern inconsistency, auth boundary gaps)
Shard ReviewerUniversal sharded review — all dimensions for assigned file subset
Sediment DetectorFeature sediment detection (dead code paths, unused config)

UX & Design Reviewers (conditionally activated for frontend files):

AgentFocus
Aesthetic Quality ReviewerVisual quality beyond pixel-perfect fidelity (AI slop detection, generic layouts)
Design System Compliance ReviewerDesign system convention adherence (token usage, variant patterns)
UX Heuristic ReviewerNielsen Norman + Baymard heuristic evaluation (50+ items)
UX Flow ValidatorUser flow completeness (loading, error, empty states, confirmations)
UX Interaction AuditorMicro-interactions (hover/focus, keyboard a11y, touch targets, animation)
UX Cognitive WalkerFirst-time user cognitive walkthrough (discoverability, learnability)

Stack Specialists (auto-activated by detected tech stack):

AgentStack
Python ReviewerPython 3.10+ (type hints, async, Result patterns)
TypeScript ReviewerStrict TypeScript (discriminated unions, exhaustive matching)
Rust ReviewerRust (ownership, unsafe, tokio)
PHP ReviewerPHP 8.1+ (type declarations, enums, readonly)
FastAPI ReviewerFastAPI (Pydantic, IDOR, dependency injection)
Django ReviewerDjango + DRF (ORM, CSRF, admin, migrations)
Laravel ReviewerLaravel (Eloquent, Blade, middleware, gates)
Axum ReviewerAxum/SQLx (extractor ordering, N+1 queries, IDOR, transaction boundaries)
SQLAlchemy ReviewerSQLAlchemy (async sessions, N+1, eager loading)
TDD Compliance ReviewerTDD practices (test-first, coverage, assertion quality)
DDD ReviewerDomain-Driven Design (aggregates, bounded contexts)
DI ReviewerDependency Injection (scope, circular deps, service locator)
Design Implementation ReviewerDesign-to-code fidelity (tokens, layout, responsive, a11y, variants)

Investigation Agents (32)

Used by /rune:goldmask, /rune:inspect, and /rune:audit --deep:

CategoryAgents
Impact TracersAPI Contract, Business Logic, Data Layer, Config Dependency, Event Message
Quality InspectorsGrace Warden (+ inspect, plan-review variants), Ruin Prophet (+ inspect, plan-review), Sight Oracle (+ inspect, plan-review), Vigil Keeper (+ inspect, plan-review)
Deep AnalysisBreach Hunter, Decay Tracer, Decree Auditor, Ember Seer, Fringe Watcher, Hypothesis Investigator, Order Auditor, Rot Seeker, Ruin Watcher, Signal Watcher, Strand Tracer, Truth Seeker
SynthesisGoldmask Coordinator, Lore Analyst, Wisdom Sage

Research Agents (7)

AgentPurpose
Repo SurveyorCodebase structure and pattern analysis
Echo ReaderSurfaces relevant past learnings from Rune Echoes
Git MinerGit archaeology — commit history, contributors, code evolution
Lore ScholarFramework docs via Context7 MCP + web search fallback
Practice SeekerExternal best practices and industry patterns
Activation PathfinderMaps activation path for new code (migrations, config, deployment)
Wiring CartographerMaps integration points where new code connects to existing system

Work Agents (8)

AgentPurpose
Rune SmithTDD-driven code implementation
Proto WorkerDesign prototype synthesis from Figma references
Trial ForgerTest generation following project patterns
Design Sync AgentFigma extraction and Visual Spec Map creation
Design IteratorIterative design refinement (screenshot-analyze-fix loop)
Storybook ReviewerComponent verification via screenshots (Mode A/B quality checks)
Storybook FixerApplies Storybook finding fixes with re-verification
Gap FixerAutomated remediation of inspection gaps from VERDICT.md

Utility Agents (22)

AgentPurpose
RunebinderAggregates multi-agent review outputs into TOME
Mend FixerApplies targeted code fixes for review findings
Elicitation SageStructured reasoning method execution
Scroll ReviewerDocument quality review
Flow SeerFeature spec analysis for completeness
Decree ArbiterTechnical soundness validation
Knowledge KeeperDocumentation coverage review
Horizon SageStrategic depth assessment
State WeaverPlan state machine validation (phases, transitions, I/O contracts)
Veil PiercerPlan reality-gap analysis
Evidence VerifierFactual claim validation with grounding scores
Research VerifierResearch output quality verification
Truthseer ValidatorAudit coverage quality validation
Deployment VerifierDeployment artifact generation (Go/No-Go checklists, rollback plans)
Design AnalystFigma frame relationship classifier (5-signal weighted composite)
Todo VerifierTODO staleness verification — classifies TODOs as VALID or FALSE_POSITIVE
UX Pattern AnalyzerCodebase UX maturity assessment (loading, error, form, navigation patterns)
Codex Phase HandlerIsolated Codex phase execution (codex-exec.sh wrapper)
Tome DigestTOME finding extraction (P1/P2/P3 counts, recurring patterns) — shell-based
Forge WardenMulti-perspective backend code review for forge enrichment
Verdict BinderInspection aggregator — merges Inspector findings into VERDICT.md
Veil Piercer (Plan)Plan-level reality-gap analysis and assumption validation

Testing Agents (6)

AgentPurpose
Unit Test RunnerDiff-scoped unit test execution
Integration Test RunnerAPI, database, and business logic tests
E2E Browser TesterBrowser automation via agent-browser CLI
Test Failure AnalystRoot cause analysis of test failures
Extended Test RunnerExtended-tier test execution with checkpoint/resume protocol
Contract ValidatorAPI contract validation (request/response schemas)

QA Agents (8)

AgentPurpose
Phase QA VerifierIndependent arc phase completion artifact verification
Code Review QA VerifierVerifies code review phase TOME existence and finding quality
Forge QA VerifierVerifies forge phase enrichment depth and structural preservation
Gap Analysis QA VerifierVerifies gap analysis compliance matrix and criteria coverage
Mend QA VerifierVerifies mend resolution report and per-finding status
Test QA VerifierVerifies test phase SEAL markers and tier coverage
Work QA VerifierVerifies work phase delegation manifests and task completeness
Design QA VerifierVerifies design verification phase report and criteria matrix

Meta-QA Agents (8)

AgentPurpose
Prompt LinterLints agent definition files for consistency (15 rules)
Workflow AuditorAudits arc workflow definitions for structural integrity
Rule Consistency AuditorDetects contradictions between CLAUDE.md, skills, and talisman
Hook Integrity AuditorValidates hooks.json entries match actual scripts
Improvement AdvisorGenerates fix proposals for meta-QA findings
Hallucination DetectorDetects phantom claims and evidence fabrication in arc artifacts
Effectiveness AnalyzerPer-agent finding accuracy and false-positive tracking
Convergence AnalyzerRetry efficiency, quality trajectory, stagnation detection

Skills

69 skills providing background knowledge, workflow orchestration, and tool integration:

SkillTypePurpose
deviseWorkflowMulti-agent planning pipeline
striveWorkflowSwarm work execution
appraiseWorkflowMulti-agent code review
auditWorkflowFull codebase audit
arcWorkflowEnd-to-end pipeline orchestration
arc-batchWorkflowSequential batch arc execution
arc-hierarchyWorkflowHierarchical plan execution
arc-issuesWorkflowGitHub Issues-driven batch arc
forgeWorkflowPlan enrichment with Forge Gaze
goldmaskWorkflowCross-layer impact analysis
inspectWorkflowPlan-vs-implementation gap audit
mendWorkflowParallel finding resolution
elicitationReasoning24 structured reasoning methods
roundtable-circleOrchestrationReview/audit 7-phase lifecycle
rune-orchestrationOrchestrationCore coordination patterns
context-weavingOrchestrationContext overflow prevention
rune-echoesMemory5-tier persistent agent memory
stacksIntelligenceStack-aware detection and routing
frontend-design-patternsIntelligenceDesign-to-code patterns (tokens, a11y, responsive, components)
design-syncWorkflowFigma design sync (extraction, implementation, fidelity review)
inner-flameQualityUniversal self-review protocol
ash-guideReferenceAgent invocation guide
tarnishedRoutingUnified entry point — natural language to workflow
using-runeReferenceWorkflow discovery and routing
codex-cliIntegrationCross-model verification
testingTesting3-tier test orchestration
agent-browserTestingE2E browser automation knowledge
systematic-debuggingDebugging4-phase debugging methodology
file-todosTrackingStructured file-based todos
git-worktreeIsolationWorktree-based parallel execution
polling-guardReliabilityMonitoring loop fidelity
zsh-compatCompatibilitymacOS zsh shell safety
chome-patternCompatibilityMulti-account config resolution
resolve-gh-pr-commentWorkflowResolve a single GitHub PR review comment
resolve-all-gh-pr-commentsWorkflowBatch resolve all open PR review comments
skill-testingDevelopmentTDD for skill development
debugDebuggingACH-based parallel hypothesis debugging
codex-reviewWorkflowCross-model code review (Claude + Codex in parallel)
learnMemorySession self-learning (CLI corrections, review recurrences)
figma-to-reactIntegrationFigma-to-React MCP server knowledge
statusReportingWorker status reporting for swarm execution
brainstormWorkflowCollaborative idea exploration (solo, roundtable, deep modes)
design-prototypeWorkflowStandalone Figma-to-Storybook prototype generator (extract, match, synthesize, verify)
design-system-discoveryIntelligenceDesign system auto-detection (libraries, tokens, variants)
resolve-todosWorkflowFile-based TODO resolution with verify-before-fix pipeline
runsReportingInspect per-agent structured artifacts from workflow runs
storybookIntelligenceStorybook component verification knowledge (CSF3, MCP tools)
elevateMemoryPromote project echoes to global scope with domain tagging
team-sdkOrchestrationCentralized team management SDK (ExecutionEngine, lifecycle)
team-statusReportingActive agent team monitoring dashboard
test-browserTestingStandalone browser E2E testing (no agent teams)
untitledui-mcpIntegrationUntitledUI MCP integration (6 tools, builder-protocol)
ux-design-processIntelligenceUX design methodology (heuristic evaluation, flow validation)
talismanConfigurationDeep talisman.yml management (init, audit, update, guide, status)
cc-inspectWorkflowRun Claude Code built-in inspection script
disciplineQualityProof-based orchestration discipline for spec compliance
post-findingsWorkflowPost review findings to GitHub PR as formatted comment
self-auditQualityMeta-QA self-audit of Rune's own plugin infrastructure
arc-quickWorkflowQuick 4-phase pipeline: plan → work → review → mend
variant-huntAnalysisSystematic variant analysis — "find more like this" for confirmed findings
supply-chain-auditSecurityDependency risk analysis (maintainer count, CVE history, abandonment)
pr-guardianAutomationCron-based PR shepherd loop — comments, lint, CI, rebase, migrations, browser test, auto-merge
codex-statusReportingCodex activity summary for current/recent arc run
verifyWorkflowVerify TOME findings before mend (TRUE_POSITIVE/FALSE_POSITIVE classification)
react-composition-patternsIntelligenceReact compound components, state lifting, explicit variants, React 19 APIs
react-native-patternsIntelligenceReact Native/Expo best practices (FlashList, Reanimated, native navigation)
react-performance-rulesIntelligenceReact/Next.js performance — 69 rules across 8 categories
react-view-transitionsIntelligenceReact View Transition API — placement, types, CSS recipes, Next.js integration
web-interface-rulesIntelligenceWeb interface quality — 100+ UI/UX/a11y rules across 15 categories

Configuration

Rune is configured via talisman.yml (dozens of top-level sections, 100+ keys):

# Project-level (highest priority)
.rune/talisman.yml

# User-global
~/.rune/talisman.yml

Quickest way to configure: Run /rune:talisman init to auto-detect your stack and generate a tailored config.

<details> <summary>Example configuration</summary>
version: 1

# File classification — decides which Ashes get summoned
rune-gaze:
  backend_extensions: [.py]
  skip_patterns: ["**/migrations/**", "**/__pycache__/**"]

# Work execution
work:
  ward_commands: ["ruff check .", "mypy .", "pytest --tb=short -q"]
  max_workers: 3

# Arc pipeline
arc:
  timeouts:
    forge: 900000               # 15 min
    work: 2100000               # 35 min
    code_review: 900000         # 15 min
  ship:
    auto_pr: true
    merge_strategy: "squash"

# Review settings
review:
  diff_scope:
    enabled: true
    expansion: 8

# Goldmask impact analysis
goldmask:
  enabled: true
  devise:
    depth: enhanced             # basic | enhanced | full

# Cross-model verification
codex:
  enabled: true
  workflows: [devise, arc, appraise]

# Custom Ashes
ashes:
  custom:
    - name: "my-reviewer"
      agent: "my-custom-agent"
      source: ".claude/agents/my-custom-agent.md"
</details>

See talisman.example.yml for the full schema with all options.


Codex CLI Integration (Optional)

Rune supports OpenAI Codex CLI as a cross-model verification layer. If you have a ChatGPT Pro subscription, you can enable Codex to add a second AI perspective alongside Claude — giving you higher-confidence results through independent cross-verification.

What Codex adds

WorkflowCodex Role
/rune:arcGap analysis phase — Codex independently reviews implementation gaps
/rune:appraiseCross-model review — Claude and Codex review in parallel, findings are cross-verified
/rune:devisePlan validation — Codex provides a second opinion on plan feasibility
/rune:codex-reviewDedicated cross-model review — runs Claude + Codex agents side by side

Findings are tagged with confidence levels: CROSS-VERIFIED (both models agree), STANDARD (single model), or DISPUTED (models disagree).

Trade-off: quality vs. time

Enabling Codex increases runtime for every workflow that uses it — each Codex invocation adds an extra verification pass. For /rune:arc, this can add 10–20 minutes on top of the already 1–3 hour pipeline. Enable it when correctness matters more than speed.

Enable / Disable

Codex integration is controlled via talisman.yml:

# .rune/talisman.yml
codex:
  enabled: true                          # Set to false to disable entirely
  workflows: [devise, arc, appraise]     # Which workflows use Codex

To disable: set codex.enabled: false or remove the codex section. Rune auto-detects whether the codex CLI is installed and authenticated — if not available, Codex phases are silently skipped.

Prerequisites

  1. ChatGPT Pro subscription (for Codex API access)
  2. Codex CLI installed: npm install -g @openai/codex
  3. Authenticated: codex login
  4. .codexignore file in project root (required for --full-auto mode)

MCP Tool Integrations (Optional)

Rune supports third-party MCP servers for component libraries and design tools. Declare them in talisman.yml and Rune routes them into the right workflow phases automatically.

UI Builder Protocol (v1.133.0+)

The UI Builder Protocol integrates any component library MCP (UntitledUI, shadcn/ui, custom) into Rune's full pipeline — planning, implementation, design sync, and code review:

Integration PointWhat happens
/rune:devisePlan includes ui_builder frontmatter section + Component Strategy
/rune:striveWorkers injected with builder workflow (search → get → customize)
/rune:design-syncPhase 1.5 Component Match: reference code → library search → annotated VSM
/rune:appraiseCompliance reviewer generates DSYS-BLD-* findings for convention violations

UntitledUI is supported out of the box — register the MCP server and add talisman config. No project skill needed.

shadcn/ui and custom libraries: create a builder skill with builder-protocol frontmatter.

# .rune/talisman.yml — minimal builder integration
integrations:
  mcp_tools:
    untitledui:
      server_name: "untitledui"
      tools:
        - { name: "search_components", category: "search" }
        - { name: "get_component", category: "details" }
      phases: { devise: true, strive: true, arc: true }
      skill_binding: "untitledui-mcp"   # built-in plugin skill
      trigger:
        extensions: [".tsx", ".ts"]
        keywords: ["ui", "component"]
        always: false

See docs/guides/ui-builder-protocol.en.md for the full developer guide.

MCP Integration Framework (v1.131.0+)

Declarative integrations.mcp_tools talisman config routes any MCP tool into the right workflow phases, with trigger conditions, rules injection, and companion skill auto-loading.

See docs/guides/mcp-integration-spec.en.md for the full spec (3 integration levels, schema reference, trigger system, worked examples).


Architecture

rune/
├── .claude-plugin/
│   └── marketplace.json          # Marketplace registry
└── plugins/
    └── rune/                     # Main plugin
        ├── .claude-plugin/
        │   └── plugin.json       # Plugin manifest (v1.128.0)
        ├── agents/               # 109 core agent definitions
        │   ├── review/           #   17 review agents
        │   ├── investigation/    #   31 investigation agents
        │   ├── utility/          #   17 utility agents
        │   ├── research/         #    8 research agents
        │   ├── work/             #    7 work agents
        │   ├── qa/               #    8 QA agents
        │   └── meta-qa/          #    9 meta-QA agents
        ├── registry/             # 43 extended agents
        ├── skills/               # 69 skills
        ├── commands/             # 16 slash commands
        ├── hooks/                # Event-driven hooks
        │   └── hooks.json
        ├── scripts/              # Hook & utility scripts (256 .sh/.py files)
        ├── .mcp.json             # MCP server config (5 servers: echo-search, figma-to-react, agent-search, context7, figma-context)
        ├── talisman.example.yml  # Configuration reference
        ├── CLAUDE.md             # Plugin instructions
        ├── CHANGELOG.md
        └── README.md             # Detailed component reference

State Machine Reference

Every Rune workflow is an explicit state machine with named phases, conditional gates, and error recovery tiers. See docs/state-machine.md for mermaid diagrams of all 10 workflows — useful for debugging pipeline failures, understanding phase transitions, and verifying correctness.

Key Concepts

TermMeaning
TarnishedThe orchestrator/lead agent that coordinates workflows
AshAny teammate agent (reviewer, worker, researcher)
TOMEAggregated findings document from a review
TalismanConfiguration file (talisman.yml)
Forge GazeTopic-aware agent matching for plan enrichment
Rune Echoes5-tier persistent agent memory (.rune/echoes/)
InscriptionContract file (inscription.json) for agent coordination
SealDeterministic completion marker emitted by Ashes
Discipline EngineeringThe architectural backbone — 5-layer proof-based system ensuring spec compliance over task completion. See full document

Known Gotchas

A few things to know when working with Rune — especially if you're debugging a pipeline failure or writing custom hooks/scripts:

GotchaDetails
macOS bash is 3.2The system bash on macOS is ancient (3.2). No associative arrays, no readarray, no |&. Rune's enforce-zsh-compat.sh hook auto-fixes 5 common patterns at runtime, but custom scripts must target bash 3.2.
status is read-only in zshzsh (macOS default shell) treats status as read-only. Using status= in any script will silently fail or crash. Use task_status or tstat instead. Enforced by enforce-zsh-compat.sh.
Hook timeout budget is tightPreToolUse hooks: 5s. Stop hooks: 15s (arc-phase) or 30s (detect-workflow). A slow git or gh call in a hook can cause silent timeout — the hook is killed and the phase loop breaks.
Stop hooks chain in sequence6 Stop hooks fire in order: arc-phase-stop-hook.sh (inner) → arc-batch-stop-hook.sharc-hierarchy-stop-hook.sharc-issues-stop-hook.shdetect-workflow-complete.shon-session-stop.sh (outer). A crash in an inner hook breaks all outer hooks.
SEAL convention for completionAshes emit <seal>TAG</seal> as their last output line. The on-teammate-idle.sh hook checks for this marker to distinguish "done writing" from "idle mid-task". Missing seals cause premature aggregation.

See the Troubleshooting guide for more operational details.


Torrent — Tmux Arc Orchestrator

Torrent is a standalone Rust TUI tool that manages rune:arc execution across multiple Claude Code sessions using tmux:

  • Multi-config support — use different CLAUDE_CONFIG_DIR accounts
  • tmux isolation — each arc gets its own Claude Code instance with clean context
  • Crash resilience — tmux sessions survive if torrent exits
  • Channels bridge (--channels) — bidirectional communication with Claude Code via Channels API. Bridge View ([b] key) provides full-screen chat with message history and persistence

Install

# Install to ~/.local/bin (recommended)
curl -fsSL https://raw.githubusercontent.com/vinhnxv/rune/main/torrent/install.sh | bash

# Or install system-wide
curl -fsSL https://raw.githubusercontent.com/vinhnxv/rune/main/torrent/install.sh | bash -s -- --system

See torrent/README.md for full documentation.


Requirements & Compatibility

RequirementMinimumRecommended
Claude Code2.1.81+Latest
OSmacOS 12+, Linux (Ubuntu 20.04+)macOS 14+, Ubuntu 22.04+
Shellbash 3.2+ or zsh 5.0+zsh (macOS default)
Python3.11+ (for MCP servers)3.12+
Node.js18+ (for Context7 MCP)20+
jq1.6+Latest
git2.25+Latest
Claude PlanPro ($20/mo) for basic useMax ($200/mo) for full Arc pipeline

Windows: Not currently supported. WSL2 with Ubuntu may work but is untested.


Resources Overview

ResourceWhat you'll find
Getting StartedFirst-time walkthrough — plan, work, review in 3 commands
Plugin Component ReferenceAll agents, skills, commands, hooks in detail
Arc & Batch GuideEnd-to-end pipeline, batch mode, GitHub Issues
Talisman ConfigurationFull configuration reference
TroubleshootingDebugging, cost optimization, common issues
Documentation HubAll guides, English + Vietnamese
State MachinesMermaid diagrams of all 10 workflow state machines
Torrent TUITmux-based multi-session arc orchestrator
ChangelogRelease history

Links


License

MIT

Rune — plugin by vinhnxv | Shared Context