Claude Corps

Transform Claude Code into a collaborative development team with parallel agentic workflows.

<p align="center"> <img src="claude-corps.png" alt="claude-corps banner" width="100%"> </p> <p align="center"> <strong>Parallel agentic development framework for Claude Code</strong> <br> Turn Claude Code into an autonomous development team &mdash; multiple agents working simultaneously on isolated branches, coordinated by an orchestrator, with structured workflow that improves across sessions. </p> <p align="center"> <a href="#quick-start">Quick Start</a> &middot; <a href="#why-claude-corps">Why?</a> &middot; <a href="#claude-code-skills-reference">Skills</a> &middot; <a href="#autonomous-multi-hour-orchestration">Auto-Run</a> &middot; <a href="#faq">FAQ</a> </p> <p align="center"> <a href="https://github.com/josephneumann/claude-corps/blob/main/LICENSE"><img src="https://img.shields.io/github/license/josephneumann/claude-corps" alt="License"></a> <a href="https://github.com/josephneumann/claude-corps/stargazers"><img src="https://img.shields.io/github/stars/josephneumann/claude-corps" alt="Stars"></a> <a href="https://github.com/josephneumann/claude-corps/commits/main"><img src="https://img.shields.io/github/last-commit/josephneumann/claude-corps" alt="Last Commit"></a> <img src="https://img.shields.io/badge/Claude%20Code-compatible-blueviolet" alt="Claude Code Compatible"> <img src="https://img.shields.io/badge/Worktree%20Isolation-supported-green" alt="Worktree Isolation"> </p>

Why claude-corps?

Claude Code is powerful on its own. claude-corps makes it a team.

  • Parallel execution — Dispatch 3-5 Claude agents working simultaneously on different tasks, each in an isolated git worktree
  • Sequential execution — Dependent tasks run one at a time on the branch via --sequential — no worktree overhead, no merge conflicts
  • Full lifecycle coverage — From spec to dispatch to PR to code review, every step has a skill
  • Autonomous multi-hour runs/auto-run chains dispatch, reconcile, and repeat until your entire backlog is done
  • Human in the loop — Agents execute, you decide. PRs are created, never auto-merged

Quick Start

Option A: Plugin Install

In Claude Code:

/plugin marketplace add josephneumann/claude-corps
/plugin install claude-corps@claude-corps

Note: Plugin install namespaces skills as /claude-corps:orient, /claude-corps:dispatch, etc. For the full un-namespaced experience with CLAUDE.md integration, use Option B.

Option B: Full Install (Recommended)

git clone https://github.com/josephneumann/claude-corps.git ~/Code/claude-corps
cd ~/Code/claude-corps && ./install.sh

This symlinks skills, agents, hooks, scripts, and docs into ~/.claude/ for un-namespaced /orient, /dispatch, etc.

Then in any project:

claude
> /orient              # Survey your project and identify parallel work
> /dispatch --count 3  # Spawn 3 workers in isolated worktrees
> /auto-run            # Or go fully autonomous

The Workflow

graph LR
    DP["/deep-plan"] --> PR["/product-review"]
    PR --> A["/spec"]
    A --> REV["Deep Reviews"]
    REV --> MR["/multi-review --plan"]
    MR --> DEC["Decompose"]
    DEC --> D["/orient"]
    A -.->|"or skip reviews"| D
    D --> E["/dispatch"]
    E --> F["Workers: /start-task"]
    F --> G["Workers: /finish-task"]
    G --> H["/reconcile-summary"]
    E -.->|"/auto-run loop"| H
PhaseWhat happens
Plan/deep-plan orchestrates the full pipeline: /product-review challenges scope, /spec writes the plan, deep eng/design reviews add rigor, /multi-review --plan catches issues pre-coding, then decomposes to Linear. Or run each skill individually.
Execute/orient surveys the project. /dispatch spawns workers — parallel (worktree-isolated, default) or sequential (--sequential, direct on branch). Each worker implements, tests, and writes a session summary. /auto-run does this in a loop until all tasks are done.
Review/multi-review runs parallel specialized code review. /reconcile-summary syncs worker output with the task board.

Claude Code Skills Reference

All workflow capabilities are implemented as slash commands in skills/.

Planning

SkillPurpose
/deep-planFull planning pipeline with checkpoints: product review, optional design exploration, spec, deep eng/design reviews, plan-stage multi-review, decomposition. Single entry point.
/design-shotgunGenerate 3-5 intentionally different UI directions before committing to a design approach
/product-reviewProduct-taste review with interrogation mode, assumption mapping, and devil's advocate challenges. EXPAND / HOLD / REDUCE / DESIGN modes
/specResearch, plan, optionally decompose into Linear issues
/spec --deepenEnhance an existing plan with parallel research

Execution

SkillPurpose
/orientSurvey project, identify parallel work streams
/start-task <id>Claim task, create branch, gather context (requires Linear MCP for task tracking)
/finish-task <id> [--direct]Tests, commit, PR, code review, browser verification, session summary, close. --direct skips PR/review for sequential tasks.

Orchestration

SkillPurpose
/dispatch [--sequential]Spawn workers: parallel (worktree-isolated, default) or sequential (direct on branch)
/auto-run [--sequential]Autonomous dispatch-reconcile loop. --sequential for dependent task chains.
/reconcile-summarySync worker output with task board (requires Linear MCP)
/summarize-session <id>Mid-session progress checkpoint (read-only)

Quality

SkillPurpose
/qaAcceptance-first validation for release readiness, regressions, and browser workflows
/benchmarkRun a repeatable command on current branch vs baseline and report measured performance deltas
/multi-reviewParallel code review with specialized agents. --plan <path> reviews plans. --codex adds Codex second-opinion.
/milestone-reviewIterative review-fix loop for accumulated branch changes. Supports Codex via review.json config.
/humanizerRemove AI writing patterns, add natural voice
/verifyVerification discipline — evidence before claims
/debugSystematic debugging with pattern matching, 5 Whys, evidence gathering, and structured reports
/writing-skillsSkill authoring guidance (structure, tone, persuasion)
<details> <summary><strong>Skill details</strong> (click to expand)</summary>

Planning Skills

/deep-plan — Full planning pipeline that orchestrates all planning skills in sequence with user checkpoints between phases: product review (always), design exploration (opt-in), spec (always), deep engineering review (opt-out), deep design review (opt-out), plan-stage multi-review (opt-out), and Linear decomposition (opt-out). Supports --yes for autonomous operation, --skip-reviews to rely on spec's inline Phase 2.5, and --no-decompose. Use instead of manually chaining /product-review, /spec, and review skills.

/design-shotgun — Divergence before commitment for UI-heavy work. Generates 3-5 intentionally different design directions from a brief or plan, forcing real variation in hierarchy, interaction model, responsive posture, and differentiation. Ends with a recommended direction and a handoff block for /product-review DESIGN or /plan-design-review. Does not edit plans or write code.

/product-review — Product-taste review that challenges scope and approach before committing engineering effort. Interrogation mode provides recommended answers for every question (grill-me style). Includes status quo analysis, assumption mapping across Value/Usability/Viability/Feasibility with risk prioritization, devil's advocate challenges (steel-man opposition, kill criteria), and alternatives analysis. Four modes: EXPAND (dream big), HOLD (maximum rigor), REDUCE (strip to essentials), DESIGN (UX-first — user journeys, interaction patterns, responsive strategy). Run before /spec or standalone.

/spec — Interactive refinement (Phase 0) moves from a vague idea to clear requirements. Runs parallel research agents (repo-research-analyst, spec-flow-analyzer, and conditionally best-practices-researcher and framework-docs-researcher). Writes plan to docs/plans/, then optionally decomposes into Linear issues with parent-child hierarchy and blocking relations (requires Linear MCP).

/spec --deepen — Finds the most recent plan in docs/plans/, discovers and applies all available skills, runs parallel research agents per-section, launches all review agents, and merges findings back into the plan. Updates tasks accordingly.

Execution Skills

/orient — Discovers project structure, reads CLAUDE.md/README/PROJECT_SPEC, analyzes task state, checks git health, outputs a structured orientation report with recommended parallel work streams. Always offers /dispatch as next action.

/dispatch — Identifies ready tasks, generates context, and spawns workers. Two modes: parallel (default) uses git worktree isolation for independent tasks, sequential (--sequential) executes tasks one at a time directly on the current branch for dependent tasks that need to see each other's changes. Supports --count N, --sequential, --plan-first, --no-plan, --yes, --model, and custom per-task context.

/start-task <id> — Validates the task, claims it, creates a task branch for isolation, gathers project context, optionally runs research agents, defines acceptance criteria, and begins implementation.

/finish-task <id> [--direct] — Runs quality gates (tests must pass), commits changes, pushes to remote. In default mode: creates a PR, runs /multi-review (skipped for milestone-branch PRs — review happens at milestone level), merges. With --direct: skips PR/review entirely (used by sequential dispatch). In both modes: closes the task and outputs a session summary. Merge conflicts in worktree mode trigger a fail-fast stop — no retry loops.

/reconcile-summary — Auto-discovers unreconciled summaries in docs/session_summaries/, cross-references summary claims against PR/CI evidence before trusting them, analyzes spec divergences, updates affected tasks in Linear (if connected), and closes obsoleted tasks. Discovered work from worker summaries is collected and proposed as a batch for user approval (batch-with-veto) — no issues are created without confirmation. Also updates the project description with current progress. Supports --yes for autonomous operation (auto-approves batch).

/summarize-session <id> — Read-only progress snapshot. Does not commit, push, or close anything.

Quality Skills

/qa — Acceptance-first release validation. Gathers acceptance criteria from a task, plan, or diff; runs deterministic checks; infers regression surface; and, for UI-visible changes, follows the shared browser testing protocol. Produces a final verdict of Pass, Blocked, or Needs manual verification with evidence for every claim.

/benchmark — Measured performance workflow for repeatable commands. Runs the same command on the current branch and a baseline ref in a temporary worktree, captures multiple timed runs, reports median/min/max, and calls out variance so performance claims stay grounded in data.

/multi-review — Selects 3-5 review agents based on change types, runs them in parallel, aggregates findings by severity (Critical/Important/Informational), and resolves every finding through a resolution ledger: auto-fixes without prompting, drops false positives with reasons, defers genuine human decisions for adjudication. Includes workflow-based browser testing for frontend PRs (cache clearing, diff-driven workflow inference, interactive verification). Maximum 3 review cycles with exit conditions. Codex integration (--codex / --codex-adversarial): optionally includes an OpenAI Codex review running in parallel with Claude agents, normalized into the same findings pipeline. Plan mode (--plan <path>): reviews an implementation plan pre-coding — keyword-scans the plan to select relevant reviewers (security, performance, architecture, etc.), prompts them to identify issues that would surface during code review, and resolves findings by amending the plan.

/milestone-review — Autonomous iterative review-fix loop for accumulated branch changes. Supports Codex second-opinion reviews when enabled in review.json ("codex": {"enabled": true}). Unlike /multi-review (interactive), milestone-review fixes all verified findings itself — refactoring, multi-file changes, pattern fixes — repeating until the branch is clean or max iterations are reached. Used automatically by /auto-run after tasks complete, or run standalone on any branch.

/humanizer — Writing editor that identifies and removes AI writing patterns (significance inflation, sycophantic tone, filler phrases, em dash overuse, etc.) to make text sound natural and human. Based on Wikipedia's Signs of AI writing. Outputs a draft rewrite, self-audit for remaining tells, and final revision.

Discipline Skills

/verify — Centralized verification discipline cross-referenced by other skills. Contains the Iron Law (no claims without evidence), anti-rationalization table, red flags list, verification checklist, and anti-sycophancy guidance. Inspired by obra/superpowers.

/debug — Systematic debugging with evidence gathering, known bug pattern matching (race conditions, nil propagation, state corruption, integration failures, config drift, stale cache, off-by-one), "5 Whys" root cause drilling, hypothesis testing, and structured debug reports (DONE / DONE_WITH_CONCERNS / BLOCKED). Three-strikes rule with interactive escalation. Regression test required before closing. Evolved from obra/superpowers with patterns from garrytan/gstack.

/writing-skills — Meta-skill for authoring effective skills. Covers CSO (description design), word count targets, anti-rationalization patterns, tone guidance from the humanizer, and persuasion principles. Informed by obra/superpowers.

</details>

Autonomous Multi-Hour Orchestration

/auto-run enables fully autonomous operation. It dispatches workers, waits for completions, reconciles results, dispatches newly unblocked tasks, and repeats. Supports both parallel (worktree-isolated, default) and sequential (--sequential, direct on branch) modes. After all tasks complete, it runs a milestone review phase — an iterative review-fix loop on the accumulated branch changes (skip with --skip-milestone-review).

# All ready tasks (parallel)
/auto-run

# Sequential — for dependent task chains
/auto-run --sequential --through INT-14

# Everything needed to complete a specific task (resolves dependency graph)
/auto-run --through INT-14

# All tasks in a project
/auto-run --epic "Product Showcase Vignettes"

# Specific tasks plus their blockers
/auto-run --only INT-15 INT-16

# With limits
/auto-run --max-batches 3 --max-hours 4 --max-concurrent 5

Unattended Mode (Wrapper Script)

For runs that outlast a single context window, the wrapper script provides process-level resilience:

~/.claude/scripts/auto-run.sh --max-hours 8
~/.claude/scripts/auto-run.sh --through INT-14 --max-hours 4

The wrapper uses expect to allocate a pty for interactive Claude sessions, sends /auto-run --resume into each fresh Claude session, and checks task state between iterations. State is checkpointed to docs/auto-run-checkpoint.json and survives restarts.


Specialized Agents

Agent definitions in agents/ are used by skills for research and review.

Research Agents

Deployed by /orient and /start-task to gather context before implementation.

AgentPurpose
repo-research-analystMap architecture and conventions
git-history-analyzerHistorical context and contributors
framework-docs-researcherLibrary docs and deprecation checks
best-practices-researcherIndustry patterns and recommendations

Code Review Agents

Deployed by /multi-review and /milestone-review for parallel specialized review.

AgentFocus
code-simplicity-reviewerYAGNI, minimize complexity
security-sentinelCWE-enriched OWASP review, business logic, absence detection
performance-oracleN+1 queries, memory, caching
pattern-recognition-specialistAnti-patterns, conventions
architecture-strategistSOLID, design alignment
agent-native-reviewerAction/context parity for agents
api-security-reviewerRate limiting, pagination, CORS, response filtering
data-integrity-guardianMigration safety, ACID, GDPR/CCPA
data-migration-expertValidates mappings against production
nextjs-reviewerApp Router, RSC, metadata, routing
tailwind-reviewerTailwind/shadcn, accessibility, responsive
python-backend-reviewerFastAPI, SQLAlchemy, async, Alembic, pytest
ux-reviewerInteraction flows, state completeness, form UX, cognitive load
frontend-performance-reviewerCore Web Vitals, bundle size, rendering, waterfalls
External (optional)
codex-reviewerSecond-opinion review via OpenAI Codex CLI. Opt-in via --codex flag or review.json config

Note: Framework-specific reviewers (nextjs, tailwind, python-backend, api-security, ux, frontend-perf) auto-detect from changed files. Use reviewers.exclude in .claude/review.json to suppress. See Setting Up a New Project.

Workflow Agents

AgentPurpose
spec-flow-analyzerAnalyze specs for dependencies, gaps, feasibility

Security

claude-corps uses a layered security model combining AI-driven review with deterministic CI/CD tooling for comprehensive coverage.

AI Agent Layer (security-sentinel)

The security-sentinel agent focuses on what AI uniquely excels at:

  • Business logic vulnerabilities — IDOR (CWE-639), authorization bypass (CWE-863), workflow manipulation, mass assignment (CWE-915)
  • Absence detection — Missing rate limiting, CSRF protection, auth middleware, input validation, security headers, audit logging
  • Self-verification — Every finding is tested against mitigating controls before reporting, reducing false positives
  • CWE-enriched output — All findings include CWE numbers, confidence scores, exploit scenarios, and specific remediation
  • SAST triage — When CI/CD produces SAST results, security-sentinel verifies each finding in context (confirm, dismiss, or escalate)

The api-security-reviewer agent covers API-specific gaps: rate limiting, pagination bounds, response data filtering, CORS, request size limits, and security logging.

CI/CD Layer (deterministic tools)

A template workflow at docs/examples/security-checks.yml adds SAST and container scanning alongside your existing CI:

ToolPurpose
SemgrepKnown vulnerability patterns (SQLi, XSS, path traversal) via SAST
TrivyCVEs in container base images and OS packages
SARIFResults uploaded to GitHub Security tab

This complements (not replaces) existing CI checks like dependency audit (pnpm audit, pip-audit) and secret scanning (TruffleHog).

Pre-Commit Layer

A secret scanning hook (hooks/secret-scan-precommit.sh) blocks commits containing credentials before they enter git history. Uses gitleaks if installed, falls back to regex pattern matching for AWS keys, GitHub tokens, private keys, and generic secrets.

Recommended Configuration

For security-conscious projects, add security-sentinel as an always-on reviewer in .claude/review.json:

{
  "reviewers": {
    "include": ["security-sentinel"]
  }
}

How It All Fits Together

project-root/
├── CLAUDE.md                        # Project-specific config (you write this)
├── .claude/
│   ├── review.json                  # Optional: review config (tiers, reviewer overrides)
│   └── worktrees/                   # Worktrees for manual `claude --worktree` sessions
├── docs/
│   ├── session_summaries/           # Worker outputs (created by /finish-task)
│   │   └── reconciled/              # Processed by /reconcile-summary
│   ├── plans/                       # Output from /spec
│   ├── auto-run-checkpoint.json     # Auto-run state (survives restarts)
│   └── auto-run-logs/               # Wrapper iteration logs

~/.claude/                           # Global config (symlinked from this repo)
├── CLAUDE.md                        # Global workflow guidance
├── skills/                          # Slash commands
├── agents/                          # Specialized agent definitions
├── hooks/                           # Event hooks
├── scripts/                         # Wrapper scripts (auto-run.sh)
└── docs/                            # Global documentation

Principles

  1. Parallel by default, sequential when needed — Independent tasks run simultaneously in isolated git worktrees. Dependent tasks run sequentially on the branch (--sequential).
  2. Orchestrator + Workers — One session coordinates, workers execute discrete tasks and report back.
  3. Task-sized work — Big enough to be a meaningful atomic change, small enough to complete without exhausting context.
  4. Bounded autonomy — Clarify requirements first, then execute autonomously within those bounds.
  5. Tests as the contract — "Done" means tests pass. The code proves itself.
  6. Human in the loop — Humans approve PRs, prioritize tasks, and make architectural decisions.
  7. Handoffs over context bloat — Fresh context beats exhausted context.
  8. Session summaries — Every completed task leaves breadcrumbs for the next session.
  9. Save what you learn — Save debugging insights and non-obvious solutions to auto-memory when completing tasks.
  10. Codify the routine — Repeated patterns become skills. If you do something twice, automate it.
  11. Evaluate, don't agree — Verify claims against evidence before acting. No performative agreement.

Workflow Examples

Single-Session (with Linear)

/orient
/start-task INT-14
# implement...
/finish-task INT-14

Multi-Agent Parallel (independent tasks)

/orient
/dispatch --count 3
# 3 workers spawn, each in an isolated worktree, working in parallel
# Workers run in the background and notify on completion
/reconcile-summary

Sequential (dependent tasks, with Linear)

git checkout milestone/m4
/dispatch --sequential INT-14 INT-15 INT-16
# Tasks execute one at a time on the branch — each sees the previous task's commits
# No worktrees, no PRs per task. Review at milestone level.

Fully Autonomous (requires Linear)

# Interactive (parallel)
/auto-run --through INT-14

# Interactive (sequential — for dependent chains)
/auto-run --sequential --through INT-14

# Unattended (hours-long, restarts across context exhaustions)
~/.claude/scripts/auto-run.sh --max-hours 8

Full Planning Pipeline

# Single entry point — runs product review, spec, deep reviews, decomposition
/deep-plan "real-time price alerts for crypto"
/orient
/dispatch

# Autonomous (auto-answers all checkpoints)
/deep-plan "dashboard redesign" --yes

# Light mode (skip deep reviews, rely on spec's Phase 2.5)
/deep-plan "new feature" --skip-reviews

# Or manually chain individual planning skills
/product-review DESIGN    # map user journeys and interaction patterns first
/spec "dashboard redesign"
/multi-review --plan docs/plans/<plan>.md   # review plan before coding
/orient
/dispatch

Setting Up a New Project

Global workflow config loads automatically from ~/.claude/CLAUDE.md. Each project only needs its own CLAUDE.md for project-specific details:

cd /path/to/your/project
claude
> /orient                  # Start working

Your project CLAUDE.md should include: project summary, dev commands (uv run pytest, pnpm dev, etc.), critical rules, and architecture overview. Everything else comes from the global config.

Optionally, create a .claude/review.json to configure risk tiers and reviewer overrides. Framework-specific reviewers auto-detect from changed files — no config needed. See docs/examples/review-fullstack.json for an example.

For security-conscious projects: add "include": ["security-sentinel"] to your review config for always-on security review, and copy docs/examples/security-checks.yml to .github/workflows/ for CI/CD SAST and container scanning.


Working Without Linear

All skills work without a task tracker. Here's what changes:

SkillWithout LinearWith Linear
/deep-planFull pipeline, skips decomposition phaseFull pipeline including Linear decomposition
/specWrites plan file to docs/plans/Writes plan file + creates Linear issues with dependencies
/orientReads git state, plan files, code healthReads git state + Linear project board with full task graph
/dispatchN/A (run tasks manually from plan)Spawns workers from ready issues, delegates via Linear
/start-taskN/A (start manually, no task tracking)Claims issue, tracks status, gathers context from issue
/finish-taskTests, commit, PR, session summaryTests, commit, PR, session summary + closes Linear issue
/auto-runN/A (manual workflow)Autonomous dispatch-reconcile loop
/multi-reviewFull parallel code reviewFull parallel code review
/product-reviewFull product reviewFull product review
/milestone-reviewFull iterative review-fixFull iterative review-fix

Without Linear, the workflow is:

/spec "your feature"          # Writes docs/plans/<plan>.md
# Read the plan, implement each section
/finish-task                  # Tests, commit, PR (skips task tracking)

With Linear, the workflow is:

/spec "your feature"          # Writes plan + creates Linear issues
/orient                       # Reads Linear board, identifies parallel streams
/dispatch --count 3           # Spawns 3 parallel workers from ready issues
/reconcile-summary            # Syncs worker results back to Linear

FAQ

How is this different from using Claude Code directly?

Claude Code runs as a single agent. claude-corps adds orchestration — multiple Claude sessions working in parallel on different tasks, each on isolated branches, coordinated by a team lead session that dispatches work and reconciles results.

Can I run this unattended for hours?

Yes. /auto-run with the wrapper script (~/.claude/scripts/auto-run.sh) runs for hours, restarting Claude when context is exhausted. State is checkpointed and restored across restarts. PRs are created but never auto-merged — you review when ready.

Do I need Linear?

No. Linear is optional. Without it, /spec writes plan files to docs/plans/ and you execute from them directly. All planning, review, and quality skills work without any task tracker.

With Linear MCP connected, skills automatically create issues, track status, manage dependencies, and coordinate parallel workers. Add it with:

claude mcp add --transport http linear-server https://mcp.linear.app/mcp

Can I use this without any task tracker?

Yes. All skills degrade gracefully. /spec produces plan files. /orient surveys git state and code. /multi-review runs full review. /finish-task still does tests, commit, and PR. The orchestration skills (/dispatch, /auto-run) require Linear for task discovery, but you can always run tasks manually.

What is the review config?

The review config (.claude/review.json) lets you configure per-project file sensitivity levels and reviewer overrides. It drives three behaviors: (1) /multi-review selects more reviewers for higher-risk files, (2) /dispatch uses plan-mode for critical/high-risk tasks, and (3) model selection routes critical/high tasks to Opus and medium/low to Sonnet. Framework-specific reviewers auto-detect from changed files — use reviewers.exclude to suppress false positives and reviewers.include to force always-on reviewers. Add "codex": {"enabled": true} to include Codex second-opinion reviews. Without a config file, skills fall back to keyword-based detection.

How does smart model selection work?

/dispatch automatically selects Opus for critical/high-risk tasks and Sonnet for medium/low tasks. Use --model opus or --model sonnet to override. During /multi-review, critical-tier files get Opus for security-sentinel and architecture-strategist reviews.

How does security review work?

claude-corps uses a layered model: the security-sentinel AI agent handles business logic vulnerabilities (IDOR, auth bypass, absence detection) that require understanding intent, while CI/CD tools handle deterministic checks (SAST via Semgrep, container scanning via Trivy, dependency audit, secret scanning). A pre-commit hook catches secrets before they enter git history. When SAST results are available, the AI agent triages them in context — confirming real vulnerabilities and dismissing false positives. See the Security section for setup.

Does this require Agent Teams?

No. claude-corps uses subagent worktrees (isolation: "worktree" on the Agent tool) for parallel execution and plain subagents (no isolation) for sequential execution. No Agent Teams configuration needed. All skills work with standard Claude Code.

When should I use --sequential vs default parallel?

Use --sequential when tasks depend on each other (Phase 1 must complete before Phase 2 starts) or when tasks modify overlapping files. Use default parallel when tasks are independent. Sequential avoids merge conflicts and worktree overhead but runs one task at a time.


How claude-corps Fits the Ecosystem

ToolWhat It DoesRelationship
Claude CodeAnthropic's agentic coding CLIRequired — claude-corps extends it
Linear + MCPTask tracking and project managementOptional — skills use Linear MCP for task tracking when connected
OpenAI CodexAI coding agent with code reviewOptional — second-opinion reviewer in /multi-review and /milestone-review
Claude SquadManage multiple terminal Claude agentsAlternative approach to multi-agent
AiderAI pair programming in your terminalDifferent paradigm (pair vs team)

Prerequisites

Required

  • Claude Code — Anthropic's CLI for Claude
  • git — With worktree support (standard in modern git)
  • gh — GitHub CLI for PR creation. Install: brew install gh

For Task Tracking (Optional)

  • Linear MCP — Connects Linear project management to Claude Code for automatic issue creation, status tracking, and dependency management. Add with:
    claude mcp add --transport http linear-server https://mcp.linear.app/mcp
    
    Then run /mcp in Claude Code to authenticate via OAuth. Without Linear, all skills still work — you just manage tasks manually from plan files.

For Frontend Browser Testing (Optional)

  • Playwright MCP — Browser automation for workflow verification. Install with --isolated for clean browser state on each session:

    claude mcp add playwright -- npx @playwright/mcp@latest --headless --isolated
    npx playwright install chromium
    

    The --isolated flag starts each session with an ephemeral in-memory profile, preventing stale cache/cookies from prior sessions. Without it, Playwright persists state to disk at ~/Library/Caches/ms-playwright/.

    Used by /finish-task, /multi-review, and /milestone-review for workflow-based browser testing: cache clearing, diff-driven workflow inference, interactive testing (click, fill, type), persistence verification, and responsive checks. See docs/browser-testing-protocol.md for the full protocol.

For Codex Integration (Optional)

  • OpenAI Codex CLI — Second-opinion code review from a different AI model. Install and authenticate:
    npm install -g @openai/codex
    codex login
    
    Used by /multi-review --codex and /milestone-review (when enabled in review.json) to run Codex reviews in parallel with Claude review agents. Requires a ChatGPT subscription or OpenAI API key.

For Unattended Auto-Run

  • expect — Allocates a pty for the wrapper script. Install: brew install expect (macOS) or apt install expect (Linux). Not needed if using /auto-run interactively.

Installation

git clone https://github.com/josephneumann/claude-corps.git ~/Code/claude-corps
cd ~/Code/claude-corps && ./install.sh
source ~/.zshrc

The installer creates symlinks from ~/.claude/ to this repo. It's idempotent — safe to run multiple times. Existing directories are backed up.

Verify:

ls -la ~/.claude/skills ~/.claude/hooks ~/.claude/agents ~/.claude/scripts ~/.claude/docs
<details> <summary><strong>Adding custom skills and hooks</strong></summary>

Adding a Skill

Create skills/my-skill/SKILL.md with frontmatter:

---
name: my-skill
description: "When to invoke this skill"
allowed-tools: Read, Bash, Glob, Grep
---

# My Skill: $ARGUMENTS

Instructions for Claude...

Commit and push — available as /my-skill in all projects.

Adding a Hook

Create an executable script in hooks/:

vim ~/Code/claude-corps/hooks/my-hook.sh
chmod +x ~/Code/claude-corps/hooks/my-hook.sh

Register it in ~/.claude/settings.json under the appropriate event.

</details>

Inspiration

claude-corps' verification discipline, debugging methodology, and skill authoring guidance draw from obra/superpowers — Jesse Vincent's excellent single-agent skills framework. Specific techniques adapted:

  • Anti-rationalization tables and Iron Laws for prompt-level compliance (verification-before-completion skill)
  • CSO (Claude Search Optimization) for skill description design (writing-skills skill)
  • Systematic debugging methodology (debug skill)
  • Persuasion-informed skill design based on Cialdini principles (writing-skills skill)
  • Subagent distrust model for processing agent reports (reconcile-summary skill)

Where superpowers optimizes a single agent session, claude-corps orchestrates many agents in parallel. The two frameworks are complementary — we borrowed their single-agent discipline to strengthen our multi-agent system.


License

MIT