
Staff Platform Engineer. Obsessed with figuring out how to make AI as reliable as Infrastructure
Most Used Tags
Explore upstream codebases to streamline open source contributions.
Analyze PR outcomes to improve future contributions by learning from accept/reject patterns.
Review incoming PRs, agent-generated changes, or diffs. Structured review with security, correctness, performance, and maintainability checks. Triggers: "review", "review PR", "review changes", "code review", "review this PR", "review agent output", "check this diff".
Safe, incremental refactoring with regression testing at each step.
Analyze code complexity and identify refactor targets using radon and gocyclo.
Test generation, coverage analysis, and TDD workflow. Triggers: "test", "generate tests", "test coverage", "write tests", "tdd", "add tests", "test strategy", "missing tests", "coverage gaps".
Compiles raw knowledge artifacts into an interlinked markdown wiki.
Create structured handoff for session continuation. Triggers: handoff, pause, save context, end session, pick up later, continue later.
Manage holdout scenarios for behavioral validation, ensuring isolation from implementing agents.
Automate project scaffolding, component generation, and CI/CD setup with ease.
Reverse-engineer products into feature catalogs, code maps, and specifications with verification gates.
Interactive overnight operator mode for Dream, facilitating setup, runs, and reports.
Validate pull requests for isolation, scope creep, and quality before submission.
Conduct persona-based adversarial validation to ensure usability of docs and skills.
Continuous repository security scanning and release gating. Triggers: "security scan", "security audit", "pre-release security", "run scanners", "check vulnerabilities".
Meta skill for the AgentOps operating model, facilitating coding agent workflows.
Manage and track progress in the RPI workflow with permanent gates.
Scaffold and audit OSS documentation packs for open source projects.
Epic decomposition into trackable issues. Triggers: "create a plan", "plan implementation", "break down into tasks", "decompose into features", "create beads issues from research", "what issues should we create", "plan out the work".
Automate epic execution hands-free until all tasks are completed.
Operationalize a mature .agents corpus into actionable knowledge surfaces.
Performance profiling and optimization tool for various programming languages.
Streamline your PR preparation with automated validation and structured body generation.
Spawn isolated agents for parallel task execution with automatic runtime selection.
Provides language-specific coding standards and validation rules for various programming languages.
Trace design decisions and concepts through session history, handoffs, and git.
Orchestrate multiple Codex agents to execute focused tasks efficiently.
Deep codebase exploration. Triggers: research, explore, investigate, understand, deep dive, current state.
Autonomous improvement loop for continuous code evolution and issue resolution.
Orchestrates the full validation phase to ensure implementation quality and extract learnings.
Quickly onboard new users to AgentOps with a streamlined setup guide.
Orchestrates the full RPI lifecycle with a single command, delegating to phase-specific skills.
Extract knowledge from session transcripts to identify decisions, learnings, failures, and patterns.
Systematically investigate bugs or perform proactive code audits.
Trace knowledge artifact lineage and identify stale citations or orphans.
Deep codebase exploration and analysis. Use for understanding code architecture, finding patterns, and gathering context before making changes.
Build and maintain a compounding external-knowledge wiki from clipped articles, papers, and transcripts. Triggers: "llm wiki", "ingest this", "second brain", "compile my reading", "wiki lint", "what do we know about <topic>". Based on Andrej Karpathy's LLM Wiki pattern (April 2026).
Quickly capture insights and lessons learned for future reference.
Access up-to-date OpenAI documentation for building with APIs and products.
Product validation gate for RPI pipeline. Validates goal alignment with PRODUCT.md before discovery. Checks: gap alignment, persona fit, competitive differentiation, precedent, scope boundaries. Council-gated with --preset=product. Triggers: "design", "product validation", "validate product fit", "design gate".
Fork-based PR implementation with mandatory isolation checks for clean contributions.
AgentOps enhances coding agents with bookkeeping, validation, and reusable flows.
Expert code review specialist. Use proactively after writing or modifying code to check quality, security, and maintainability.
Single-screen dashboard showing current work, recent validations, flywheel health, and suggested next action. Triggers: "status", "dashboard", "what am I working on", "where was I".
Multi-model consensus council. Spawns parallel judges with configurable perspectives. Modes: validate, brainstorm, research. Triggers: "council", "get consensus", "multi-model review", "multi-perspective review", "council validate", "council brainstorm", "council research".
Composable security suite for binary and prompt-surface assurance with modular testing primitives.
Manage git-based issue tracking with bd CLI for efficient task organization.
Orchestrates the full discovery phase for project planning and research.
Clarify goals and explore approaches before planning a solution.
Design and validate Grafana dashboards for OpenShift/Kubernetes operations to enhance platform health visibility.
Manage operational contracts for autonomous development loops with ease.
Execute a single issue with full lifecycle. Triggers: "implement", "work on task", "build this", "start feature", "pick up next issue", "work on issue".
One command to set up the full AgentOps product layer, filling gaps as needed.
Automated skill maintenance tool that detects and fixes common skill issues.
Execute a seamless test, commit, and push workflow in one go.
Generate a gold-standard README for any project with guided interviews and validation.
Generates, validates, and syncs documentation for any repository type. Produces code-maps, checks doc coverage, finds missing docs, and validates existing documentation against code. Triggers: doc, documentation, code-map, doc coverage, validate docs, generate docs, sync docs, update docs, find missing docs.
Comprehensive code validation tool that assesses readiness for deployment.
Wrap up completed work and extract valuable insights through a structured post-mortem process.
Plan open source PR contributions with clear scope and acceptance criteria.
Validate plans or specs before implementation using multi-model judgment.
Recovers context after compaction by detecting in-progress sessions and summarizing recent work.
Release your software. Pre-flight validation, changelog generation, version bumps, release commit, tag, curated release notes. Boundary: everything up to the git tag. Triggers: "release", "cut a release", "prepare release", "release check".
Monitor the health of your knowledge flywheel by checking its velocity, pool depths, and staleness.
Generate a comprehensive PRODUCT.md by interviewing users about their product's mission, personas, and competitive landscape.
Cross-platform skill converter that transforms AgentOps skills into formats for Codex and Cursor.
Inject relevant knowledge into session context for enhanced decision-making.
Conduct dependency audits, updates, and vulnerability scans for various ecosystems.
Manage and track fitness goals with the GOALS.yaml and GOALS.md specifications.
Consolidate knowledge across multiple rigs with a single command.
Reinstall all AgentOps skills globally from the latest source. Triggers: "update skills", "reinstall skills", "sync skills".
Provides shared reference documents for multi-agent skills, enhancing collaboration.