
Staff Platform Engineer. Obsessed with figuring out how to make AI as reliable as Infrastructure
Most Used Tags
Access up-to-date OpenAI documentation for building with APIs and products.
Analyze PR outcomes to improve future contributions by learning from accept/reject patterns.
Reinstall all AgentOps skills globally from the latest source. Triggers: "update skills", "reinstall skills", "sync skills".
Generates, validates, and syncs documentation for any repository type. Produces code-maps, checks doc coverage, finds missing docs, and validates existing documentation against code. Triggers: doc, documentation, code-map, doc coverage, validate docs, generate docs, sync docs, update docs, find missing docs.
Systematically investigate bugs or perform proactive code audits.
Safe, incremental refactoring with regression testing at each step.
Create structured handoff for session continuation. Triggers: handoff, pause, save context, end session, pick up later, continue later.
Test generation, coverage analysis, and TDD workflow. Triggers: "test", "generate tests", "test coverage", "write tests", "tdd", "add tests", "test strategy", "missing tests", "coverage gaps".
Inject relevant knowledge into session context for enhanced decision-making.
Validate plans or specs before implementation using multi-model judgment.
Analyze code complexity and identify refactor targets using radon and gocyclo.
Automated skill maintenance tool that detects and fixes common skill issues.
Review incoming PRs, agent-generated changes, or diffs. Structured review with security, correctness, performance, and maintainability checks. Triggers: "review", "review PR", "review changes", "code review", "review this PR", "review agent output", "check this diff".
Deep codebase exploration and analysis. Use for understanding code architecture, finding patterns, and gathering context before making changes.
AgentOps enhances coding agents with bookkeeping, validation, and reusable flows.
Compiles raw knowledge artifacts into an interlinked markdown wiki.
Manage holdout scenarios for behavioral validation, ensuring isolation from implementing agents.
Trace knowledge artifact lineage and identify stale citations or orphans.
Automate project scaffolding, component generation, and CI/CD setup with ease.
Interactive overnight operator mode for Dream, facilitating setup, runs, and reports.
Consolidate knowledge across multiple rigs with a single command.
Validate pull requests for isolation, scope creep, and quality before submission.
Fork-based PR implementation with mandatory isolation checks for clean contributions.
Epic decomposition into trackable issues. Triggers: "create a plan", "plan implementation", "break down into tasks", "decompose into features", "create beads issues from research", "what issues should we create", "plan out the work".
Build and maintain a compounding external-knowledge wiki from clipped articles, papers, and transcripts. Triggers: "llm wiki", "ingest this", "second brain", "compile my reading", "wiki lint", "what do we know about <topic>". Based on Andrej Karpathy's LLM Wiki pattern (April 2026).
Release your software. Pre-flight validation, changelog generation, version bumps, release commit, tag, curated release notes. Boundary: everything up to the git tag. Triggers: "release", "cut a release", "prepare release", "release check".
Performance profiling and optimization tool for various programming languages.
Recovers context after compaction by detecting in-progress sessions and summarizing recent work.
Generate a gold-standard README for any project with guided interviews and validation.
Spawn isolated agents for parallel task execution with automatic runtime selection.
Single-screen dashboard showing current work, recent validations, flywheel health, and suggested next action. Triggers: "status", "dashboard", "what am I working on", "where was I".
Provides language-specific coding standards and validation rules for various programming languages.
Quickly capture insights and lessons learned for future reference.
Design and validate Grafana dashboards for OpenShift/Kubernetes operations to enhance platform health visibility.
Deep codebase exploration. Triggers: research, explore, investigate, understand, deep dive, current state.
Autonomous improvement loop for continuous code evolution and issue resolution.
One command to set up the full AgentOps product layer, filling gaps as needed.
Conduct persona-based adversarial validation to ensure usability of docs and skills.
Wrap up completed work and extract valuable insights through a structured post-mortem process.
Meta skill for the AgentOps operating model, facilitating coding agent workflows.
Execute a seamless test, commit, and push workflow in one go.
Explore upstream codebases to streamline open source contributions.
Expert code review specialist. Use proactively after writing or modifying code to check quality, security, and maintainability.
Continuous repository security scanning and release gating. Triggers: "security scan", "security audit", "pre-release security", "run scanners", "check vulnerabilities".
Orchestrates the full validation phase to ensure implementation quality and extract learnings.
Cross-platform skill converter that transforms AgentOps skills into formats for Codex and Cursor.
Multi-model consensus council. Spawns parallel judges with configurable perspectives. Modes: validate, brainstorm, research. Triggers: "council", "get consensus", "multi-model review", "multi-perspective review", "council validate", "council brainstorm", "council research".
Orchestrate multiple Codex agents to execute focused tasks efficiently.
Clarify goals and explore approaches before planning a solution.
Trace design decisions and concepts through session history, handoffs, and git.
Orchestrates the full discovery phase for project planning and research.
Streamline your PR preparation with automated validation and structured body generation.
Manage operational contracts for autonomous development loops with ease.
Execute a single issue with full lifecycle. Triggers: "implement", "work on task", "build this", "start feature", "pick up next issue", "work on issue".
Plan open source PR contributions with clear scope and acceptance criteria.
Monitor the health of your knowledge flywheel by checking its velocity, pool depths, and staleness.
Manage git-based issue tracking with bd CLI for efficient task organization.
Operationalize a mature .agents corpus into actionable knowledge surfaces.
Automate epic execution hands-free until all tasks are completed.
Scaffold and audit OSS documentation packs for open source projects.
Composable security suite for binary and prompt-surface assurance with modular testing primitives.
Manage and track progress in the RPI workflow with permanent gates.
Conduct dependency audits, updates, and vulnerability scans for various ecosystems.
Reverse-engineer products into feature catalogs, code maps, and specifications with verification gates.
Quickly onboard new users to AgentOps with a streamlined setup guide.
Orchestrates the full RPI lifecycle with a single command, delegating to phase-specific skills.
Extract knowledge from session transcripts to identify decisions, learnings, failures, and patterns.
Product validation gate for RPI pipeline. Validates goal alignment with PRODUCT.md before discovery. Checks: gap alignment, persona fit, competitive differentiation, precedent, scope boundaries. Council-gated with --preset=product. Triggers: "design", "product validation", "validate product fit", "design gate".
Manage and track fitness goals with the GOALS.yaml and GOALS.md specifications.
Comprehensive code validation tool that assesses readiness for deployment.
Generate a comprehensive PRODUCT.md by interviewing users about their product's mission, personas, and competitive landscape.
Provides shared reference documents for multi-agent skills, enhancing collaboration.