Loop Engineering

Set up a loop: an autonomous system that prompts the agent so you don't have to. You define the goal and the stop condition once; the loop finds work, does it, checks it, writes down what happened, and runs again until the goal is met. This skill stands that loop up inside your existing project: it reads how you already test, build, and track work, then fits the loop to your conventions instead of imposing generic ones.

A loop is built from six primitives. Set up only the ones the goal needs.

| Primitive | Role in the loop | Claude Code mechanism |
| --- | --- | --- |
| Trigger | starts each cycle, on a schedule or an event | `/goal` (work across turns until a condition is met), `/loop`, scheduled tasks, hooks, GitHub Actions |
| State | the loop's memory between cycles, on disk | a markdown file (`LOOP.md`) or an issue tracker via MCP |
| Discovery | finds what to work on this cycle | a triage step that reads CI, issues, diffs, a queue |
| Maker | does one unit of the work | a subagent in `.claude/agents/` |
| Checker | verifies the maker, independently | a different subagent in `.claude/agents/` |
| Stop condition | the rule that ends the loop | a verifiable check the checker runs |
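
Read as code, the six primitives fit together roughly as follows. This is a minimal Python sketch, not how Claude Code runs a loop: `LoopState`, `run_loop`, and the callables passed to it are hypothetical names, the trigger is simply whoever calls `run_loop`, and a failed check parks the item in `blocked` instead of retrying it.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class LoopState:
    """In-memory stand-in for the on-disk state (hypothetical shape)."""
    open_items: List[str] = field(default_factory=list)
    done: List[str] = field(default_factory=list)
    blocked: List[str] = field(default_factory=list)


def run_loop(
    state: LoopState,
    discover: Callable[[LoopState], List[str]],  # Discovery
    make: Callable[[str], str],                  # Maker: returns an artifact
    check: Callable[[str, str], bool],           # Checker: independent verdict
    stop: Callable[[LoopState], bool],           # Stop condition
    max_cycles: int = 10,                        # bound, in case stop never fires
) -> int:
    """Run cycles until stop(state) is true or max_cycles is hit; return cycles used."""
    cycles = 0
    while cycles < max_cycles and not stop(state):
        cycles += 1
        # Discovery: add newly found items, skipping ones already tracked.
        tracked = state.open_items + state.done + state.blocked
        for item in discover(state):
            if item not in tracked:
                state.open_items.append(item)
                tracked.append(item)
        if not state.open_items:
            continue  # nothing to do this cycle; the bound still counts it
        item = state.open_items.pop(0)
        artifact = make(item)
        # Only the checker's verdict moves an item to done.
        if check(item, artifact):
            state.done.append(item)
        else:
            state.blocked.append(item)
    return cycles
```

The maker and checker are separate callables on purpose: the function that produces the artifact never decides whether it passed.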

For any loop that runs unattended, add a monitor: an oversight agent that watches the loop as a whole and halts it on drift (going off-goal), spinning (no net progress), or a crossed bound (too many cycles, too long). The maker and checker keep each item correct; the monitor keeps the whole loop pointed at the goal, so it cannot quietly wander off and run for hours.
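
The three halt conditions can be sketched as one function over a per-cycle log. Everything here is an assumption for illustration: `CycleRecord`, its fields, the default bounds, and the rule that "spinning" means the Done count has not grown over the last few cycles. In practice, whether a cycle was on-goal is a judgment the monitor agent makes; the sketch takes it as an input.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CycleRecord:
    """One cycle's summary (hypothetical fields the loop would log)."""
    done_count: int          # items in Done after this cycle
    on_goal: bool            # did this cycle's work serve the goal?
    elapsed_seconds: float   # wall-clock time since the loop started


def halt_reason(
    history: List[CycleRecord],
    max_cycles: int = 20,
    max_seconds: float = 3600.0,
    stall_window: int = 3,
) -> Optional[str]:
    """Return why the loop should halt, or None to let it continue."""
    if not history:
        return None
    latest = history[-1]
    # Crossed bound: too many cycles or too much time.
    if len(history) >= max_cycles or latest.elapsed_seconds >= max_seconds:
        return "bound"
    # Drift: the latest work did not serve the goal.
    if not latest.on_goal:
        return "drift"
    # Spinning: Done has not grown across the last `stall_window` cycles.
    if len(history) > stall_window:
        if latest.done_count <= history[-1 - stall_window].done_count:
            return "spinning"
    return None
```

A non-`None` result is the point at which the monitor would stop the trigger and escalate to a human.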

Two more optional add-ons: worktrees (git worktree) isolate parallel makers so they don't edit the same files, and MCP connectors let the loop touch real tools (open a PR, update a ticket, post status). Add them when the goal needs parallelism or external side effects.

Setup procedure

This runs inside an existing project. Follow these steps when invoked. Confirm with the user before anything with side effects: scheduled jobs, pushes, ticket writes, deploys.

Read the project first. Before scaffolding anything, learn the repo: how it runs its tests, build, and lint (check package.json, Makefile, CI config, CLAUDE.md); what conventions and existing skills or .claude/agents/ it already has; and where work comes from (CI, an issue tracker, a backlog, a queue). The loop has to fit this project. Reuse its real commands and conventions; don't invent new ones.
Pin the goal and the stop condition. Ask the user for the loop's one goal and the verifiable condition that means done (tests exit 0, the queue is empty, zero failing checks). A loop with no stop condition is the one thing you must never ship. If the user can't name one, set a bounded fallback (stop after N cycles) and say so out loud. In Claude Code, /goal <condition> makes this executable: Claude keeps working across turns until the condition is met, and /goal clear ends it early.
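
A verifiable stop condition with a bounded fallback might look like the sketch below. The function names and the `passing` / `failing` stand-in commands are hypothetical; a real loop would pass the project's own test command instead.

```python
import subprocess
import sys
from typing import List


def stop_condition_met(command: List[str]) -> bool:
    """True when the verification command exits 0."""
    result = subprocess.run(command, capture_output=True)
    return result.returncode == 0


def should_stop(command: List[str], cycle: int, max_cycles: int) -> bool:
    """Stop when the check passes, or when the bounded fallback is reached."""
    return cycle >= max_cycles or stop_condition_met(command)


# Stand-in commands so the sketch runs anywhere; they only set an exit code.
passing = [sys.executable, "-c", "raise SystemExit(0)"]
failing = [sys.executable, "-c", "raise SystemExit(1)"]
```

The exit code is the whole verdict: nothing in `should_stop` reads the command's output or asks a model whether the work looks finished.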
Write the state file. Copy ${CLAUDE_SKILL_DIR}/templates/LOOP.md to the repo root and fill in goal, stop condition, cadence, and the human gate. It has three living sections the loop maintains: ## Open (found, not done), ## Done (with the evidence that verified it), ## Blocked (needs a human), plus a ## Rules section that is the standing instruction set every agent in the loop follows. The loop reads this first every cycle and writes it last.
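
Reading the state first and writing it last implies a small parser and renderer for the living sections. The template itself is not shown here, so this sketch assumes that items are plain `- ` bullets under `## ` headings; `parse_loop_state` and `render_loop_state` are hypothetical helpers, and the header fields (goal, stop condition, cadence, human gate) are out of scope.

```python
from typing import Dict, List

SECTIONS = ("Open", "Done", "Blocked", "Rules")


def parse_loop_state(text: str) -> Dict[str, List[str]]:
    """Collect the "- " bullet items under each known "## " heading."""
    state: Dict[str, List[str]] = {name: [] for name in SECTIONS}
    current = None
    for line in text.splitlines():
        stripped = line.strip()
        if stripped.startswith("## "):
            name = stripped[3:].strip()
            # Unknown headings switch collection off until a known one appears.
            current = name if name in state else None
        elif current and stripped.startswith("- "):
            state[current].append(stripped[2:].strip())
    return state


def render_loop_state(state: Dict[str, List[str]]) -> str:
    """Write the sections back out in a fixed order."""
    parts: List[str] = []
    for name in SECTIONS:
        parts.append(f"## {name}")
        parts.extend(f"- {item}" for item in state.get(name, []))
        parts.append("")
    return "\n".join(parts)
```

Rendering in a fixed section order keeps diffs of the state file small from one cycle to the next.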
Create the maker subagent. Copy ${CLAUDE_SKILL_DIR}/templates/loop-maker.md to .claude/agents/loop-maker.md and point it at the conventions and test command you found in step 1. It does exactly one item from ## Open, follows the project's existing skills and conventions, and never marks its own work done. If a file by that name already exists, update it rather than overwriting blindly.
Create the checker subagent. Copy ${CLAUDE_SKILL_DIR}/templates/loop-checker.md to .claude/agents/loop-checker.md. It is a different agent from the maker, it runs the project's actual verification (the real test, build, or lint command from step 1, not a description of it), and it is the only agent allowed to move an item to ## Done.
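
The rule that only the checker moves an item to ## Done can be pictured as a guarded state transition. In the loop itself this is enforced by the subagent definitions, not by code; `move_to_done` is a hypothetical helper that only illustrates the rule, using the agent names from the file names above.

```python
from typing import Dict, List


def move_to_done(
    state: Dict[str, List[str]], item: str, actor: str, evidence: str
) -> None:
    """Move `item` from Open to Done, but only for the checker and only with evidence."""
    if actor != "loop-checker":
        raise PermissionError(f"{actor} may not mark work done")
    if not evidence:
        raise ValueError("Done entries need the evidence that verified them")
    state["Open"].remove(item)  # raises ValueError if the item is not open
    state["Done"].append(f"{item} ({evidence})")
```

Recording the evidence next to the item is what lets a later cycle, or a human, see why the item was accepted.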
Create the monitor subagent (recommended for any unattended loop). Copy ${CLAUDE_SKILL_DIR}/templates/loop-monitor.md to .claude/agents/loop-monitor.md, and run it every few cycles or on a slower timer than the loop. It watches the loop as a whole rather than any single item: it checks whether the loop is still converging on the goal, and it halts the loop and escalates to a human on drift (work that does not serve the goal), spinning (no net progress, items bouncing between Open and Blocked), or a crossed bound. This is what stops a loop from quietly wandering off and running for hours.
Set the code-review and security-audit cadence (ask the user). Ask how often the loop should run each, and record both in LOOP.md. Offer clear options and always include never:
- Code review (/code-review): every change, before each PR or merge, every N cycles, weekly, or never.
- Security audit (/security-review): every change, before each PR or merge, weekly, monthly, or never.
At the chosen cadence the loop runs that built-in pass and files what it finds into ## Open as new work items, so the maker fixes them and the checker verifies the fix. If the user picks never, skip that pass entirely and record never in LOOP.md so the loop does not reintroduce it.
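
For the cycle-based options, the cadence check reduces to a few lines. This is a sketch under assumptions: `review_due` is a hypothetical helper, the cadence strings are one possible way to record the choice in LOOP.md, and time- or event-based options (weekly, monthly, before each PR or merge) are deliberately left unhandled.

```python
def review_due(cadence: str, cycle: int) -> bool:
    """Decide whether a review pass runs on this cycle (1-based).

    Recognised values are assumptions for this sketch: "never",
    "every change", and "every N cycles" (for example "every 5 cycles").
    """
    cadence = cadence.strip().lower()
    if cadence == "never":
        return False
    if cadence == "every change":
        return True
    parts = cadence.split()
    if (
        len(parts) == 3
        and parts[0] == "every"
        and parts[1].isdigit()
        and parts[2] == "cycles"
    ):
        n = int(parts[1])
        return n > 0 and cycle % n == 0
    raise ValueError(f"unrecognised cadence: {cadence!r}")
```

Treating `never` as an explicit value, not as a missing setting, is what keeps the loop from reintroducing a pass the user declined.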
Define discovery. Write the triage step that fills ## Open each cycle from where this project's work actually lives (CI status, open issues, recent commits, a work queue), and specify how it turns findings into discrete, individually checkable items. Make it deterministic wherever you can, so the same inputs always produce the same items.
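
One way to keep triage deterministic is to derive each item's identity from its content and to sort the result. In this sketch `triage`, the `(source, description)` input shape, and the `[source#id]` item format are all hypothetical.

```python
import hashlib
from typing import Dict, Iterable, List, Tuple


def triage(findings: Iterable[Tuple[str, str]]) -> List[str]:
    """Turn (source, description) findings into stable, de-duplicated items.

    The same findings always yield the same items in the same order,
    regardless of the order they were discovered in.
    """
    items: Dict[str, str] = {}
    for source, description in findings:
        text = " ".join(description.split())  # normalise whitespace
        key = hashlib.sha1(f"{source}:{text}".encode("utf-8")).hexdigest()[:8]
        items[key] = f"[{source}#{key}] {text}"
    return [items[key] for key in sorted(items)]
```

Because the identifier depends only on the source and the normalised text, a finding that reappears in a later cycle maps to the same item and is not filed twice.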
Choose isolation (optional). If more than one maker runs per cycle, give each its own git worktree and branch so they never write the same file. One worktree per item.
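
Creating one worktree and branch per item comes down to one `git worktree add -b <branch> <path>` call each. The sketch below only builds that command and does not run it; `worktree_command`, the `loop/` branch prefix, and the `loop-<slug>` directory layout are assumptions.

```python
import re
from pathlib import Path
from typing import List


def worktree_command(repo_parent: Path, item_id: str) -> List[str]:
    """Build the `git worktree add` call for one item (not executed here)."""
    slug = re.sub(r"[^a-z0-9]+", "-", item_id.lower()).strip("-")
    if not slug:
        raise ValueError("item_id must contain at least one letter or digit")
    branch = f"loop/{slug}"              # hypothetical branch naming
    path = repo_parent / f"loop-{slug}"  # hypothetical directory layout
    # `-b` creates the new branch and checks it out in the new worktree.
    return ["git", "worktree", "add", "-b", branch, str(path)]
```

The returned list can be handed to `subprocess.run(cmd, check=True, cwd=repo_root)` from inside the repository once the user has confirmed it.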
Wire connectors (optional). If the loop must open PRs, update tickets, or post status, connect the matching MCP server. Otherwise the loop's output stays local and a human relays it.
Set the trigger. Pick the lightest mechanism that fits the cadence:

- run until the goal is met, in this session → /goal <stop condition> (Claude works across turns until the condition is true; /goal clear ends it early)
- repeat on a cadence, on demand → /loop
- periodic and unattended → a scheduled task (or cron / a GitHub Action for CI-side loops)
- react to an event (push, PR, file change) → a hook or a GitHub Action

Confirm with the user before creating any scheduled job.

Dry-run, verify, then enable. Run one cycle by hand. Read everything it produced. Confirm the checker actually blocks bad output and that state was written. Only then turn on the trigger.

Guardrails (non-negotiable)

The checker is never the maker. The agent that wrote the code does not get to declare it correct.
The stop condition is verifiable, not vibes. "Looks done" is not a stop condition. A command that exits 0 is.
State is written every cycle. If the loop forgets what it tried, it repeats itself. The model forgets; the file does not.
A human gate stays on anything irreversible: shipping, deploying, deleting, spending money. The loop drafts; a person confirms. An unattended loop makes unattended mistakes.
Blocked beats guessing. When the loop can't verify or can't proceed, it writes to ## Blocked and stops that item. It does not ship a guess.
A long loop is watched and bounded. Any unattended loop runs with a monitor and a bound. If it drifts off the goal or runs past the bound, the monitor halts it and escalates. A loop never gets to run forever in the wrong direction.
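
The human gate in particular is easy to express as a wrapper around side effects. This is an illustrative sketch only: `run_action`, the `IRREVERSIBLE` set, and its action names are hypothetical, and `confirm` stands in for whatever channel actually asks a person.

```python
from typing import Callable

IRREVERSIBLE = {"ship", "deploy", "delete", "spend"}  # hypothetical action names


def run_action(
    kind: str,
    action: Callable[[], str],
    confirm: Callable[[str], bool],
) -> str:
    """Run `action`, asking `confirm` first when `kind` is irreversible."""
    if kind in IRREVERSIBLE and not confirm(kind):
        return f"blocked: {kind} needs human approval"
    return action()
```

An unconfirmed irreversible action is reported, not executed, which matches the rule that the loop writes to ## Blocked when it cannot proceed.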

What the loop looks like once it's set up

your-project/
├── LOOP.md                   # goal, stop condition, bounds, rules, Open/Done/Blocked
├── .claude/
│   └── agents/
│       ├── loop-maker.md      # does one unit of work
│       ├── loop-checker.md    # verifies it, independently
│       └── loop-monitor.md    # watches the whole loop for drift + runaway
└── (trigger: /goal, /loop, a scheduled task, or a GitHub Action)

Anti-patterns: do not ship these

A loop with no stop condition (runs forever, burns tokens).
One agent that writes and grades its own work.
A loop with no state file (re-derives everything, repeats work, forgets what failed).
An unattended loop with no monitor or bounds (drifts off-goal and runs for hours before anyone notices).
Unattended shipping with no human gate on irreversible actions.
One loop carrying many unrelated jobs. One loop, one goal, one stop condition. Split the rest into their own loops.

Files in this skill

templates/LOOP.md: the state file to copy into the target repo.
templates/loop-maker.md: the maker subagent definition.
templates/loop-checker.md: the checker subagent definition.
templates/loop-monitor.md: the monitor subagent definition (drift and runaway guard).
examples/keep-ci-green.md: a complete worked loop, from goal to trigger.

sxivansx/loop-engineering
