name: agentic-workflow description: Active orchestrator for multi-agent flight execution. Drives leg design per leg, then batches implementation across all autonomous legs, with a single code review and commit at the end of the flight.
Agentic Workflow
Orchestrate multi-agent flight execution. You drive the full leg cycle — designing legs, spawning Developer and Reviewer agents, and managing git workflow — for a target project's flight. Leg design is reviewed per leg, but code review and commit are deferred until after the last autonomous leg completes. This eliminates per-leg review/commit overhead while keeping the same leg design and implementation structure.
Prerequisites
- Project must be initialized with
/init-project(.flightops/ARTIFACTS.mdmust exist) - A mission must exist and be
active - A flight must exist and be
readyorin-flight
Invocation
/agentic-workflow flight {number} for {project-slug} mission {number}
Example: /agentic-workflow flight 03 for epipen mission 04
Phase 1: Context Loading
- Read
projects.mdto find the target project's path - Read
{target-project}/.flightops/ARTIFACTS.mdfor artifact locations - Read
{target-project}/.flightops/agent-crews/leg-execution.mdfor project crew definitions, interaction protocol, and prompts (fall back to defaults at.claude/skills/init-project/defaults/agent-crews/leg-execution.md)- Validate structure: The phase file MUST contain
## Crew,## Interaction Protocol, and## Promptssections. Each prompt subsection MUST have a fenced code block. - If the file exists but is malformed: STOP. Tell the user: "Phase file
leg-execution.mdis missing required sections. Either fix it manually or re-run/init-projectto reset to defaults." Do NOT improvise missing prompts — halt and get the file fixed.
- Validate structure: The phase file MUST contain
- Read the mission artifact — outcomes, success criteria, constraints
- Read the flight artifact — objective, design decisions, leg list
- Read the flight log — ground truth from prior execution
- Count total legs from the flight spec — track progress throughout
- Determine starting point — which leg is next based on flight log and leg statuses
Mark flight as in-flight: After loading the flight artifact, if the flight status is ready, update it to in-flight before proceeding. If already in-flight, leave it as-is.
If resuming a flight already in progress, verify state consistency:
- Flight log entries must match leg statuses
- If discrepancies exist, remediate before proceeding
Phase 2: Leg Cycle
Repeat for each leg in the flight.
2a: Leg Design
- Design the leg using the
/legskill (if the Skill tool is unavailable, read.claude/skills/leg/SKILL.mdand follow the workflow directly)- Read the flight spec, flight log, and relevant source code
- Create the leg artifact with acceptance criteria
- Spawn a Developer agent for design review (Task tool,
subagent_type: "general-purpose")- Working directory:
{target-project} - Provide the "Review Leg Design" prompt from the leg-execution phase file's Prompts section
- The Developer reads the leg artifact and cross-references against actual codebase state
- The Developer provides a structured assessment: approve, approve with changes, or needs rework
- Working directory:
- Incorporate feedback — update the leg artifact to address any issues raised
- High-severity issues: must fix before proceeding
- Medium-severity issues: fix unless there's a clear reason not to
- Low-severity issues and suggestions: apply at discretion
- Re-review if substantive changes were made — spawn another Developer for a second pass
- Skip if only minor/cosmetic fixes were applied
- If the second review raises new high-severity issues, fix and re-review once more
- Max 2 design review cycles — if issues persist after 2 rounds, escalate to human
- Update leg status to
ready - Signal
[HANDOFF:review-needed]when the leg design is finalized
2b: Leg Implementation
NEVER implement code directly. Spawn a Developer agent via the Task tool.
Interactive/HAT legs: If the leg is a HAT (human acceptance test), alignment, or other interactive leg (identified by slug like hat-*, alignment-*, or explicit marking in the flight spec), do NOT spawn agents to execute it autonomously. The human performs verification — the Flight Director guides them through it:
- Design the leg normally (2a), but keep it lightweight — the acceptance criteria are verification steps, not implementation tasks
- Skip the autonomous implementation cycle (no Developer/Reviewer agents)
- Guide the human through verification steps one at a time — present a single step, wait for the human to perform it and report results, then proceed to the next step
- Fix issues inline — if the human reports a failure, diagnose and fix it (spawning a Developer agent if code changes are needed), then re-verify that step before moving on
- Commit when all steps pass — update artifacts and commit
Standard (autonomous) legs: Spawn a Developer agent — but do NOT review or commit after each leg.
- Spawn a Developer agent (Task tool,
subagent_type: "general-purpose")- Working directory:
{target-project} - Provide the "Implement" prompt from the leg-execution phase file's Prompts section
- The Developer updates leg status to
in-flight, implements to acceptance criteria - When done, the Developer updates leg status to
landedand updates flight log — do NOT let it commit or signal[HANDOFF:review-needed]
- Working directory:
2c: Leg Transition
After the Developer completes a leg:
- Increment
legs_completed - If more autonomous legs remain → return to 2a
- If this was the last autonomous leg → proceed to Phase 2d
2d: Flight Review and Commit
After all autonomous legs are implemented (all uncommitted):
- Spawn a Reviewer agent (Task tool,
subagent_type: "general-purpose")- Working directory:
{target-project} - Provide the "Review" prompt from the leg-execution phase file's Prompts section
- The Reviewer evaluates ALL uncommitted changes against acceptance criteria and code quality
- The Reviewer signals
[HANDOFF:confirmed]or lists issues with severity
- Working directory:
- If issues found, spawn a new Developer agent to fix them
- Provide the "Fix Review Issues" prompt from the leg-execution phase file with the Reviewer's feedback
- Loop review/fix until the Reviewer confirms
- Commit after review passes — include all code changes, updated flight log, and all leg statuses updated to
completed - Manage PR: Open a draft PR with the leg checklist in the body (see PR Body Format below), all legs checked off
Phase 3: Flight Completion
- Verify all legs show
completedstatus - Verify flight log has entries for all legs
- Verify documentation — check that CLAUDE.md, README, and other project docs reflect any new commands, endpoints, configuration, or APIs introduced during the flight. If not, spawn a Developer agent to update them.
- Update flight status to
landed - Check off flight in mission artifact
- Signal
[COMPLETE:flight]
The flight debrief is a separate step run via /flight-debrief after the flight lands. The debrief transitions the flight to completed.
Architecture
The Flight Director (you) orchestrates according to this skill. Project crew composition, roles, models, and prompts are defined in {target-project}/.flightops/agent-crews/leg-execution.md.
Separation is mandatory. Project crew agents run in the target project and load its CLAUDE.md and conventions. The Reviewer has no knowledge of the Developer's reasoning — only the resulting changes. This provides objective review.
Model selection: Follow the model preferences in the phase file. MC may use Opus for complex planning. Never use Opus for the Reviewer.
Handoff Signals
Signals are part of the Flight Control methodology and are NOT configurable per-project. All crew agents must use these exact signals:
| Signal | Emitted By | Meaning |
|---|---|---|
[HANDOFF:review-needed] | Developer | Code/artifact ready for review |
[HANDOFF:confirmed] | Reviewer | Review passed |
[BLOCKED:reason] | Any crew agent | Cannot proceed, needs resolution |
[COMPLETE:leg] | Developer | Leg finished and committed |
[COMPLETE:flight] | Flight Director | Flight landed |
Flight Director Decision Log
The Flight Director must maintain transparency about its own decisions. After each major orchestration step, log what happened and why in the flight log under a ### Flight Director Notes subsection:
- Phase file loading — Record which phase file was loaded (project or default fallback) and what crew was extracted
- Agent spawning — Record which agent was spawned, with what prompt, and what model
- Review cycle decisions — When incorporating feedback, note what was accepted/rejected and why
- Escalation decisions — When choosing between "fix and re-review" vs "escalate to human," note the reasoning
- Signal interpretation — When a crew agent's output is ambiguous, note how it was interpreted
This is not a separate file — it goes in the flight log alongside leg entries. The goal is that anyone reviewing the flight log can understand not just what the crew did, but why the Flight Director made the orchestration choices it did.
Git Workflow
All agents work in the target project root on a feature branch created at flight start.
Branch naming: flight/{number}-{slug}
Flight start: git checkout -b flight/{number}-{slug}
Commit message format:
flight/{number}: {description}
Mission: {mission-number}
PR lifecycle:
| Event | Action |
|---|---|
| All legs complete | Open draft PR with all legs checked off |
| Flight landed | Mark PR ready for review |
PR body format:
## {Flight Title}
{Flight objective — one paragraph}
**Mission**: {Mission Title}
## Legs
- [x] `{leg-slug}` — {brief description}
- [x] `{leg-slug}` — {brief description}
Error Handling
| Situation | Action |
|---|---|
| Developer agent fails mid-leg | Spawn new Developer with context of what failed |
| Design review loops > 2 times | Escalate to human with unresolved design issues |
| Code review loops > 3 times | Escalate to human |
| Leg marked aborted | Escalate to human with abort details |
| Artifact discrepancy | Remediate before proceeding |
| Off the rails | Roll back to last leg commit, escalate |
| Agent hangs on tests | Kill the agent, spawn new Developer to isolate and fix hanging tests |