name: eval-and-trace-designer
description: Design phase-gated evals, parseability checks, citation assertions, and judge-path regression coverage for Care Transitions Command.
compatibility: Codex-compatible skill for evaluation and QA work.
metadata:
  version: "2.0"
  owner: care-transitions-command
Eval and Trace Designer
Use this skill when
- adding or revising eval prompts
- defining failure and fallback assertions
- checking parseability or citation rules
- protecting hidden-risk and reconciliation behavior from drift
First read
- AGENTS.md
- PLAN.md
- docs/evals.md
- docs/phase0-hidden-risk-prompt-contract.md
- docs/phase0-orchestrator-decision-matrix.md
Goal
Keep the two-MCP plus external-A2A system inspectable and testable.
Required eval lenses
- Phase 1: Clinical Intelligence MCP smoke
- Phase 2: two-MCP integration
- Phase 3: external A2A orchestration
- parseability and citation gates
- failure and fallback behavior
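One way to read the lenses above is as ordered gates, where a later phase is only meaningful once the earlier ones pass. The sketch below is a minimal illustration under that assumption; `Lens`, `run_gated`, and the lens names are hypothetical and not taken from the project.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Lens:
    """Hypothetical wrapper: one eval lens with a pass/fail check."""
    name: str
    check: Callable[[], bool]  # returns True when the lens passes


def run_gated(lenses: List[Lens]) -> List[Tuple[str, str]]:
    """Run lenses in order; after the first failure, mark the rest as skipped."""
    results = []
    blocked = False
    for lens in lenses:
        if blocked:
            # An earlier gate failed, so this lens is not run at all.
            results.append((lens.name, "skipped"))
            continue
        passed = lens.check()
        results.append((lens.name, "passed" if passed else "failed"))
        if not passed:
            blocked = True
    return results
```

Reporting skipped lenses explicitly, rather than dropping them, keeps the trace inspectable: a reader can see which phase blocked the run.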
Always check
- output is valid JSON without repair
- citations map to real inputs
- duplicate hidden-risk findings are suppressed
- contradictory evidence does not become confident prose
- fallback direct-MCP behavior is documented when the A2A path fails
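The first three checks above can be sketched as plain assertions over the raw model output. This is a minimal sketch, not the project's eval harness: the output shape (`findings`, `risk`, `citations`) and the helper names are assumptions made for illustration, and the real contract lives in the docs listed under First read.

```python
import json


def parse_strict(raw: str) -> dict:
    """Parse output as JSON with no repair step (no fence stripping, no trimming)."""
    obj = json.loads(raw)  # raises json.JSONDecodeError (a ValueError) on invalid JSON
    if not isinstance(obj, dict):
        raise ValueError("expected a JSON object at the top level")
    return obj


def unknown_citations(output: dict, input_ids: set) -> list:
    """Return cited IDs that do not correspond to any supplied input."""
    cited = [c for f in output.get("findings", []) for c in f.get("citations", [])]
    return sorted(set(cited) - input_ids)


def duplicate_findings(output: dict) -> list:
    """Return findings that repeat an earlier one (same normalized risk and citations)."""
    seen, dups = set(), []
    for f in output.get("findings", []):
        # Assumed duplicate key: case-insensitive risk text plus the cited IDs.
        key = (f.get("risk", "").strip().lower(), tuple(sorted(f.get("citations", []))))
        if key in seen:
            dups.append(f.get("risk"))
        seen.add(key)
    return dups
```

An eval would then assert that `unknown_citations(...)` and `duplicate_findings(...)` both return empty lists, and that `parse_strict` does not raise.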
Do not
- write evals that assume hidden state
- reward verbosity over inspectability
- ignore null-result behavior for hidden-risk review
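As an illustration of the last point, a null-result eval might assert that a review with nothing to report still returns parseable output with an explicit empty list. The `findings` field and the `check_null_result` helper are hypothetical; the sketch assumes the contract represents "no hidden risks" as an empty list rather than omitted output or free text.

```python
import json


def check_null_result(raw: str) -> list:
    """Return a list of problems for a review that should report no hidden risks."""
    try:
        out = json.loads(raw)
    except json.JSONDecodeError:
        return ["output is not valid JSON"]
    if not isinstance(out, dict) or "findings" not in out:
        return ["missing 'findings' field"]
    if out["findings"] != []:
        return ["expected an empty findings list"]
    return []
```

Returning a list of problems instead of a bare boolean keeps the failure reason visible in the eval trace.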