---
name: review
description: "Review code changes for correctness, performance, and consistency with project conventions."
argument-hint: "PR #N [with multiple models] [and post feedback to GitHub]."
---
# Code Review
Review code changes against conventions established in this repository's `.instructions.md` files and `CONTRIBUTING.md`.
Reviewer mindset: Be polite but skeptical. Your job is to find problems the author may have missed and to question whether the change is a net positive for the codebase. Treat the PR description and linked issues as claims to verify, not facts to accept.
## When to Use This Skill
- Reviewing a PR or code change for the first time
- Checking code for correctness, style, or consistency before submitting a PR
- Validating that a change follows project conventions
- Verifying that previous review feedback was addressed and checking updated code
## Step 0: Gather Code Context (No PR Narrative Yet)
Before analyzing anything, collect as much relevant code context as you can. Do NOT read the PR description, linked issues, or existing review comments yet. Form your own independent assessment of what the code does and why before being exposed to the author's framing.
- Diff and file list: Fetch the full diff and the list of changed files.
- Full source files: For every changed file, read the entire source file, not just the diff hunks. You need the surrounding code to understand invariants, call patterns, and data flow.
- Consumers and callers: If the change modifies a public or internal API, search for callers and usages. Understanding how the code is consumed reveals whether the change could break existing behavior.
- Sibling types and related code: If the change fixes a bug or adds a pattern in one type, check whether sibling types have the same issue or need the same fix.
- Key utility/helper files: If the diff calls into shared utilities, read those to understand the contracts.
- Git history: Check recent commits to the changed files (`git log --oneline -20 -- <file>`). Look for related recent changes, reverts, or prior attempts to fix the same problem.
- Applicable instructions: Read the `.instructions.md` files that apply to the changed files (based on their `applyTo` patterns) and the closest `README.md` and `CONTRIBUTING.md` files.
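Where this skill runs with shell access, the gathering steps above map onto commands like the following. This is a minimal sketch assuming the `gh` CLI and a local checkout; `123`, the file path, and the symbol name are placeholders, and the GitHub MCP tools can supply the same data directly.

```bash
# Diff and changed-file list (123 is a placeholder PR number)
gh pr diff 123
gh pr view 123 --json files --jq '.files[].path'

# Check out the PR head so full source files can be read, not just the hunks
gh pr checkout 123

# Recent history of a changed file: related changes, reverts, prior fix attempts
git log --oneline -20 -- src/changed_file.ext

# Find callers and consumers of a modified API (placeholder symbol name)
git grep -n "ModifiedSymbol"
```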
## Step 1: Form an Independent Assessment
Based only on the code context gathered above (without the PR description or issue), answer:
- What does this change actually do? Describe the behavioral change in your own words. What was the old behavior? What is the new behavior?
- Why might this change be needed? Infer the motivation from the code itself.
- Is this the right approach? Would a simpler alternative work? Could the goal be achieved with existing functionality?
- What problems do you see? Identify bugs, edge cases, missing validation, test gaps, and anything else that concerns you.
Write down your independent assessment before proceeding.
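One possible shape for these notes — a sketch, where every angle-bracketed entry is filled in from your own reading of the code:

```markdown
## Independent assessment (written before reading the PR narrative)
- Behavioral change: <old behavior> → <new behavior>, in your own words
- Inferred motivation: <why the code suggests this change was needed>
- Approach check: <simpler alternative considered, and why it does or doesn't suffice>
- Concerns: <bugs, edge cases, missing validation, test gaps>
```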
### Motivation & Justification
- Every PR must articulate what problem it solves and why.
- Challenge every addition with "Do we need this?"
- Demand real-world use cases. Hypothetical benefits are insufficient motivation.
### Approach & Alternatives
- Check whether the PR solves the right problem at the right layer.
- When a PR takes a fundamentally wrong approach, redirect early.
- Always prefer the simplest solution. The burden of proof is on the complex solution.
### Cost-Benefit & Complexity
- Explicitly weigh whether the change is a net positive.
- Reject over-engineering — complexity is a first-class cost.
- Every addition creates a maintenance obligation.
### Scope & Focus
- Require large or mixed PRs to be split into focused changes. Each PR should address one concern.
- Defer tangential improvements to follow-up PRs.
### Risk & Compatibility
- Flag breaking changes and require documentation.
- Assess regression risk proportional to the change's blast radius.
### Codebase Fit & History
- Ensure new code matches existing patterns and conventions.
- Check whether a similar approach has been tried and rejected before.
## Step 2: Incorporate PR Narrative and Reconcile
Now read the PR description, labels, linked issues, author information, and existing review comments. Treat all of this as claims to verify, not facts to accept.
- PR metadata: Fetch the PR description, labels, linked issues, and author. Read linked issues in full.
- CI status: Fetch the PR status checks. Record which checks passed, failed, or are still pending.
- Related issues: Search for other open issues in the same area.
- Existing review comments: Check if there are already review comments to avoid duplicating feedback.
- Reconcile your assessment with the author's claims. Where your independent reading of the code disagrees with the PR description, investigate further — do not defer to the author's framing.
- Build a thread inventory: For each unresolved and resolved thread, record the file, line, severity, what was requested, the thread ID, and whether it is resolved or unresolved. This is your verification checklist.
- Update your assessment if the additional context genuinely changes your evaluation. Do not soften findings just because the PR description sounds reasonable.
- Validate the PR title and description against your detailed analysis; make sure they are accurate and meet the `CONTRIBUTING.md` guidelines. Report violations as a separate finding.
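With shell access, the narrative and CI context can be pulled with the `gh` CLI. A sketch, with `123`, `OWNER/REPO`, and the search query as placeholders; the GitHub MCP tools expose the same data.

```bash
# PR description, labels, and author (123 is a placeholder PR number)
gh pr view 123 --json title,body,labels,author

# CI status: which checks passed, failed, or are still pending
gh pr checks 123

# Existing review comments, to avoid duplicating feedback
gh api repos/OWNER/REPO/pulls/123/comments --jq '.[] | {path, body}'

# Other open issues in the same area (query is a placeholder)
gh search issues "config caching" --repo OWNER/REPO --state open
```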
## Step 3: Verify Addressed Feedback
For each thread in the inventory (both unresolved and resolved):
- Read the current code at the location the comment refers to. Account for line shifts — the code may have moved due to other changes. Use the comment's context (surrounding code, function name) to locate it.
- Determine the status:
  - Addressed: The code now reflects what was requested (or an equivalent fix the author explained in a reply).
  - Partially addressed: Some aspects were fixed but others remain. Be specific about what's still missing.
  - Not addressed: The code is unchanged or the change doesn't resolve the concern.
  - Superseded: The code was removed or refactored in a way that makes the original comment no longer applicable.
- Check resolved threads for correctness. Authors resolve their own threads after addressing feedback. If a resolved thread was not adequately addressed, re-open it by replying with what remains unresolved.
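Thread state, including whether each thread is resolved, is exposed through GitHub's GraphQL API. One way to pull the raw data for the thread inventory and for this verification pass — a sketch with `OWNER`, `REPO`, and `123` as placeholders:

```bash
gh api graphql -f query='
query {
  repository(owner: "OWNER", name: "REPO") {
    pullRequest(number: 123) {
      reviewThreads(first: 100) {
        nodes {
          id
          isResolved
          comments(first: 5) { nodes { path body } }
        }
      }
    }
  }
}'
```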
## Step 4: Detailed Analysis
Apply these rules in both initial and follow-up reviews.
- Focus on what matters. Prioritize bugs, safety issues, incorrect assumptions, and problems that would affect consumers. Do not comment on trivial style issues unless they violate an explicit rule in the applicable `.instructions.md` files.
- Consider collateral damage. For every changed code path, brainstorm: what other scenarios, callers, or inputs flow through this code? Could any of them break after this change? Surface plausible risks so the author can evaluate.
- Be specific and actionable. Every comment should tell the author exactly what to change and why. Include evidence of how you verified the issue is real.
- Flag severity clearly:
  - [❌](# "Must fix") — Bugs, security issues, test gaps for behavior changes.
  - [⚠️](# "Should fix") — Missing validation, inconsistency with established patterns.
  - [💡](# "Consider changing") — Readability wins, minor improvements.
- Don't pile on. If the same issue appears many times, flag it once with a note listing all affected locations. Don't re-flag issues from the original review that are still tracked in open threads.
- Respect existing style. When modifying existing files, the file's current style takes precedence over general guidelines.
- Don't flag what CI catches. Do not flag issues that a compiler, analyzer, or CI build step would catch.
- Avoid false positives. Before flagging any issue:
  - Verify the concern applies given the full context, not just the diff.
  - Skip theoretical concerns with negligible real-world probability.
  - If unsure, investigate further or surface it explicitly as a low-confidence question.
- Trust the author's context. If a pattern is consistent with the repo, assume it's intentional.
- Never assert that something "does not exist" or "is unavailable" based on training data alone. Ask rather than assert.
- Ensure code suggestions are valid. Any code you suggest must be syntactically correct and complete.
- Label in-scope vs. follow-up. Distinguish between issues the PR should fix and out-of-scope improvements.
- Test quality should be assessed as its own finding when tests are part of the PR.
- CI failures should be reported as their own finding. Include the failed check name and a brief summary of the failure from the logs. When a failure appears to be a flaky test unrelated to the PR, note it in the review and ask the author or a maintainer to re-run the job — the reviewer cannot re-run CI jobs.
## Verdict Rules
- Never give a blanket LGTM when you are unsure. Use "Needs Human Review" instead.
- The verdict must reflect the most severe finding. Only use "Looks Good" when all findings are ✅ or 💡 and you are confident the change is correct.
- For follow-up reviews, the verdict reflects the combined state: all threads resolved and no new issues → "Looks Good"; unresolved threads or new issues → "Needs Changes"; fundamental problems remain → "Still Blocked".
- Separate code correctness from approach completeness. A change can be correct code that is an incomplete approach. The verdict must reflect the gap.
- Classify each ⚠️ and ❌ finding as merge-blocking or advisory.
- Devil's advocate check before finalizing. Re-read all ⚠️ findings. For each one, ask: does this represent an unresolved concern? If so, the verdict must reflect that tension.
## Multi-Model Review (Optional)
When requested, use the `opus`, `gemini`, and `codex` sub-agents to get diverse perspectives. Different models catch different classes of issues. If not requested, proceed with a single-model review using Steps 0–4 above.
- Launch each sub-agent in parallel, giving each the same review prompt: the PR diff, the review rules from this skill, and instructions to produce findings in the severity format defined above.
- Wait for all agents to complete, then synthesize: deduplicate findings across models, elevate issues flagged by multiple models (higher confidence), and include unique findings that meet the confidence bar. If a sub-agent has not completed after 10 minutes and you have results from others, proceed without it.
- Present a single unified review, noting when an issue was flagged by multiple models.
## Review Output
### Assessment Summary
**<✅ Looks Good / ⚠️ Needs Human Review / ⚠️ Needs Changes / ❌ Reject>** followed by a 2-3 sentence summary of the overall verdict and key points. If "Needs Human Review," state which findings you are uncertain about and what a human reviewer should focus on.
### Detailed Assessment
Include the detailed assessment only in the:
- Initial review, to help the human reviewer decide whether investing in the pull request makes sense.
- Final "Looks Good" review, to confirm that the pull request still makes sense after the additional changes.
Don't repeat the detailed assessment in follow-up reviews that request or suggest changes.
### Issues
Always include open issues. Closed issues should be included only in the final "Looks Good" review. Group related findings under a single heading: `### ✅/⚠️/❌ <Category Name> — <Brief description>`. Include specifics — reference code, line numbers, etc.
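Put together, a review body might look like this — the contents are invented placeholders, only the structure matters:

```markdown
**⚠️ Needs Changes** — The caching change is correct on the happy path, but
invalidation is untested and one existing caller still assumes the old lifetime.

### ❌ Correctness — Stale value served after reload
<reference the affected code and line numbers, with evidence of how you verified it>

### 💡 Readability — Extract the repeated timeout constant
<brief, actionable suggestion>
```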
## Post Review to GitHub (Optional)
When requested, post the review as a GitHub PR review with inline comments.
The copilot attribution link `[:copilot:](https://docs.github.com/copilot/responsible-use/code-review)` must appear in the first sentence of every comment and of the review body, on the same line as that sentence, with no line break after it.
```markdown
<!-- Correct: attribution and first sentence on the same line -->
[:copilot:](https://docs.github.com/copilot/responsible-use/code-review) [❌](# "Must fix") The assembly fixture...

<!-- Wrong: attribution on its own line -->
[:copilot:](https://docs.github.com/copilot/responsible-use/code-review)
[❌](# "Must fix") The assembly fixture...
```
Use the GitHub MCP tools:
- Create a pending review — Use `pull_request_review_write` with `method: "create"` (no `event` or `body`).
- Add inline comments — For each new finding, use `add_comment_to_pending_review` to post a comment on the relevant file and line. Use the severity tooltip links defined above, e.g. `[❌](# "Must fix")`, not bare emojis.
- For follow-up reviews, reply to existing threads before creating a new review:
  - For verified threads, use `add_reply_to_pull_request_comment` with a confirmation and a "Thanks!" for the author.
  - For unaddressed threads, reply with what remains, keeping the original severity marker.
  - Create a new pending review only for new findings in updated code.
- Submit the review — Use `pull_request_review_write` with `method: "submit_pending"`.
  - `body`:
    - Include the Summary section of the review.
    - Include the Detailed Assessment section only in the initial and the final reviews.
    - Include in the Issues section only findings that were NOT posted as inline comments. Inline comments already create their own threads — repeating them in the body is redundant.
    - When there are unaddressed threads or findings, tag the author and ask them to take a look.
    - When the author is `copilot-swe-agent`, tag `@copilot` instead — that is the handle Copilot responds to.
  - `event`:
    - `REQUEST_CHANGES` — when the review contains merge-blocking findings.
    - `COMMENT` — otherwise, leaving the approval decision to the human reviewer.
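If the GitHub MCP tools are unavailable, the same flow maps roughly onto the REST API via `gh api`. A sketch under that assumption — `OWNER/REPO`, `123`, the IDs, and the comment contents are placeholders, and exact payload fields should be checked against the GitHub REST docs:

```bash
# Create a pending review with one inline comment (omitting "event" keeps it pending)
gh api repos/OWNER/REPO/pulls/123/reviews --input - <<'EOF'
{
  "comments": [
    {
      "path": "src/file.ext",
      "line": 42,
      "body": "[:copilot:](https://docs.github.com/copilot/responsible-use/code-review) [❌](# \"Must fix\") <finding>"
    }
  ]
}
EOF

# Reply to an existing review-comment thread (COMMENT_ID is a placeholder)
gh api repos/OWNER/REPO/pulls/123/comments/COMMENT_ID/replies -f body='<reply>'

# Submit the pending review (REVIEW_ID is a placeholder)
gh api repos/OWNER/REPO/pulls/123/reviews/REVIEW_ID/events \
  -f event=COMMENT -f body='<summary and remaining issues>'
```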