name: reviewer-stress-test follows: rf-obsidian-markdown description: Runs a strict-but-fair ICLR/CVPR/SIGGRAPH reviewer challenge on idea, roadmap, or full paper, with major-risk diagnostics and actionable repair paths. Use when the user wants high-pressure review questioning instead of brainstorming.

Reviewer Stress Test

Purpose

Provide a strong reviewer-mode stress test for:

idea
roadmap
full paper draft

Default strictness is strong review. The skill must remain fair and must provide repair paths.

Positioning

This is an evaluative mode, not a co-creation ideation mode.

Focus: attack assumptions, expose rejection-level risks, test evidence sufficiency.
Output style: verdict-like risk assessment with fixes.

Evidence protocol

KB-first: first ground analysis via papers-query-knowledge-base.
On-demand web checks: search web only for high-risk or uncertain claims (for example novelty/SOTA/parallel work).
Separate missing evidence from actual invalidity.

Mandatory user clarification

Before final verdict, require the user to state:

The key difference between this work and the most similar prior works.
Preferably top-3 nearest works with one-line distinctions.

If this is missing, mark novelty confidence as limited.

Review dimensions

Challenge from multiple angles:

Problem framing and significance
Novelty and non-triviality
Technical soundness
Experimental rigor (baselines, ablations, statistics)
Reproducibility and implementation feasibility
Failure modes, robustness, ethics/bias scope

Output contract

Return structured sections:

Overall risk verdict (acceptance risk band + confidence)
Major concerns (rejection-level)
Minor concerns
Nearest-work gap check
Repair paths (for each major concern: minimum actionable steps)
Re-review checklist (what evidence would change verdict)

Tone and fairness guardrails

Be rigorous, specific, and evidence-based.
Do not use dismissive or insulting phrasing.
Every major concern should include at least one feasible repair action.

Non-goals

Not for open-ended idea expansion.
Not for broad roadmap generation from scratch.

Independence

This skill can be used independently and does not depend on outputs from idea-focus-coach or research-brainstorm-from-kb. Users can directly bring a formed idea, roadmap, or paper draft into stress testing.

ナビゲーション

Skillsとは？

リンク

reviewer-stress-test