name: reviewer-stress-test
follows: rf-obsidian-markdown
description: Runs a strict-but-fair ICLR/CVPR/SIGGRAPH reviewer challenge on idea, roadmap, or full paper, with major-risk diagnostics and actionable repair paths. Use when the user wants high-pressure review questioning instead of brainstorming.
Reviewer Stress Test
Purpose
Provide a strong reviewer-mode stress test for:
- idea
- roadmap
- full paper draft
The default strictness level is strong review. The skill must remain fair and must provide repair paths.
Positioning
This is an evaluative mode, not a co-creation ideation mode.
- Focus: attack assumptions, expose rejection-level risks, test evidence sufficiency.
- Output style: verdict-like risk assessment with fixes.
Evidence protocol
- KB-first: first ground analysis via papers-query-knowledge-base.
- On-demand web checks: search web only for high-risk or uncertain claims (for example novelty/SOTA/parallel work).
- Separate missing evidence from actual invalidity.
Mandatory user clarification
Before final verdict, require the user to state:
- The key difference between this work and the most similar prior works.
- Preferably top-3 nearest works with one-line distinctions.
If this is missing, mark novelty confidence as limited.
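The clarification rule above can be sketched as a small check. This is a minimal illustration, not part of the skill itself: the names `NoveltyClarification` and `novelty_confidence_is_limited` are hypothetical, and the sketch assumes that only the key-difference statement is required, while the top-3 nearest works are preferred but optional.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class NoveltyClarification:
    """Hypothetical container for what the user states before the final verdict."""
    # Key difference between this work and the most similar prior works.
    key_difference: str = ""
    # Preferably up to three (work title, one-line distinction) pairs.
    nearest_works: List[Tuple[str, str]] = field(default_factory=list)


def novelty_confidence_is_limited(c: NoveltyClarification) -> bool:
    """Return True when the required key-difference statement is missing.

    Assumption: the nearest-works list is preferred but does not gate the result.
    """
    return not c.key_difference.strip()


if __name__ == "__main__":
    missing = NoveltyClarification()
    stated = NoveltyClarification(key_difference="Replaces the learned prior with a closed-form one")
    print(novelty_confidence_is_limited(missing))
    print(novelty_confidence_is_limited(stated))
```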
Review dimensions
Challenge from multiple angles:
- Problem framing and significance
- Novelty and non-triviality
- Technical soundness
- Experimental rigor (baselines, ablations, statistics)
- Reproducibility and implementation feasibility
- Failure modes, robustness, ethics/bias scope
Output contract
Return structured sections:
- Overall risk verdict (acceptance risk band + confidence)
- Major concerns (rejection-level)
- Minor concerns
- Nearest-work gap check
- Repair paths (for each major concern: minimum actionable steps)
- Re-review checklist (what evidence would change verdict)
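One possible way to picture the sections listed above is as a typed record. The sketch below is an assumption for illustration only: the class names, field names, and the helper `concerns_missing_repairs` are hypothetical, and the skill does not prescribe any particular data format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class MajorConcern:
    """A rejection-level concern with its minimum actionable repair steps."""
    summary: str
    repair_steps: List[str] = field(default_factory=list)


@dataclass
class ReviewReport:
    """Hypothetical container mirroring the structured sections of the output."""
    risk_band: str                      # acceptance risk band
    confidence: str                     # confidence in the verdict
    major_concerns: List[MajorConcern] = field(default_factory=list)
    minor_concerns: List[str] = field(default_factory=list)
    nearest_work_gaps: List[str] = field(default_factory=list)
    re_review_checklist: List[str] = field(default_factory=list)


def concerns_missing_repairs(report: ReviewReport) -> List[str]:
    """Return summaries of major concerns that have no repair step attached."""
    return [c.summary for c in report.major_concerns if not c.repair_steps]


if __name__ == "__main__":
    report = ReviewReport(
        risk_band="high",
        confidence="medium",
        major_concerns=[
            MajorConcern("No comparison against the strongest baseline",
                         ["Add the baseline under the same training budget"]),
            MajorConcern("Ablations do not isolate the main component"),
        ],
    )
    # Lists the concern that still needs a repair path.
    print(concerns_missing_repairs(report))
```

A check like `concerns_missing_repairs` is one way to catch a report in which a major concern was raised without a repair path before the report is returned.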
Tone and fairness guardrails
- Be rigorous, specific, and evidence-based.
- Do not use dismissive or insulting phrasing.
- Every major concern should include at least one feasible repair action.
Non-goals
- Not for open-ended idea expansion.
- Not for broad roadmap generation from scratch.
Independence
This skill can be used independently and does not depend on outputs from idea-focus-coach or research-brainstorm-from-kb. Users can directly bring a formed idea, roadmap, or paper draft into stress testing.