Evaluate and improve Claude Code commands, skills, and agents. Use when testing prompt effectiveness, validating context engineering choices, or measuring improvement quality.
Skills(SKILL.md)は、AIエージェント(Claude Code、Cursor、Codexなど)に特定の能力を追加するための設定ファイルです。
詳しく見る →Evaluate and improve Claude Code commands, skills, and agents. Use when testing prompt effectiveness, validating context engineering choices, or measuring improvement quality.
Comprehensive guide for skill development based on Anthropic's official best practices - use for complex skills requiring detailed structure
Interactive assistant for creating new Claude commands with proper structure, patterns, and MCP tool integration
Create and configure git hooks with intelligent project analysis, suggestions, and automated testing
Guide for creating effective skills. This command should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations. Use when creating new skills, editing existing skills, or verifying skills work before deployment - applies TDD to process documentation by testing with subagents before writing, iterating until bulletproof against rationalization
Create a workflow command that orchestrates multi-step execution through sub-agents with file-based task prompts
Use when creating or editing any prompt (commands, hooks, skills, subagent instructions) to verify it produces desired behavior - applies RED-GREEN-REFACTOR cycle to prompt engineering using subagents for isolated testing
Update and maintain project documentation for local code changes using multi-agent workflow with tech-writer agents. Covers docs/, READMEs, JSDoc, and API documentation.
Load all open issues from GitHub and save them as markdown files
Curates insights from reflections and critiques into CLAUDE.md using Agentic Context Engineering
Launch an intelligent sub-agent with automatic model selection based on task complexity, specialized agent matching, Zero-shot CoT reasoning, and mandatory self-critique verification
Design multi-agent architectures for complex tasks. Use when single-agent context limits are exceeded, when tasks decompose naturally into subtasks, or when specializing agents improves quality.
creates draft task file in .specs/tasks/draft/ with original user intent
Implement a task with automated LLM-as-Judge verification for critical steps
Refine, parallelize, and verify a draft task specification into a fully planned implementation-ready task
Systematically fix all failing tests after business logic changes or refactoring
Use when implementing any feature or bugfix, before writing implementation code - write the test first, watch it fail, write minimal code to pass; ensures tests actually verify behavior by requiring failure first
Refactor high-complexity React components in Dify frontend. Use when `pnpm analyze-component --json` shows complexity > 50 or lineCount > 300, when the user asks for code splitting, hook extraction, or complexity reduction, or when `pnpm analyze-component` warns to refactor before testing; avoid for simple/well-structured components, third-party wrappers, or when the user explicitly wants testing without refactoring.
Design comprehensive test cases using PICT (Pairwise Independent Combinatorial Testing) for any piece of requirements or code. Analyzes inputs, generates PICT models with parameters, values, and constraints for valid scenarios using pairwise testing. Outputs the PICT model, markdown table of test cases, and expected results.
AWS Aurora Serverless v2, RDS Proxy, Data API, connection pooling
Analyze existing repositories, maintain structure, setup guardrails and best practices
TDD iteration loops using Claude Code Stop hooks - runs tests after each response, feeds failures back automatically
AI-first application patterns, LLM testing, prompt management
E2E testing with Playwright - Page Objects, cross-browser, CI/CD
PostHog analytics, event tracking, feature flags, dashboards
React web development with hooks, React Query, Zustand
OWASP security patterns, secrets management, security testing
Create Jira/Asana/Linear tickets optimized for Claude Code execution - AI-native ticket writing
Visual testing - catch invisible buttons, broken layouts, contrast
User experience flows - journey mapping, UX validation, error recovery
'Expert guidance on Swift library and framework design. Use when developers mention: (1) designing a Swift library or framework, (2) public API design patterns, (3) protocol-oriented architecture or associated types, (4) result builders or DSL design, (5) performance optimization for libraries, (6) @inlinable or @usableFromInline, (7) noncopyable types for APIs, (8) progressive disclosure in API design, (9) ResponseGenerator or builder patterns.'
Elite website image-to-code skill for Codex. For visually important web tasks, it must first generate the design image(s) itself, deeply analyze them, then implement the website to match them as closely as possible. In Codex, it must prefer large, readable, section-specific images instead of tiny compressed boards, generate fresh standalone images for sections or detail views instead of cropping old ones, avoid lazy under-generation, avoid cards-inside-cards-inside-cards UI, and keep the hero clean, spacious, readable, and visible on a small laptop.
Migrate test files from `as` type assertions to @total-typescript/shoehorn. Use when user mentions shoehorn, wants to replace `as` in tests, or needs partial test data.
Create new agent skills with proper structure, progressive disclosure, and bundled resources. Use when user wants to create, write, or build a new skill.
Create forensically sound bit-for-bit disk images using dd and dcfldd while preserving evidence integrity through
Analyze advanced persistent threat (APT) group techniques using MITRE ATT&CK Navigator to create layered heatmaps
'Analyzes malware command-and-control (C2) communication protocols to understand beacon patterns, command structures,
'Analyzes DNS query logs to detect data exfiltration via DNS tunneling, DGA domain communication, and covert
Parse and analyze email headers to trace the origin of phishing emails, verify sender authenticity, and identify
Reverse engineer Go-compiled malware using Ghidra with specialized scripts for function recovery, string extraction,
'Performs runtime mobile security exploration of iOS applications using Objection, a Frida-powered toolkit that
'Uses the Linux Audit framework (auditd) with ausearch and aureport utilities to detect intrusion attempts, unauthorized
'Analyzes network traffic captures and flow data to identify adversary activity during security incidents, including
'Captures and analyzes network packet data using Wireshark and tshark to identify malicious traffic patterns,
'Leverages Splunk Enterprise Security and SPL (Search Processing Language) to investigate security incidents
Parse Windows LNK shortcut files to extract target paths, timestamps, volume information, and machine identifiers
'This skill covers deploying Microsoft Sentinel as a cloud-native SIEM and SOAR platform for centralized security
'Builds vendor-agnostic detection rules using the Sigma rule format for threat detection across SIEM platforms
'Designs and documents structured incident response playbooks that define step-by-step procedures for specific
Establish a structured operational process to triage, test, and deploy Microsoft Patch Tuesday security updates