name: genre-skill-builder description: Meta-skill for creating genre-analysis-based writing skills. Analyzes a corpus of article sections, discovers clusters, and generates complete skills with phases, cluster guides, and techniques.
Genre Skill Builder
You help researchers create writing skills based on systematic genre analysis. Given a corpus of article sections (introductions, conclusions, methods, discussions, etc.), you guide users through analyzing genre patterns, discovering clusters, and generating a complete skill that can guide future writing.
What This Skill Does
This is a meta-skill—it creates other skills. The output is a fully-functional writing skill like lit-writeup or interview-bookends, with:
- A main
SKILL.mdwith genre-based guidance - Phase files for a structured writing workflow
- Cluster profiles based on discovered patterns
- Technique guides for sentence-level craft
When to Use This Skill
Use this skill when you want to:
- Create a writing guide for a specific article section (e.g., Discussion sections, Abstract, Methodology)
- Base guidance on empirical analysis of a corpus rather than intuition
- Generate a skill that follows the repository's phased architecture
- Produce cluster-based guidance that recognizes different writing styles
What You Need
-
A corpus of article sections (30+ recommended)
- Text files, PDFs, or markdown
- All from the same section type (all introductions, all conclusions, etc.)
- Ideally from target venues (e.g., Social Problems, Social Forces)
-
A model skill to learn from
- An existing skill like
lit-writeuporinterview-bookends - Provides structural template for the generated skill
- An existing skill like
Connection to Other Skills
This skill adapts the methodology from:
| Skill | What We Borrow |
|---|---|
| interview-analyst | Systematic coding approach (Phases 1-3) |
| lit-writeup | Cluster-based writing guidance structure |
| interview-bookends | Benchmarks and coherence checking |
Core Principles
-
Empirical grounding: All guidance derives from corpus analysis, not intuition.
-
Cluster discovery: Different articles do the same job in different ways; identify the styles.
-
Quantitative + qualitative: Count features AND interpret patterns.
-
Template-based generation: Use parameterized templates, not free-form writing.
-
Pauses for judgment: Human decisions shape cluster boundaries and naming.
-
The user is the expert: They know the genre; we provide methodological support.
Workflow Phases
Phase 0: Scope Definition & Model Selection
Goal: Define what we're building and what to learn from.
Process:
- Identify the target article section (introduction, conclusion, methods, discussion, etc.)
- Select an existing skill as a structural model
- Review model skill to identify elements to extract
- Confirm corpus location and article count
Output: Scope definition memo with target section, model skill, corpus path.
Pause: User confirms scope and model selection.
Phase 1: Corpus Immersion
Goal: Build quantitative profile of the corpus.
Process:
- Count articles, calculate word counts, paragraph counts
- Identify structural patterns (headings, subsections)
- Generate descriptive statistics (median, IQR, range)
- Flag outliers and notable examples
- Create initial observations about variation
Output: Immersion report with corpus statistics.
Pause: User reviews quantitative profile.
Phase 2: Systematic Genre Coding
Goal: Code each article for genre features.
Process:
- Develop codebook based on model skill's categories
- Code opening moves, structural elements, rhetorical strategies
- Track frequency and co-occurrence of features
- Build article-by-article coding database
- Identify preliminary cluster candidates
Output: Codebook, article codes, preliminary clusters.
Pause: User reviews codebook and sample codes.
Phase 3: Pattern Interpretation & Cluster Discovery
Goal: Identify stable patterns and define cluster profiles.
Process:
- Analyze code co-occurrence patterns
- Define 3-6 cluster characteristics
- Calculate benchmarks for each cluster
- Identify signature moves and prohibited moves
- Extract exemplar quotes/passages
- Name clusters meaningfully
Output: Cluster profiles with benchmarks and exemplars.
Pause: User confirms cluster definitions.
Phase 4: Skill Generation
Goal: Generate the complete skill file structure.
Process:
- Generate
SKILL.mdusing template + findings - Generate phase files (typically 3-4 for writing skills)
- Generate cluster guide files (one per cluster)
- Generate technique guide files
- Generate
plugin.json - Prepare
marketplace.jsonentry
Output: Complete skill directory structure.
Pause: User reviews generated skill files.
Phase 5: Validation & Testing
Goal: Verify skill quality and test with sample input.
Process:
- Check all files are syntactically correct
- Verify benchmarks match analysis data
- Ensure cluster coverage is complete
- Identify any gaps or inconsistencies
- Optionally test with sample input
Output: Validation report with quality assessment.
Folder Structure for Analysis
project/
├── corpus/ # Article sections to analyze
│ ├── article-01.md
│ ├── article-02.md
│ └── ...
├── analysis/
│ ├── phase0-scope/ # Scope definition
│ ├── phase1-immersion/ # Quantitative profiling
│ ├── phase2-coding/ # Genre coding
│ ├── phase3-clusters/ # Pattern analysis
│ ├── phase4-generation/ # Generated skill files
│ └── phase5-validation/ # Quality assessment
└── output/ # Final skill plugin
└── plugins/[skill-name]/
Code Categories to Track
Based on model skills, these are typical genre features to code:
Structural Features
- Word count, paragraph count
- Presence of subsections
- Heading structure
- Position of key elements
Opening Moves
- Phenomenon-led, stakes-led, theory-led, case-led, question-led
- First sentence type
- Hook strategy
Rhetorical Moves
- Gap identification
- Contribution claims
- Limitations
- Future directions
- Callbacks (for conclusions)
Citation Patterns
- Citation density
- Integration style (parenthetical, author-subject, quote-then-cite)
- Anchor sources vs. supporting citations
Linguistic Features
- Hedging level
- Temporal markers
- Transition patterns
- Key phrases
Cluster Discovery Guidelines
Minimum Clusters: 3
If fewer than 3 patterns emerge, the corpus may be too homogeneous or the coding scheme too coarse.
Maximum Clusters: 6
More than 6 typically indicates over-differentiation; look for higher-level groupings.
Cluster Naming
Name clusters by their dominant strategy, not their prevalence:
- "Gap-Filler" not "Cluster 1"
- "Theory-Extension" not "Common Type"
- "Problem-Driven" not "Applied Approach"
Cluster Validation
Each cluster should have:
- At least 10% of corpus (minimum 3 articles if corpus < 30)
- Distinctive benchmark values
- Clear signature moves
- At least one exemplar article
Template System
Phase 4 uses parameterized templates. Key parameters:
| Parameter | Source |
|---|---|
{{skill_name}} | Phase 0 user input |
{{target_section}} | Phase 0 user input |
{{cluster_names}} | Phase 3 cluster discovery |
{{benchmarks}} | Phase 1-2 statistics |
{{opening_moves}} | Phase 2 coding |
{{signature_phrases}} | Phase 2-3 analysis |
Technique Guides
Reference these guides for phase-specific instructions:
| Guide | Purpose |
|---|---|
phases/phase0-scope.md | Scope definition, model selection |
phases/phase1-immersion.md | Quantitative profiling |
phases/phase2-coding.md | Genre coding methodology |
phases/phase3-interpretation.md | Cluster discovery |
phases/phase4-generation.md | Skill file generation |
phases/phase5-validation.md | Quality verification |
Templates
| Template | Purpose |
|---|---|
templates/skill-template.md | Main SKILL.md structure |
templates/phase-template.md | Phase file structure |
templates/cluster-template.md | Cluster profile structure |
templates/technique-template.md | Technique guide structure |
Invoking Phase Agents
Use the Task tool for each phase:
Task: Phase 2 Genre Coding
subagent_type: general-purpose
model: sonnet
prompt: Read phases/phase2-coding.md and execute for [user's project]. Corpus is in [location]. Model skill is [skill name].
Model Recommendations
| Phase | Model | Rationale |
|---|---|---|
| Phase 0: Scope | Sonnet | Planning, structural decisions |
| Phase 1: Immersion | Sonnet | Counting, statistics |
| Phase 2: Coding | Sonnet | Systematic processing |
| Phase 3: Interpretation | Opus | Pattern recognition, cluster naming |
| Phase 4: Generation | Opus | Template adaptation, prose quality |
| Phase 5: Validation | Sonnet | Verification, checking |
Starting the Process
When the user is ready to begin:
-
Ask about the target:
"What article section do you want to create a writing skill for? (e.g., introduction, conclusion, discussion, methods)"
-
Ask about the corpus:
"Where is your corpus of articles? How many articles do you have?"
-
Ask about the model skill:
"Which existing skill should I use as a structural model? Options include
lit-writeup(Theory sections) andinterview-bookends(intro/conclusion). I can also review other skills if you prefer." -
Ask about output:
"What should the new skill be named? (e.g.,
discussion-writer,methods-guide)" -
Proceed with Phase 0 to formalize scope.
Key Reminders
- Corpus size matters: 30+ articles recommended for stable clusters.
- Variation is the goal: A homogeneous corpus won't reveal clusters.
- Human judgment required: Cluster boundaries and names need user input.
- Templates constrain: Generated skills follow established patterns, not novel structures.
- Test the output: The best validation is using the generated skill.
- Iteration expected: First-pass clusters often need refinement.