Week 4 — The Autonomous Coding Agent IRL

We recommend reading this entire document before getting started.

This week, your task is to build at least 2 automations within the context of this repository using any combination of the following Claude Code features:

Custom skills (checked into .claude/skills/*/SKILL.md)
CLAUDE.md files for repository or context guidance
Claude SubAgents (role-specialized agents working together)
MCP servers integrated into Claude Code

Your automations should meaningfully improve a developer workflow – for example, by streamlining tests, documentation, refactors, or data-related tasks. You will then use the automations you create to expand upon the starter application found in week4/.

Learn about Claude Code

To gain a deeper understanding of Claude Code and explore your automation options, please read through the following resources:

Claude Code best practices: anthropic.com/engineering/claude-code-best-practices
SubAgents overview: docs.anthropic.com/en/docs/claude-code/sub-agents
Skills documentation: code.claude.com/docs/en/skills

Explore the Starter Application

The starter is a minimal full-stack application designed to be a "developer's command center". It includes:

FastAPI backend with SQLite (SQLAlchemy)
Static frontend (no Node toolchain needed)
Minimal tests (pytest)
Pre-commit (black + ruff)
Tasks to practice agent-driven workflows

Use this application as your playground to experiment with the Claude automations you build.

Structure

backend/                # FastAPI app
frontend/               # Static UI served by FastAPI
data/                   # SQLite DB + seed
docs/                   # TASKS for agent-driven workflows

Quickstart

Activate your conda environment.

conda activate cs146s

(Optional) Install pre-commit hooks

pre-commit install

Run the app (from week4/ directory)

make run

Open http://localhost:8000 for the frontend and http://localhost:8000/docs for the API docs.
Play around with the starter application to get a feel for its current features and functionality.

Testing

Run the tests (from week4/ directory)

make test

Formatting/Linting

make format
make lint

Part I: Build Your Automation (Choose 2 or more)

Now that you're familiar with the starter application, your next step is to build automations to enhance or extend it. Below are several automation options you can choose from. You can mix and match across categories.

As you build your automations, document your changes in the writeup.md file. Leave the "How you used the automation to enhance the starter application" section empty for now; you will return to it in Part II of the assignment.

A) Claude custom skills

Skills are reusable capabilities defined as SKILL.md files inside .claude/skills/*/. Each skill starts with YAML frontmatter containing a name and description, followed by markdown instructions. Claude Code exposes each skill as /name and can also auto-invoke it when the description matches the current task.

Note: Custom slash commands (.claude/commands/*.md) have been merged into skills. Skills are the current recommended approach, offering the same /-invocation plus additional features: YAML frontmatter for auto-discovery, a directory structure for supporting files, and execution control options. See the skills documentation for details.

Example 1: Test runner with coverage
- Path: .claude/skills/tests/SKILL.md
- Frontmatter: name: tests, description: Run tests and coverage. Use when running tests or verifying code changes.
- Intent: Run pytest -q backend/tests --maxfail=1 (stop at the first failure; -x is an equivalent shorthand) and, if green, run coverage.
- Inputs: Optional marker or path via $ARGUMENTS.
- Output: Summarize failures and suggest next steps.
Example 2: Docs sync
- Path: .claude/skills/docs-sync/SKILL.md
- Frontmatter: name: docs-sync, description: Sync API docs with OpenAPI spec. Use when API routes change or docs need updating.
- Intent: Read /openapi.json, update docs/API.md, and list route deltas.
- Output: Diff-like summary and TODOs.
Example 3: Refactor harness
- Path: .claude/skills/refactor-module/SKILL.md
- Frontmatter: name: refactor-module, description: Rename a module and update all references. Use when restructuring code.
- Intent: Rename a module (e.g., services/extract.py → services/parser.py), update imports, run lint/tests.
- Output: A checklist of modified files and verification steps.

Tips: Keep skills focused, use $ARGUMENTS for inputs, and prefer idempotent steps. Use disable-model-invocation: true in frontmatter if you want to prevent Claude from auto-invoking the skill. You can also place supporting files (templates, scripts) alongside SKILL.md in the skill directory.
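A skill file with the frontmatter described above can be scaffolded with a few lines of Python. This is a minimal sketch, not an official tool: the helper names render_skill and write_skill are hypothetical, and the body text is only one possible wording of Example 1. The frontmatter keys used here (name, description, disable-model-invocation) are the ones mentioned in this section; consult the skills documentation for the full set.

```python
from pathlib import Path


def render_skill(name: str, description: str, body: str,
                 disable_model_invocation: bool = False) -> str:
    """Return SKILL.md text: YAML frontmatter followed by markdown instructions.

    Assumes name and description contain no YAML special characters
    (such as ':'); quote them yourself if they do.
    """
    lines = ["---", f"name: {name}", f"description: {description}"]
    if disable_model_invocation:
        # Opt out of auto-invocation, per the tip above.
        lines.append("disable-model-invocation: true")
    lines.append("---")
    return "\n".join(lines) + "\n\n" + body.strip() + "\n"


def write_skill(root: Path, name: str, description: str, body: str) -> Path:
    """Write <root>/.claude/skills/<name>/SKILL.md and return its path."""
    path = root / ".claude" / "skills" / name / "SKILL.md"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(render_skill(name, description, body), encoding="utf-8")
    return path


# Illustrative instructions for the "tests" skill from Example 1.
TESTS_BODY = """
Run `pytest -q backend/tests --maxfail=1 $ARGUMENTS`.
If the run is green, run coverage.
Summarize any failures and suggest next steps.
"""
```

Calling write_skill(Path("."), "tests", "Run tests and coverage. Use when running tests or verifying code changes.", TESTS_BODY) from the repository root would create .claude/skills/tests/SKILL.md. Re-running it rewrites the same file, which keeps the step idempotent.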

B) CLAUDE.md guidance files

Claude Code automatically reads CLAUDE.md when a conversation starts, which makes it the place for repository-specific instructions, context, and guidance. Create a CLAUDE.md in the repo root (and optionally in week4/ subfolders) to shape Claude's behavior.

Example 1: Code navigation and entry points
- Include: How to run the app, where routers live (backend/app/routers), where tests live, how the DB is seeded.
Example 2: Style and safety guardrails
- Include: Tooling expectations (black/ruff), safe commands to run, commands to avoid, and lint/test gates.
Example 3: Workflow snippets
- Include: "When asked to add an endpoint, first write a failing test, then implement, then run pre-commit."

Tips: Iterate on CLAUDE.md like a prompt, keep it concise and actionable, and document custom tools/scripts you expect Claude to use.
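To make the three examples above concrete, here is a small sketch that writes a starter CLAUDE.md. The content is assembled only from commands and paths named elsewhere in this document; the section headings, the helper name write_claude_md, and the overwrite guard are assumptions for illustration, not a required layout.

```python
from pathlib import Path

# Starter content built from the commands and paths named in this document.
# The headings and their order are illustrative choices.
CLAUDE_MD = """\
# Repository guidance

## Commands (run from week4/)
- make run: start the app at http://localhost:8000
- make test: run the pytest suite
- make format / make lint: formatting and linting (black + ruff)

## Where things live
- Routers: backend/app/routers
- Tests: backend/tests
- Seed data: data/seed.sql

## Workflow
- When asked to add an endpoint, first write a failing test,
  then implement, then run pre-commit.
"""


def write_claude_md(root: Path, overwrite: bool = False) -> bool:
    """Write CLAUDE.md under root; return False if it exists and overwrite is off."""
    target = root / "CLAUDE.md"
    if target.exists() and not overwrite:
        return False  # avoid clobbering hand-edited guidance by accident
    target.write_text(CLAUDE_MD, encoding="utf-8")
    return True
```

Generating the file once and then editing it by hand is probably the more realistic workflow; the overwrite guard exists so a second run does not discard those edits.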

C) SubAgents (role-specialized)

SubAgents are specialized AI assistants configured to handle specific tasks with their own system prompts, tools, and context. Design two or more cooperating agents, each responsible for a distinct step in a single workflow.

Example 1: TestAgent + CodeAgent
- Flow: TestAgent writes/updates tests for a change → CodeAgent implements code to pass tests → TestAgent verifies.
Example 2: DocsAgent + CodeAgent
- Flow: CodeAgent adds a new API route → DocsAgent updates API.md and TASKS.md and checks drift against /openapi.json.
Example 3: DBAgent + RefactorAgent
- Flow: DBAgent proposes a schema change (adjust data/seed.sql) → RefactorAgent updates models/schemas/routers and fixes lints.

Tips: Use checklists/scratchpads, reset context (/clear) between roles, and run agents in parallel for independent tasks.
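The "checks drift against /openapi.json" step in Example 2 can be backed by a small deterministic helper that a DocsAgent runs instead of comparing routes by eye. The sketch below assumes the docs mention routes in the form "METHOD /path" (for example, GET /notes); that convention, the function names, and the sample routes in the usage note are all hypothetical.

```python
import re

HTTP_METHODS = {"get", "post", "put", "patch", "delete"}


def spec_routes(openapi: dict) -> set[str]:
    """Collect 'METHOD /path' strings from an OpenAPI document's paths object."""
    return {
        f"{method.upper()} {path}"
        for path, item in openapi.get("paths", {}).items()
        for method in item
        if method in HTTP_METHODS
    }


def documented_routes(markdown: str) -> set[str]:
    """Find 'METHOD /path' mentions in the docs text (assumed convention)."""
    pattern = re.compile(r"\b(GET|POST|PUT|PATCH|DELETE)\s+(/[^\s`)]*)")
    # Strip trailing sentence punctuation picked up from prose.
    return {f"{m} {p.rstrip('.,;:')}" for m, p in pattern.findall(markdown)}


def route_drift(openapi: dict, markdown: str) -> tuple[list[str], list[str]]:
    """Return (routes missing from the docs, documented routes not in the spec)."""
    spec, docs = spec_routes(openapi), documented_routes(markdown)
    return sorted(spec - docs), sorted(docs - spec)
```

For instance, given a spec with GET and POST on /notes and GET on /notes/{note_id}, and docs that mention only GET /notes and DELETE /notes/{note_id}, route_drift reports the two undocumented routes in the first list and the stale DELETE entry in the second. The agent can then turn both lists into TODOs.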

Part II: Put Your Automations to Work

Now that you've built 2+ automations, let's put them to use! In writeup.md, under the section "How you used the automation to enhance the starter application", describe how you leveraged each automation to improve or extend the app's functionality.

For example, if you implemented the custom skill /generate-test-cases, explain how you used it to interact with and test the starter application.

Deliverables

Two or more automations, which may include:
- Skills in .claude/skills/*/SKILL.md
- CLAUDE.md files
- SubAgent prompts/configuration (documented clearly, files/scripts if any)
A write-up writeup.md under week4/ that includes:

Design inspiration (e.g. cite the best-practices, sub-agents, and/or skills docs)
Design of each automation, including goals, inputs/outputs, steps
How to run it (exact commands), expected outputs, and rollback/safety notes
Before vs. after (i.e. manual workflow vs. automated workflow)
How you used the automation to enhance the starter application

SUBMISSION INSTRUCTIONS

Make sure you have all changes pushed to your remote repository for grading.
Make sure you've added both brentju and febielin as collaborators on your assignment repository.
Submit via Gradescope.
