Usage with AI Agents | 与 AI 智能体一起使用

官方文档: https://github.com/vercel-labs/agent-browser/blob/main/README.md

Instructions

This example demonstrates how to integrate agent-browser with AI agents.

Key Concepts

Integration with AI agents
Just ask the agent approach
AGENTS.md / CLAUDE.md instructions
Claude Code skill
Optimal workflow patterns

Example: Just Ask the Agent

The simplest approach - just tell your agent to use it:

Use agent-browser to test the login flow. Run agent-browser --help to see available commands.

The --help output is comprehensive and most agents can figure it out from there.

Example: AGENTS.md / CLAUDE.md

For more consistent results, add to your project or global instructions file:

## Browser Automation

Use `agent-browser` for web automation. Run `agent-browser --help` for all commands.

Core workflow:
1. `agent-browser open <url>` - Navigate to page
2. `agent-browser snapshot -i` - Get interactive elements with refs (@e1, @e2)
3. `agent-browser click @e1` / `fill @e2 "text"` - Interact using refs
4. Re-snapshot after page changes

Example: Claude Code Skill

For Claude Code, a skill provides richer context:

Option 1: Copy from node_modules

cp -r node_modules/agent-browser/skills/agent-browser .claude/skills/

Option 2: Download directly

mkdir -p .claude/skills/agent-browser
curl -o .claude/skills/agent-browser/SKILL.md \
  https://raw.githubusercontent.com/vercel-labs/agent-browser/main/skills/agent-browser/SKILL.md

Example: Optimal AI Workflow

# 1. Navigate and get snapshot
agent-browser open example.com
agent-browser snapshot -i --json   # AI parses tree and refs

# 2. AI identifies target refs from snapshot
# 3. Execute actions using refs
agent-browser click @e2
agent-browser fill @e3 "input text"

# 4. Get new snapshot if page changed
agent-browser snapshot -i --json

Example: Integration Patterns

Pattern 1: Simple Automation

# Agent executes commands sequentially
agent-browser open example.com
agent-browser snapshot -i
agent-browser click @e1
agent-browser fill @e2 "value"
agent-browser click @e3

Pattern 2: With Error Handling

# Agent checks for errors
agent-browser open example.com || echo "Failed to open"
agent-browser snapshot -i || echo "Failed to snapshot"
agent-browser click @e1 || echo "Failed to click"

Pattern 3: With JSON Output

# Agent parses JSON responses
agent-browser snapshot -i --json | jq '.data.refs'
agent-browser get text @e1 --json | jq '.data'

Key Points

Just ask the agent - simplest approach
Add instructions to AGENTS.md / CLAUDE.md for consistency
Use Claude Code skill for richer context
Follow optimal workflow: snapshot → identify refs → interact → re-snapshot
Use --json flag for machine-readable output
Combine with error handling for robust automation

ナビゲーション

Skillsとは？

リンク

Usage with AI Agents | 与 AI 智能体一起使用

Usage with AI Agents | 与 AI 智能体一起使用

Instructions

Key Concepts

Example: Just Ask the Agent

Example: AGENTS.md / CLAUDE.md

Example: Claude Code Skill

Example: Optimal AI Workflow

Example: Integration Patterns

Key Points

関連スキル(📄 ドキュメント)