/install agent-browser-assistant
Agent Browser Assistant
An intelligent browser control assistant providing browser automation, data scraping, and testing capabilities.
Use Cases
Opening web pages, clicking/typing/scrolling, taking screenshots/recordings, extracting web content, exporting table data, automated form filling, batch operations, scheduled tasks, login authentication, UI testing, regression testing.
Quick Start
Use the browser tool for all browser operations:
# Open a web page
browser(action="open", url="https://example.com")
# Take a screenshot
browser(action="screenshot")
# Click an element
browser(action="act", kind="click", ref="button-submit")
# Type text
browser(action="act", kind="type", ref="input-username", text="[email protected]")
# Scroll the page
browser(action="act", kind="scroll", y=500)
# Get a page snapshot
browser(action="snapshot")
Core Capabilities
1. Page Operations
| Operation | Description | Example |
|---|---|---|
| open | Open a specified URL | action="open", url="..." |
| snapshot | Get page structure | action="snapshot" |
| screenshot | Take a page screenshot | action="screenshot" |
| navigate | Navigate to a URL | action="navigate", url="..." |
| close | Close a tab | action="close", targetId="..." |
2. Element Interaction
Use the act operation for page interaction:
- click: Click an element (ref: element reference)
- type: Type text (ref: input reference, text: content)
- press: Press a keyboard key (key: key name)
- hover: Hover over an element
- select: Select from a dropdown
- fill: Fill a form (fields: field dictionary)
- scroll: Scroll the page (x/y: coordinates)
3. Data Scraping
Extract data from web pages:
# Get a page snapshot to analyze structure
browser(action="snapshot")
# Extract table data - using selector
browser(action="act", kind="evaluate", selector="table.data", fn="Array.from(document.querySelectorAll('tr')).map(r => Array.from(r.querySelectorAll('td')).map(c => c.innerText))")
4. Automated Workflows
Automated form filling:
browser(action="act", kind="fill", fields=[
{"ref": "input-email", "value": "[email protected]"},
{"ref": "input-password", "value": "password123"}
])
browser(action="act", kind="click", ref="button-login")
Batch operations:
# Iterate through list items
for i in range(1, 6):
browser(action="act", kind="click", ref=f"item-{i}")
5. Testing Capabilities
UI testing scenarios:
- Regression Testing: Verify that page functionality works correctly
- Performance Monitoring: Page load time
- Element Existence Check: Verify that key elements are visible
Advanced Usage
Waiting for Page Load
browser(action="act", kind="wait", loadState="domcontentloaded", timeMs=5000)
Handling Dialogs
browser(action="dialog", kind="accept") # Confirm
# or
browser(action="dialog", kind="dismiss") # Cancel
File Upload
browser(action="upload", ref="input-file", paths=["C:/path/to/file.pdf"])
PDF Export
browser(action="pdf", path="C:/output/page.pdf")
Configuration Options
| Parameter | Description | Default |
|---|---|---|
| profile | Browser profile | "openclaw" |
| target | Browser target | "sandbox" |
| slowly | Slow motion mode | false |
| timeoutMs | Timeout duration | 30000 |
Common Selector Patterns
- Button:
button[type="submit"],#submit-btn - Input:
input[name="email"],#username - Link:
a[href*="login"] - Table:
table.data tr - List:
.item-list li
Notes
- Use
snapshotto get page structure before performing element operations - Dynamic content may require waiting for it to finish loading
- For logged-in state operations, use
profile="user"to reuse the user's browser - For large-scale data scraping, consider pagination to avoid timeouts
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install agent-browser-assistant - 安装完成后,直接呼叫该 Skill 的名称或使用
/agent-browser-assistant触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Agent Browser Assistant 是什么?
For browser automation tasks, web data scraping, form filling, page screenshots, UI testing, and more. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 69 次。
如何安装 Agent Browser Assistant?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install agent-browser-assistant」即可一键安装,无需额外配置。
Agent Browser Assistant 是免费的吗?
是的,Agent Browser Assistant 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Agent Browser Assistant 支持哪些平台?
Agent Browser Assistant 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Agent Browser Assistant?
由 OpenLark(@openlark)开发并维护,当前版本 v1.0.0。