← 返回 Skills 市场
openlark

Agent Browser Assistant

作者 OpenLark · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
69
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install agent-browser-assistant
功能描述
For browser automation tasks, web data scraping, form filling, page screenshots, UI testing, and more.
使用说明 (SKILL.md)

Agent Browser Assistant

An intelligent browser control assistant providing browser automation, data scraping, and testing capabilities.

Use Cases

Opening web pages, clicking/typing/scrolling, taking screenshots/recordings, extracting web content, exporting table data, automated form filling, batch operations, scheduled tasks, login authentication, UI testing, regression testing.

Quick Start

Use the browser tool for all browser operations:

# Open a web page
browser(action="open", url="https://example.com")

# Take a screenshot
browser(action="screenshot")

# Click an element
browser(action="act", kind="click", ref="button-submit")

# Type text
browser(action="act", kind="type", ref="input-username", text="[email protected]")

# Scroll the page
browser(action="act", kind="scroll", y=500)

# Get a page snapshot
browser(action="snapshot")

Core Capabilities

1. Page Operations

Operation Description Example
open Open a specified URL action="open", url="..."
snapshot Get page structure action="snapshot"
screenshot Take a page screenshot action="screenshot"
navigate Navigate to a URL action="navigate", url="..."
close Close a tab action="close", targetId="..."

2. Element Interaction

Use the act operation for page interaction:

  • click: Click an element (ref: element reference)
  • type: Type text (ref: input reference, text: content)
  • press: Press a keyboard key (key: key name)
  • hover: Hover over an element
  • select: Select from a dropdown
  • fill: Fill a form (fields: field dictionary)
  • scroll: Scroll the page (x/y: coordinates)

3. Data Scraping

Extract data from web pages:

# Get a page snapshot to analyze structure
browser(action="snapshot")

# Extract table data - using selector
browser(action="act", kind="evaluate", selector="table.data", fn="Array.from(document.querySelectorAll('tr')).map(r => Array.from(r.querySelectorAll('td')).map(c => c.innerText))")

4. Automated Workflows

Automated form filling:

browser(action="act", kind="fill", fields=[
    {"ref": "input-email", "value": "[email protected]"},
    {"ref": "input-password", "value": "password123"}
])
browser(action="act", kind="click", ref="button-login")

Batch operations:

# Iterate through list items
for i in range(1, 6):
    browser(action="act", kind="click", ref=f"item-{i}")

5. Testing Capabilities

UI testing scenarios:

  • Regression Testing: Verify that page functionality works correctly
  • Performance Monitoring: Page load time
  • Element Existence Check: Verify that key elements are visible

Advanced Usage

Waiting for Page Load

browser(action="act", kind="wait", loadState="domcontentloaded", timeMs=5000)

Handling Dialogs

browser(action="dialog", kind="accept")  # Confirm
# or
browser(action="dialog", kind="dismiss")  # Cancel

File Upload

browser(action="upload", ref="input-file", paths=["C:/path/to/file.pdf"])

PDF Export

browser(action="pdf", path="C:/output/page.pdf")

Configuration Options

Parameter Description Default
profile Browser profile "openclaw"
target Browser target "sandbox"
slowly Slow motion mode false
timeoutMs Timeout duration 30000

Common Selector Patterns

  • Button: button[type="submit"], #submit-btn
  • Input: input[name="email"], #username
  • Link: a[href*="login"]
  • Table: table.data tr
  • List: .item-list li

Notes

  1. Use snapshot to get page structure before performing element operations
  2. Dynamic content may require waiting for it to finish loading
  3. For logged-in state operations, use profile="user" to reuse the user's browser
  4. For large-scale data scraping, consider pagination to avoid timeouts
安全使用建议
Before installing or enabling this skill, consider the following: - Only use if you trust the skill's author/source; no homepage or source is provided. - Avoid using profile="user" or any setting that reuses your real browser profile (cookies, logins). Prefer ephemeral or sandboxed profiles. - Do not allow uploads or file path operations involving sensitive local files (password stores, private documents, keys). - If you need scraping/testing, run the skill in a restricted environment or VM with non-sensitive accounts/data first. - Ask the owner for source/instructions that limit where extracted data may be sent and for explicit safeguards (e.g., no network exfiltration, logging of actions). - Consider disabling autonomous invocation for this skill or restricting its permissions so it cannot run without explicit user confirmation. - If you decide to proceed, monitor activity (what URLs are opened, files accessed, and outputs written) and test with throwaway credentials/accounts.
功能分析
Type: OpenClaw Skill Name: agent-browser-assistant Version: 1.0.0 The skill bundle provides standard browser automation capabilities such as navigation, element interaction, and data scraping. The documentation in SKILL.md is entirely consistent with the stated purpose of a browser assistant and contains no evidence of malicious instructions, hidden payloads, or unauthorized data exfiltration.
能力评估
Purpose & Capability
Name/description and the SKILL.md align: it is an instruction-only browser automation/scraping/testing helper and does not declare unrelated binaries or credentials. However, the guidance to reuse a user's browser profile (profile="user") implies access to cookies, sessions, and other local browser data which goes beyond simple remote page automation and should be considered sensitive.
Instruction Scope
The SKILL.md instructs the agent to open arbitrary URLs, take snapshots/evaluate page JS, fill forms (including login flows), upload files from local paths, and write PDF/screenshots to local paths. These operations can access or expose sensitive data (saved sessions, cookies, local files). The instructions do not limit where scraped/extracted data may be sent or place bounds on which profiles/files may be used, giving broad discretion that could be used to exfiltrate data.
Install Mechanism
No install specification and no code files are present; this is an instruction-only skill so nothing is written to disk by an installer. That lowers supply-chain risk.
Credentials
The skill declares no environment variables or credentials, which is proportionate. However, the documented ability to use a named profile (e.g., profile="user") and file path parameters means the agent may be expected to access local browser profiles and arbitrary file system paths—sensitive capabilities that are not reflected in requires.* metadata and should be treated as elevated access.
Persistence & Privilege
always is false and the skill is user-invocable; model invocation is allowed (platform default). There is no request for permanent presence or modification of other skills. Autonomous invocation combined with the scope above increases blast radius, but autonomous invocation alone is normal and not flagged.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install agent-browser-assistant
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /agent-browser-assistant 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Agent Browser Assistant 1.0.0 – Initial Release - Provides browser automation for opening pages, clicking, form filling, scrolling, and more. - Supports web data scraping, page structure analysis, and table data export. - Includes tools for UI testing, regression testing, and performance checks. - Offers screenshot, recording, PDF export, file upload, and automated batch operations. - Features configurable browser profiles, targets, timeout settings, and slow motion mode.
元数据
Slug agent-browser-assistant
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Agent Browser Assistant 是什么?

For browser automation tasks, web data scraping, form filling, page screenshots, UI testing, and more. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 69 次。

如何安装 Agent Browser Assistant?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install agent-browser-assistant」即可一键安装,无需额外配置。

Agent Browser Assistant 是免费的吗?

是的,Agent Browser Assistant 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Agent Browser Assistant 支持哪些平台?

Agent Browser Assistant 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Agent Browser Assistant?

由 OpenLark(@openlark)开发并维护,当前版本 v1.0.0。

💬 留言讨论