openlark

Agent Browser Assistant

by OpenLark · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 69 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install agent-browser-assistant
Description
A skill for browser automation tasks: web data scraping, form filling, page screenshots, UI testing, and more.
README (SKILL.md)

Agent Browser Assistant

An intelligent browser control assistant providing browser automation, data scraping, and testing capabilities.

Use Cases

Opening web pages, clicking/typing/scrolling, taking screenshots/recordings, extracting web content, exporting table data, automated form filling, batch operations, scheduled tasks, login authentication, UI testing, regression testing.

Quick Start

Use the browser tool for all browser operations:

# Open a web page
browser(action="open", url="https://example.com")

# Take a screenshot
browser(action="screenshot")

# Click an element
browser(action="act", kind="click", ref="button-submit")

# Type text
browser(action="act", kind="type", ref="input-username", text="[email protected]")

# Scroll the page
browser(action="act", kind="scroll", y=500)

# Get a page snapshot
browser(action="snapshot")

Core Capabilities

1. Page Operations

| Operation | Description | Example |
|---|---|---|
| open | Open a specified URL | action="open", url="..." |
| snapshot | Get page structure | action="snapshot" |
| screenshot | Take a page screenshot | action="screenshot" |
| navigate | Navigate to a URL | action="navigate", url="..." |
| close | Close a tab | action="close", targetId="..." |

2. Element Interaction

Use the act operation for page interaction:

  • click: Click an element (ref: element reference)
  • type: Type text (ref: input reference, text: content)
  • press: Press a keyboard key (key: key name)
  • hover: Hover over an element
  • select: Select from a dropdown
  • fill: Fill a form (fields: field dictionary)
  • scroll: Scroll the page (x/y: coordinates)
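
The interaction kinds listed above can be chained into a short sequence. The following is a minimal sketch rather than the tool's actual implementation: `browser` is a local stand-in that only records calls so the example runs outside OpenClaw, and the `values` parameter for `select` and all ref names are assumptions that this README does not confirm.

```python
# Stand-in for the real browser tool (assumption): it records each call so
# the sketch can run outside OpenClaw. Inside an agent, use the real tool.
calls = []

def browser(**kwargs):
    calls.append(kwargs)
    return {"ok": True}

# Hover over a menu, choose an option, type a query, then press Enter.
browser(action="act", kind="hover", ref="menu-products")
# `values` is a hypothetical parameter name for the option(s) to select.
browser(action="act", kind="select", ref="select-country", values=["US"])
browser(action="act", kind="type", ref="input-search", text="laptops")
browser(action="act", kind="press", key="Enter")

print([call["kind"] for call in calls])
```

In practice, the refs would come from a preceding `snapshot` call rather than being hard-coded.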

3. Data Scraping

Extract data from web pages:

# Get a page snapshot to analyze structure
browser(action="snapshot")

# Extract table data - using selector
browser(action="act", kind="evaluate", selector="table.data", fn="Array.from(document.querySelectorAll('table.data tr')).map(r => Array.from(r.querySelectorAll('td')).map(c => c.innerText))")

4. Automated Workflows

Automated form filling:

browser(action="act", kind="fill", fields=[
    {"ref": "input-email", "value": "[email protected]"},
    {"ref": "input-password", "value": "password123"}
])
browser(action="act", kind="click", ref="button-login")

Batch operations:

# Iterate through list items
for i in range(1, 6):
    browser(action="act", kind="click", ref=f"item-{i}")

5. Testing Capabilities

UI testing scenarios:

  • Regression Testing: Verify that page functionality still works correctly after changes
  • Performance Monitoring: Measure page load time
  • Element Existence Check: Verify that key elements are present and visible
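
The element existence check and the load-time measurement could be sketched as follows. This is an illustration under stated assumptions, not the skill's own test harness: `browser` is a local stand-in, the snapshot is assumed to be plain text that contains element refs, and `missing_refs` and `timed_open` are hypothetical helper names.

```python
import time

def browser(**kwargs):
    """Stand-in for the real browser tool (assumption)."""
    if kwargs.get("action") == "snapshot":
        # Hypothetical snapshot format: plain text that lists element refs.
        return 'button "Submit" [ref=button-submit]\ntextbox "Email" [ref=input-email]'
    return ""

def missing_refs(snapshot_text, required_refs):
    """Return the required refs that do not appear in the snapshot text.

    This is a plain substring check, so a ref that is a prefix of another
    ref would also count as present.
    """
    return [ref for ref in required_refs if ref not in snapshot_text]

def timed_open(url):
    """Open a URL, wait for the DOM, and return elapsed seconds."""
    start = time.perf_counter()
    browser(action="open", url=url)
    browser(action="act", kind="wait", loadState="domcontentloaded", timeMs=5000)
    return time.perf_counter() - start

elapsed = timed_open("https://example.com")
snapshot = browser(action="snapshot")
missing = missing_refs(snapshot, ["button-submit", "input-email", "input-password"])
print(f"load took {elapsed:.3f}s, missing elements: {missing}")
```

A regression run could repeat this check for each key page and fail when `missing` is non-empty or `elapsed` exceeds a chosen budget.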

Advanced Usage

Waiting for Page Load

browser(action="act", kind="wait", loadState="domcontentloaded", timeMs=5000)

Handling Dialogs

browser(action="dialog", kind="accept")  # Confirm
# or
browser(action="dialog", kind="dismiss")  # Cancel

File Upload

browser(action="upload", ref="input-file", paths=["C:/path/to/file.pdf"])

PDF Export

browser(action="pdf", path="C:/output/page.pdf")

Configuration Options

| Parameter | Description | Default |
|---|---|---|
| profile | Browser profile | "openclaw" |
| target | Browser target | "sandbox" |
| slowly | Slow motion mode | false |
| timeoutMs | Timeout duration (milliseconds) | 30000 |
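
One way to apply these options consistently is to merge per-call overrides over the documented defaults. The sketch below assumes the options are passed as keyword arguments on each `browser` call, which this README does not state explicitly; `with_defaults` is a hypothetical helper and `browser` is a local stand-in that echoes its arguments.

```python
# Defaults copied from the configuration table above.
DEFAULTS = {
    "profile": "openclaw",
    "target": "sandbox",
    "slowly": False,
    "timeoutMs": 30000,
}

def with_defaults(**overrides):
    """Merge per-call overrides over the documented defaults."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown option(s): {sorted(unknown)}")
    return {**DEFAULTS, **overrides}

def browser(**kwargs):
    """Stand-in for the real browser tool (assumption): echoes its arguments."""
    return kwargs

options = with_defaults(slowly=True, timeoutMs=60000)
result = browser(action="open", url="https://example.com", **options)
print(result)
```

Rejecting unknown option names catches typos such as `timeoutMS` before they reach the tool.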

Common Selector Patterns

  • Button: button[type="submit"], #submit-btn
  • Input: input[name="email"], #username
  • Link: a[href*="login"]
  • Table: table.data tr
  • List: .item-list li
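
Selectors like these can be embedded in the JavaScript passed to an `evaluate` call. The helper below is a minimal sketch (`table_rows_js` is a hypothetical name) that builds the table-extraction expression from the Data Scraping section for any table selector, using `json.dumps` so that quotes inside the selector are escaped correctly.

```python
import json

def table_rows_js(table_selector):
    """Build a JS expression that returns a table's cell text, row by row."""
    # json.dumps produces a valid, safely quoted JS string literal.
    row_selector = json.dumps(f"{table_selector} tr")
    return (
        f"Array.from(document.querySelectorAll({row_selector}))"
        ".map(r => Array.from(r.querySelectorAll('td')).map(c => c.innerText))"
    )

fn = table_rows_js("table.data")
print(fn)
```

The resulting string would then be passed as `fn` in `browser(action="act", kind="evaluate", fn=fn)`.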

Notes

  1. Use snapshot to get page structure before performing element operations
  2. Dynamic content may not be present immediately; wait for it to finish loading before interacting with it
  3. For operations that need a logged-in session, use profile="user" to reuse the user's own browser profile
  4. For large-scale data scraping, process results page by page to avoid timeouts
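
Notes 1, 2, and 4 can be combined into a page-by-page scraping loop. The sketch below is illustrative only: it assumes the snapshot is text containing element refs, that `evaluate` returns a list of rows, and that the pager control has the hypothetical ref `button-next`. `FakeBrowser` simulates a three-page site so the loop can run outside OpenClaw.

```python
def scrape_all_pages(browser, extract_fn, next_ref="button-next", max_pages=20):
    """Collect rows page by page until the 'next' control disappears."""
    rows = []
    for _ in range(max_pages):
        # Snapshot first so the check below reflects the current page.
        snapshot = browser(action="snapshot")
        rows.extend(browser(action="act", kind="evaluate", fn=extract_fn))
        if next_ref not in snapshot:
            break  # last page reached
        browser(action="act", kind="click", ref=next_ref)
        browser(action="act", kind="wait", loadState="domcontentloaded", timeMs=5000)
    return rows

class FakeBrowser:
    """Fake multi-page site standing in for the real browser tool (assumption)."""

    def __init__(self, pages):
        self.pages, self.index = pages, 0

    def __call__(self, **kwargs):
        if kwargs["action"] == "snapshot":
            has_next = self.index < len(self.pages) - 1
            return "[ref=button-next]" if has_next else "(no next button)"
        if kwargs.get("kind") == "evaluate":
            return self.pages[self.index]
        if kwargs.get("kind") == "click":
            self.index += 1
        return None

fake = FakeBrowser([[["a", "1"]], [["b", "2"]], [["c", "3"]]])
all_rows = scrape_all_pages(fake, extract_fn="/* JS that returns rows */")
print(all_rows)
```

The `max_pages` cap bounds the run even if the pager never disappears.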
Usage Guidance
Before installing or enabling this skill, consider the following:

  • Only use if you trust the skill's author/source; no homepage or source is provided.
  • Avoid using profile="user" or any setting that reuses your real browser profile (cookies, logins). Prefer ephemeral or sandboxed profiles.
  • Do not allow uploads or file path operations involving sensitive local files (password stores, private documents, keys).
  • If you need scraping/testing, run the skill in a restricted environment or VM with non-sensitive accounts/data first.
  • Ask the owner for source/instructions that limit where extracted data may be sent and for explicit safeguards (e.g., no network exfiltration, logging of actions).
  • Consider disabling autonomous invocation for this skill or restricting its permissions so it cannot run without explicit user confirmation.
  • If you decide to proceed, monitor activity (what URLs are opened, files accessed, and outputs written) and test with throwaway credentials/accounts.
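
Two of these precautions, avoiding the real user profile and restricting upload paths, could be enforced by validating arguments before they reach the tool. This is a sketch under stated assumptions, not a feature of the skill or of OpenClaw: `check_call`, `ALLOWED_PROFILES`, and the sandbox directory `/tmp/skill-uploads` are hypothetical.

```python
from pathlib import Path

ALLOWED_PROFILES = {"openclaw"}           # the documented default profile
UPLOAD_ROOT = Path("/tmp/skill-uploads")  # hypothetical sandbox directory

def check_call(**kwargs):
    """Raise ValueError if a browser call breaks the guidance above."""
    profile = kwargs.get("profile", "openclaw")
    if profile not in ALLOWED_PROFILES:
        raise ValueError(f"profile {profile!r} is not allowed")
    if kwargs.get("action") == "upload":
        root = UPLOAD_ROOT.resolve()
        for raw in kwargs.get("paths", []):
            resolved = Path(raw).resolve()
            # resolve() collapses "..", so traversal cannot escape the root.
            if root not in resolved.parents:
                raise ValueError(f"upload path {raw!r} is outside {root}")
    return kwargs

# An allowed call passes through unchanged.
safe = check_call(action="upload", ref="input-file",
                  paths=["/tmp/skill-uploads/report.pdf"])
print(safe["paths"])
```

A wrapper would call `check_call(**kwargs)` first and forward the arguments to the real tool only when no exception is raised.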
Capability Analysis
Type: OpenClaw Skill
Name: agent-browser-assistant
Version: 1.0.0

The skill bundle provides standard browser automation capabilities such as navigation, element interaction, and data scraping. The documentation in SKILL.md is entirely consistent with the stated purpose of a browser assistant and contains no evidence of malicious instructions, hidden payloads, or unauthorized data exfiltration.
Capability Assessment
Purpose & Capability
Name/description and the SKILL.md align: it is an instruction-only browser automation/scraping/testing helper and does not declare unrelated binaries or credentials. However, the guidance to reuse a user's browser profile (profile="user") implies access to cookies, sessions, and other local browser data which goes beyond simple remote page automation and should be considered sensitive.
Instruction Scope
The SKILL.md instructs the agent to open arbitrary URLs, take snapshots/evaluate page JS, fill forms (including login flows), upload files from local paths, and write PDF/screenshots to local paths. These operations can access or expose sensitive data (saved sessions, cookies, local files). The instructions do not limit where scraped/extracted data may be sent or place bounds on which profiles/files may be used, giving broad discretion that could be used to exfiltrate data.
Install Mechanism
No install specification and no code files are present; this is an instruction-only skill so nothing is written to disk by an installer. That lowers supply-chain risk.
Credentials
The skill declares no environment variables or credentials, which is proportionate. However, the documented ability to use a named profile (e.g., profile="user") and file path parameters means the agent may be expected to access local browser profiles and arbitrary file system paths—sensitive capabilities that are not reflected in requires.* metadata and should be treated as elevated access.
Persistence & Privilege
always is false and the skill is user-invocable; model invocation is allowed (platform default). There is no request for permanent presence or modification of other skills. Autonomous invocation combined with the scope above increases blast radius, but autonomous invocation alone is normal and not flagged.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install agent-browser-assistant
  3. After installation, invoke the skill by name or use /agent-browser-assistant
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Agent Browser Assistant 1.0.0 – Initial Release

  • Provides browser automation for opening pages, clicking, form filling, scrolling, and more.
  • Supports web data scraping, page structure analysis, and table data export.
  • Includes tools for UI testing, regression testing, and performance checks.
  • Offers screenshot, recording, PDF export, file upload, and automated batch operations.
  • Features configurable browser profiles, targets, timeout settings, and slow motion mode.
Metadata
Slug: agent-browser-assistant
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1
Frequently Asked Questions

What is Agent Browser Assistant?

Agent Browser Assistant is an AI agent skill for Claude Code / OpenClaw that handles browser automation tasks such as web data scraping, form filling, page screenshots, and UI testing. It has 69 downloads so far.

How do I install Agent Browser Assistant?

Run "/install agent-browser-assistant" in the OpenClaw or Claude Code chat to install it in one step; no extra setup is required.

Is Agent Browser Assistant free?

Yes, Agent Browser Assistant is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Agent Browser Assistant support?

Agent Browser Assistant is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Agent Browser Assistant?

It is built and maintained by OpenLark (@openlark); the current version is v1.0.0.
