← Back to Skills Marketplace
ericpearson

Scrapclaw

by Eric Pearson · GitHub ↗ · v0.0.6 · MIT-0
cross-platform ✓ Security Clean
277
Downloads
0
Stars
0
Active Installs
6
Versions
Install in OpenClaw
/install scrapclaw
Description
Run Scrapclaw as a Dockerized browser-backed scraping service, then use this skill to fetch HTML from JavaScript-heavy or Cloudflare-protected pages through...
Usage Guidance
This skill appears to do what it says: run/call a Dockerized Scrapclaw service. Before installing, (1) decide whether you'll use the published ghcr.io image or build from source — if building, review the Dockerfile and repository because docker-compose up --build can run arbitrary code. (2) Prefer running the container in an isolated VM or restricted host if you are unsure about target pages. (3) Confirm whether SCRAPCLAW_API_TOKEN must be set for your deployment — the SKILL.md treats it as optional but the registry metadata marks it required. (4) Do not expose the service to public networks and avoid scraping localhost/private RFC1918 targets unless explicitly allowed. (5) If you rely on the published image, verify the image source (ghcr.io/ericpearson/scrapclaw:v0.0.6) and its release notes before trusting it.
Capability Analysis
Type: OpenClaw Skill Name: scrapclaw Version: 0.0.6 The scrapclaw skill is a legitimate tool for interfacing with a Dockerized browser-based scraping service. The SKILL.md file includes several proactive security instructions for the AI agent, such as warnings against accessing internal network ranges (SSRF protection), treating scraped HTML as untrusted input to prevent indirect prompt injection, and properly handling sensitive API tokens. No malicious behaviors, such as unauthorized data exfiltration or hidden execution of commands, were found.
Capability Assessment
Purpose & Capability
Name/description describe a Dockerized browser-backed scraper and the skill only requires docker/curl/git and a base URL + optional API token — these align with running a container and calling its HTTP API. Note: git is only needed if building from source; the published image path does not require git.
Instruction Scope
SKILL.md contains concrete, scoped runtime instructions (health check, POST /v1 payload, cURL examples) and explicitly warns about reviewing Dockerfiles and avoiding internal-only targets. It does not instruct reading unrelated files or exfiltrating data.
Install Mechanism
Instruction-only skill (no install spec, no code files). The author recommends running an image from GitHub Container Registry or building from source; the doc correctly warns that building from unreviewed source can execute arbitrary host code.
Credentials
Requested environment variables (SCRAPCLAW_BASE_URL and SCRAPCLAW_API_TOKEN) are appropriate for calling the service. However, metadata lists SCRAPCLAW_API_TOKEN as required while SKILL.md treats it as optional (use only if set). Also primaryEnv is set to SCRAPCLAW_BASE_URL (a URL) which is unusual but not harmful. The declared required binaries include git even though using the published image does not need git.
Persistence & Privilege
Skill is not force-included (always:false), does not request persistent system-wide changes in the instructions, and does not modify other skills. Autonomous invocation is allowed (default) but not combined with other red flags.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install scrapclaw
  3. After installation, invoke the skill by name or use /scrapclaw
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.0.6
Add optional response truncation so callers can cap HTML or text payload size per request.
v0.0.5
Add text extraction mode, security hardening, improved install guidance, and ibarsi's contribution.
v0.0.4
Security hardening, improved install guidance, and ibarsi's contribution.
v0.0.3
Improve ClawHub description and install guidance
v0.0.2
Release 0.0.2
v0.0.1
Initial release
Metadata
Slug scrapclaw
Version 0.0.6
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 6
Frequently Asked Questions

What is Scrapclaw?

Run Scrapclaw as a Dockerized browser-backed scraping service, then use this skill to fetch HTML from JavaScript-heavy or Cloudflare-protected pages through... It is an AI Agent Skill for Claude Code / OpenClaw, with 277 downloads so far.

How do I install Scrapclaw?

Run "/install scrapclaw" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Scrapclaw free?

Yes, Scrapclaw is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Scrapclaw support?

Scrapclaw is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Scrapclaw?

It is built and maintained by Eric Pearson (@ericpearson); the current version is v0.0.6.

💬 Comments