← Back to Skills Marketplace
xuxingling55-sketch

Hilda Puppeteer

by xuxingling55-sketch · GitHub ↗ · v1.0.0 · MIT-0
linuxdarwinwin32 ⚠ suspicious
113
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install hilda-puppeteer
Description
Automate Chrome and Chromium with Puppeteer for scraping, testing, screenshots, and browser workflows.
README (SKILL.md)

Setup

On first use, read setup.md for integration guidelines.

When to Use

User needs browser automation: web scraping, E2E testing, PDF generation, screenshots, or any headless Chrome task. Agent handles page navigation, element interaction, waiting strategies, and data extraction.

Architecture

Scripts and outputs in ~/puppeteer/. See memory-template.md for structure.

~/puppeteer/
├── memory.md       # Status + preferences
├── scripts/        # Reusable automation scripts
└── output/         # Screenshots, PDFs, scraped data

Quick Reference

Topic File
Setup process setup.md
Memory template memory-template.md
Selectors guide selectors.md
Waiting patterns waiting.md

Core Rules

1. Always Wait Before Acting

Never click or type immediately after navigation. Always wait for the element:

await page.waitForSelector('#button');
await page.click('#button');

Clicking without waiting causes "element not found" errors 90% of the time.

2. Use Specific Selectors

Prefer stable selectors in this order:

  1. [data-testid="submit"] — test attributes (most stable)
  2. #unique-id — IDs
  3. form button[type="submit"] — semantic combinations
  4. .class-name — classes (least stable, changes often)

Avoid: div > div > div > button — breaks on any DOM change.

3. Handle Navigation Explicitly

After clicks that navigate, wait for navigation:

await Promise.all([
  page.waitForNavigation(),
  page.click('a.next-page')
]);

Without this, the script continues before the new page loads.

4. Set Realistic Viewport

Always set viewport for consistent rendering:

await page.setViewport({ width: 1280, height: 800 });

Default viewport is 800x600 — many sites render differently or show mobile views.

5. Handle Popups and Dialogs

Dismiss dialogs before they block interaction:

page.on('dialog', async dialog => {
  await dialog.dismiss(); // or dialog.accept()
});

Unhandled dialogs freeze the script.

6. Close Browser on Errors

Always wrap in try/finally:

const browser = await puppeteer.launch();
try {
  // ... automation code
} finally {
  await browser.close();
}

Leaked browser processes consume memory and ports.

7. Respect Rate Limits

Add delays between requests to avoid blocks:

await page.waitForTimeout(1000 + Math.random() * 2000);

Hammering sites triggers CAPTCHAs and IP bans.

Common Traps

  • page.click() on invisible element → fails silently, use waitForSelector with visible: true
  • Screenshots of elements off-screen → blank image, scroll into view first
  • page.evaluate() returns undefined → cannot return DOM nodes, only serializable data
  • Headless blocked by site → use headless: 'new' or set user agent
  • Form submit reloads page → page.waitForNavigation() or data is lost
  • Shadow DOM elements invisible to selectors → use page.evaluateHandle() to pierce shadow roots
  • Cookies not persisting → launch with userDataDir for session persistence

Security & Privacy

Data that stays local:

  • All scraped data in ~/puppeteer/output/
  • Browser profile in specified userDataDir

This skill does NOT:

  • Send scraped data anywhere
  • Store credentials (you provide them per-script)
  • Access files outside ~/puppeteer/

Related Skills

Install with clawhub install \x3Cslug> if user confirms:

  • playwright — Cross-browser automation alternative
  • chrome — Chrome DevTools and debugging
  • web — General web development

Feedback

  • If useful: clawhub star puppeteer
  • Stay updated: clawhub sync
Usage Guidance
This skill appears to do what it claims (Puppeteer automation) and doesn't demand any secrets, but there are red flags you should consider before installing or running it: 1) The setup encourages running npm install, which will download code and (for full puppeteer) Chromium — run installs only in a controlled environment (sandbox/container/VM) and inspect package versions. 2) The instructions exhort the agent to act without asking the user and to save a local memory file without surfacing the path; confirm whether the agent will actually run commands or only provide scripts, and insist on explicit user approval before the agent executes installs or scripts. 3) Review any scripts the agent creates under ~/puppeteer/scripts/ before running them — look for network calls, unexpected POSTs, or references to external endpoints. 4) If you will scrape sites requiring credentials or handling PII, restrict where memory/data are stored and consider encrypting or opting out of automatic persistence. 5) Prefer running initial tests in an isolated environment and enable logging or auditing so you can see what files were created and what network requests the agent made. If you need, ask the maintainer/source for signed release notes or a repository you can review before use.
Capability Analysis
Type: OpenClaw Skill Name: hilda-puppeteer Version: 1.0.0 The skill bundle provides a comprehensive and well-documented framework for browser automation using Puppeteer. The instructions in SKILL.md and setup.md focus on best practices for web scraping and testing, such as handling asynchronous navigation and element selection. There are no indicators of data exfiltration, malicious execution, or unauthorized access; the code explicitly emphasizes keeping data local to the ~/puppeteer/ directory.
Capability Assessment
Purpose & Capability
Name/description, required binary (node), and guidance about puppeteer/pupeteer-core are coherent. The skill is instruction-only and focuses on browser automation (scraping, testing, screenshots), so asking for node and recommending npm install puppeteer is expected.
Instruction Scope
SKILL.md and setup.md instruct the agent to detect system state (node version, npm list), create and write scripts under ~/puppeteer/, and persist a memory file. Two instructions are concerning: (1) 'Don't ask — just start naturally' encourages agent action without explicit user consent; (2) 'Store in ~/puppeteer/memory.md without mentioning file paths to them' asks the agent to save learned target sites and patterns while intentionally not telling the user where they're stored. The skill also asserts 'This skill does NOT: Send scraped data anywhere', but as an instruction-only skill there is no enforcement — scripts the agent generates could still exfiltrate data unless audited.
Install Mechanism
No formal install spec (lowest disk-write risk). However setup.md explicitly guides running 'npm install puppeteer' or 'puppeteer-core', which will download packages and (for full puppeteer) a Chromium binary from the network. That is expected for this purpose but is an install action executed at runtime and should be approved by the user and run in a controlled environment.
Credentials
The skill requests no environment variables, no credentials, and no config paths. It mentions per-script credentials are 'provided by you', which is proportionate. Still note: stored memory may include 'target sites or apps' and 'auth patterns'—sensitive data that will be kept locally per the spec, so users should verify where and how that data is saved.
Persistence & Privilege
always:false (good). But the skill encourages autonomous actions ('Don't ask — just start naturally') and instructs the agent to persist memory without visible disclosure. Because the platform allows autonomous skill invocation by default, these instructions increase the risk of the agent acting or storing sensitive data without clear user consent. This combination raises transparency and privacy concerns even if not an outright privilege escalation.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install hilda-puppeteer
  3. After installation, invoke the skill by name or use /hilda-puppeteer
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of hilda-puppeteer for browser automation. - Automate Chrome/Chromium for scraping, E2E testing, screenshots, and PDF generation. - Includes guidance on stable selectors, waiting patterns, rate limiting, and browser management. - Organizes automation scripts and output in a dedicated directory structure. - Addresses common Puppeteer pitfalls and browser automation best practices. - Stores all data locally; no credentials or data sent externally. - Lists related skills for extended web automation capabilities.
Metadata
Slug hilda-puppeteer
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Hilda Puppeteer?

Automate Chrome and Chromium with Puppeteer for scraping, testing, screenshots, and browser workflows. It is an AI Agent Skill for Claude Code / OpenClaw, with 113 downloads so far.

How do I install Hilda Puppeteer?

Run "/install hilda-puppeteer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Hilda Puppeteer free?

Yes, Hilda Puppeteer is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Hilda Puppeteer support?

Hilda Puppeteer is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin, win32).

Who created Hilda Puppeteer?

It is built and maintained by xuxingling55-sketch (@xuxingling55-sketch); the current version is v1.0.0.

💬 Comments