← 返回 Skills 市场
Playwright Scraper CN
作者
onlyloveher
· GitHub ↗
· v1.2.0
· MIT-0
162
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install playwright-scraper-cn
功能描述
Playwright-based web scraping OpenClaw Skill with anti-bot protection. Successfully tested on complex sites like Discuss.com.hk.
安全使用建议
This package appears to be what it says: Playwright scripts for normal and stealth scraping. Before installing or running it, take these precautions:
- Inspect and remove or parameterize the hardcoded phone number in scripts/xianyu-login.js (it contains PII and is unnecessary for most uses).
- Review all scripts you plan to run (especially xianyu-*.js) — they can automate logins and will print page contents and save screenshots/HTML locally.
- Run npm install and Playwright installation in an isolated environment (container or VM). Playwright will download browser binaries (substantial disk/network activity).
- If you plan to use stealth/anti-bot features, understand they purposely mask automation (navigator overrides, function toString tampering); only use these where legally and ethically appropriate and consistent with site terms of service.
- Check package-lock.json integrity and your npm registry settings (the lockfile references a registry mirror) and prefer official registries if you have policy concerns.
- Avoid running the skill with elevated privileges or exposing sensitive accounts; if you need login automation, prefer manual credential injection and avoid storing credentials in code.
- If you will allow the agent to invoke the skill autonomously, restrict or monitor that behavior — scraping+evasion can have legal/abuse consequences and increases blast radius if misused.
If you want, I can point out the exact lines that set navigator overrides and the hardcoded phone number so you can edit them, or suggest a minimal sandboxed command to test the skill safely.
功能分析
Type: OpenClaw Skill
Name: playwright-scraper-cn
Version: 1.2.0
The skill bundle contains undocumented scripts specifically targeting the Xianyu (Goofish) platform, most notably `scripts/xianyu-login.js`, which includes a hardcoded Chinese phone number ('15982192571') and automates a login sequence to trigger SMS verification codes. While the core functionality in `scripts/playwright-stealth.js` appears to be a legitimate anti-bot scraping tool, the inclusion of targeted, undocumented login automation with hardcoded identifiers is highly irregular. This behavior suggests the skill may have been repurposed from a specific botting operation or contains artifacts intended for unauthorized account access, though no direct data exfiltration to external domains was observed.
能力评估
Purpose & Capability
The name/description (Playwright stealth scraper) align with the included scripts, package.json, and README. The code implements simple and stealth Playwright scrapers, example usage, and optional login helpers — all coherent with the stated purpose. Minor mismatch: SKILL metadata declares no required env/creds, yet scripts expect Playwright to be installed (npm + npx commands in docs) and support several environment variables for operation (HEADLESS, WAIT_TIME, USER_AGENT, etc.).
Instruction Scope
SKILL.md instructs installing Playwright and running the included scripts; runtime instructions and scripts stay within scraping behavior (navigating pages, taking screenshots, saving HTML, printing JSON). The scripts intentionally inject anti-detection code (overriding navigator.webdriver, faking navigator properties, manipulating Function.prototype.toString) — expected for 'stealth' scraping but also powerful evasion techniques. The xianyu-login.js script automates a login flow and prints page content and asks for an SMS code; that can expose PII/credentials if used incorrectly. No instructions send scraped data to remote endpoints — output is console and local files.
Install Mechanism
There is no automatic installer in the skill bundle; package.json and package-lock.json indicate normal npm dependencies (playwright and @playwright/test). package-lock entries resolve to public npm registries; one resolved URL uses a Tencent mirror which is a package registry mirror (not an arbitrary binary URL). No arbitrary downloads/extract-from-unknown-URL were found in the repository.
Credentials
The skill does not declare required secrets or credentials. Scripts accept benign environment variables (HEADLESS, WAIT_TIME, SCREENSHOT_PATH, SAVE_HTML, USER_AGENT) — proportional to web-scraping functionality. Notable issue: scripts/xianyu-login.js contains a hardcoded phone number (15982192571), which is unnecessary for general scraping and may contain PII that should be removed or parameterized. The README mentions future integration with CAPTCHA-solving services (2captcha/Anti-Captcha) which would require API keys; these are not currently implemented but would increase secret requirements if added.
Persistence & Privilege
The skill does not request persistent presence (always: false). It does not modify other skills or system-wide configurations. It writes local files (screenshots, HTML) and prints to stdout, which is normal for a scraper. Autonomous invocation is allowed by platform default (disable-model-invocation: false); combined with the skill's capabilities this increases potential impact if the agent were allowed to run it without human oversight, but this is platform-standard behaviour and not unique to this skill.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install playwright-scraper-cn - 安装完成后,直接呼叫该 Skill 的名称或使用
/playwright-scraper-cn触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.2.0
Initial release with CN suffix
元数据
常见问题
Playwright Scraper CN 是什么?
Playwright-based web scraping OpenClaw Skill with anti-bot protection. Successfully tested on complex sites like Discuss.com.hk. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 162 次。
如何安装 Playwright Scraper CN?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install playwright-scraper-cn」即可一键安装,无需额外配置。
Playwright Scraper CN 是免费的吗?
是的,Playwright Scraper CN 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Playwright Scraper CN 支持哪些平台?
Playwright Scraper CN 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Playwright Scraper CN?
由 onlyloveher(@onlyloveher)开发并维护,当前版本 v1.2.0。
推荐 Skills