← Back to Skills Marketplace
Novel Scraper SPA
by
yuzhihui886
· GitHub ↗
· v1.0.0
· MIT-0
95
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install novel-scraper-spa
Description
支持 JavaScript 加载的小说抓取工具,使用 Playwright 渲染 SPA 网站。自动检测网站类型,静态网站使用 curl,SPA 网站使用浏览器渲染。
Usage Guidance
Before installing or running this skill:
- Expect to install Python dependencies manually: at minimum 'playwright', 'requests', and 'beautifulsoup4'. Also run 'playwright install' (or equivalent) to download browser engines — the skill does not declare or perform this.
- Because Playwright downloads browser binaries, running the skill will cause additional network downloads and write large files to disk; consider running in an isolated environment or virtualenv.
- The SKILL.md contains leftover template/TODO text and no dependency declarations — treat this as low-quality packaging rather than proof of malicious intent, but exercise caution.
- The scripts only fetch the target URL(s) and save scraped text locally (~/.openclaw/workspace/novels/). They do not exfiltrate data to other endpoints, but the tool will fetch arbitrary URLs you provide — avoid pointing it at internal/private endpoints or URLs requiring secrets unless you understand the risks.
- If you plan to use it, inspect and run the code in a sandboxed environment first, verify dependencies and that browser downloads are acceptable, and consider adding a requirements.txt and explicit install instructions (and lock Playwright/browser behavior) before using in production.
Capability Analysis
Type: OpenClaw Skill
Name: novel-scraper-spa
Version: 1.0.0
The skill is a functional web scraping tool designed to handle both static and Single Page Application (SPA) websites using Playwright and BeautifulSoup. The implementation in scripts/scraper.py and scripts/browser.py is well-structured, follows the stated purpose, and saves data only to a local workspace directory (~/.openclaw/workspace/novels/). No evidence of data exfiltration, malicious execution, or prompt injection was found.
Capability Assessment
Purpose & Capability
The skill's stated purpose (scraping SPA and static novel sites) matches the included scripts: requests/BeautifulSoup for static pages and Playwright-based rendering for SPA. However, the skill does not declare required Python packages (playwright, requests, bs4) or the browser binaries Playwright needs. That omission is disproportionate to the task and may surprise users at runtime.
Instruction Scope
SKILL.md instructs running the provided CLI (python3 scripts/scraper.py ...) and describes parameters and output location (~/.openclaw/workspace/novels/). The instructions do not ask the agent to read unrelated files, credentials, or to transmit scraped data to external endpoints. The doc contains leftover template TODO text, indicating the author didn't finish editing the README.
Install Mechanism
There is no install spec. That reduces automated risk (nothing is auto-downloaded by the skill itself) but is a practical problem because the code requires external runtime dependencies (Playwright and its browser engines). Playwright typically requires a manual 'playwright install' (which downloads browser binaries) or package install; the skill provides no guidance or declared dependency list.
Credentials
The skill requests no environment variables, credentials, or config paths. That is proportionate to its stated purpose: it scrapes arbitrary URLs provided by the user and saves output locally. There are no signs of hidden credential access or exfiltration in the included files.
Persistence & Privilege
The skill does not request 'always: true' or elevated persistence. It is user-invocable and can run autonomously per platform defaults, which is expected. It does not modify other skills or system configuration beyond writing output to the user's ~/.openclaw workspace.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install novel-scraper-spa - After installation, invoke the skill by name or use
/novel-scraper-spa - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
初始版本
Metadata
Frequently Asked Questions
What is Novel Scraper SPA?
支持 JavaScript 加载的小说抓取工具,使用 Playwright 渲染 SPA 网站。自动检测网站类型,静态网站使用 curl,SPA 网站使用浏览器渲染。 It is an AI Agent Skill for Claude Code / OpenClaw, with 95 downloads so far.
How do I install Novel Scraper SPA?
Run "/install novel-scraper-spa" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Novel Scraper SPA free?
Yes, Novel Scraper SPA is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Novel Scraper SPA support?
Novel Scraper SPA is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Novel Scraper SPA?
It is built and maintained by yuzhihui886 (@yuzhihui886); the current version is v1.0.0.
More Skills