← Back to Skills Marketplace
freedompixels

网页内容提取

by freedompixels · GitHub ↗ · v1.1.1 · MIT-0
cross-platform ✓ Security Clean
156
Downloads
0
Stars
0
Active Installs
3
Versions
Install in OpenClaw
/install cn-readability-extractor
Description
网页内容提取器。输入URL,提取正文内容,去除广告和导航。
README (SKILL.md)

网页内容提取器

输入URL,提取干净正文。

功能

  • 从URL提取正文
  • 去除广告、导航、脚本
  • 提取标题和描述
  • 中英文支持

用法

python3 scripts/readability.py https://example.com

依赖

  • Python 3.7+
  • requests, certifi
Usage Guidance
This skill appears coherent and low-risk: it only fetches the URL you provide and extracts text locally. Before installing, note: (1) SKILL.md mentions the 'requests' package but the script uses urllib — you do not need to provide API keys or credentials; (2) the skill will make network requests to any URL you pass, so avoid running it on untrusted or internal URLs you don't want fetched; (3) the extractor is simple and may not perfectly remove all ads or boilerplate; (4) the package comes from an unknown source — if you plan to run it on sensitive systems, review the script locally (it’s small and readable) or run it in a sandboxed environment.
Capability Analysis
Type: OpenClaw Skill Name: cn-readability-extractor Version: 1.1.1 The skill is a legitimate web content extractor designed to fetch and clean text from user-provided URLs. The Python script `scripts/readability.py` uses standard libraries (urllib, html.parser) to remove ads and navigation elements, and the instructions in `SKILL.md` are consistent with this purpose. No evidence of data exfiltration, malicious execution, or prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description (网页内容提取) match the included script: the Python tool fetches a provided URL and extracts readable text while skipping common structural tags. One minor mismatch: SKILL.md lists 'requests' as a dependency but the script uses urllib.request and does not import requests.
Instruction Scope
SKILL.md instructs the agent to run 'python3 scripts/readability.py <URL>'. The script only fetches the given URL, parses HTML locally, and prints results. It does not read local files, environment variables, or transmit data to third-party endpoints beyond the requested URL.
Install Mechanism
No install spec — instruction-only with a bundled Python script. No downloads or external installers invoked by the skill itself.
Credentials
No environment variables, credentials, or config paths are requested. The script uses certifi for SSL verification; this is proportionate to its network fetch behavior.
Persistence & Privilege
Skill is not always-enabled and uses normal autonomous invocation defaults. It does not modify other skills or request persistent system-wide privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install cn-readability-extractor
  3. After installation, invoke the skill by name or use /cn-readability-extractor
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.1
- Improved README clarity and formatting. - Updated skill name and description for conciseness. - Added emoji metadata. - Listed package dependencies and minimum Python version. - Clarified usage instructions.
v1.1.0
v1.1.0: 修复SKILL.md格式,修复缺失脚本
v1.0.0
v1.0.0: initial release - extract clean article text from any URL
Metadata
Slug cn-readability-extractor
Version 1.1.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 3
Frequently Asked Questions

What is 网页内容提取?

网页内容提取器。输入URL,提取正文内容,去除广告和导航。 It is an AI Agent Skill for Claude Code / OpenClaw, with 156 downloads so far.

How do I install 网页内容提取?

Run "/install cn-readability-extractor" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 网页内容提取 free?

Yes, 网页内容提取 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 网页内容提取 support?

网页内容提取 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 网页内容提取?

It is built and maintained by freedompixels (@freedompixels); the current version is v1.1.1.

💬 Comments