
网页内容提取 (Web Content Extraction)

Name: 网页内容提取
Author: freedompixels

by freedompixels · v1.1.1 · MIT-0

cross-platform ✓ Security Clean

Downloads: 156

Install in OpenClaw

/install cn-readability-extractor

Description

A web content extractor. Given a URL, it extracts the main body text and strips ads and navigation.

README (SKILL.md)

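Web Content Extractor

Give it a URL and get back clean body text.

Features

Extracts the main text from a URL
Removes ads, navigation, and scripts
Extracts the title and description
Supports Chinese and English

Usage

python3 scripts/readability.py https://example.com

Dependencies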

Python 3.7+
requests, certifi

Usage Guidance

This skill appears coherent and low-risk: it only fetches the URL you provide and extracts text locally. Before installing, note: (1) SKILL.md lists the 'requests' package as a dependency, but the script uses urllib and never imports requests; no API keys or credentials are needed; (2) the skill makes a network request to any URL you pass, so avoid pointing it at untrusted URLs or at internal URLs you don't want fetched; (3) the extractor is simple and may not remove all ads or boilerplate; (4) the package comes from an unknown source, so if you plan to run it on sensitive systems, review the script locally first (it's small and readable) or run it in a sandboxed environment.
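Note (2) of the guidance can be enforced with a small pre-check before a URL is handed to the script. The following is a minimal sketch, not part of the skill: the function name `looks_safe_to_fetch` is hypothetical, and the check only inspects the URL text. It does not resolve DNS, so a public-looking hostname that points at a private address would still pass.

```python
import ipaddress
from urllib.parse import urlsplit


def looks_safe_to_fetch(url):
    """Return False for non-HTTP(S) URLs and for obviously internal hosts.

    Hypothetical helper: it inspects the URL text only and performs no DNS lookup.
    """
    try:
        parts = urlsplit(url)
    except ValueError:
        return False  # malformed URL, e.g. an unbalanced IPv6 bracket
    if parts.scheme not in ("http", "https"):
        return False
    host = parts.hostname
    if not host:
        return False
    if host == "localhost" or host.endswith(".localhost"):
        return False
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        return True  # a hostname rather than a literal IP address
    # Rejects loopback, private, link-local, and other non-global ranges.
    return ip.is_global
```

A caller would run this check first and only invoke `python3 scripts/readability.py <URL>` when it returns True.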

Capability Analysis

Type: OpenClaw Skill
Name: cn-readability-extractor
Version: 1.1.1

The skill is a legitimate web content extractor designed to fetch and clean text from user-provided URLs. The Python script `scripts/readability.py` uses standard libraries (urllib, html.parser) to remove ads and navigation elements, and the instructions in `SKILL.md` are consistent with this purpose. No evidence of data exfiltration, malicious execution, or prompt injection was found.

Capability Assessment

ℹ Purpose & Capability

Name/description (网页内容提取) match the included script: the Python tool fetches a provided URL and extracts readable text while skipping common structural tags. One minor mismatch: SKILL.md lists 'requests' as a dependency but the script uses urllib.request and does not import requests.
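To illustrate the approach described here, the sketch below extracts a title, a meta description, and visible text with the standard-library `html.parser` while skipping structural tags. It is an assumption-laden illustration, not the contents of `scripts/readability.py`: the `SKIP_TAGS` set, the `TextExtractor` class, and the `extract` function are hypothetical names chosen for this example.

```python
from html.parser import HTMLParser

# Tags whose contents are dropped entirely (an assumed list, not the skill's own).
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside", "noscript"}


class TextExtractor(HTMLParser):
    """Collect the title, meta description, and visible text from HTML."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.description = ""
        self.chunks = []
        self._skip_depth = 0  # greater than 0 while inside a skipped element
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr = dict(attrs)
            if (attr.get("name") or "").lower() == "description":
                self.description = (attr.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._in_title:
            self.title += text
        elif self._skip_depth == 0:
            self.chunks.append(text)


def extract(html):
    """Parse an HTML string and return the title, description, and body text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return {
        "title": parser.title,
        "description": parser.description,
        "text": "\n".join(parser.chunks),
    }
```

Tracking a depth counter rather than a boolean keeps nested skipped elements (for example a `script` inside a `nav`) from re-enabling text capture too early. As the guidance notes, a tag-based filter like this is simple and will not catch every ad or piece of boilerplate.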

✓ Instruction Scope

SKILL.md instructs the agent to run 'python3 scripts/readability.py <URL>'. The script only fetches the given URL, parses HTML locally, and prints results. It does not read local files, environment variables, or transmit data to third-party endpoints beyond the requested URL.

✓ Install Mechanism

No install spec — instruction-only with a bundled Python script. No downloads or external installers invoked by the skill itself.

✓ Credentials

No environment variables, credentials, or config paths are requested. The script uses certifi for SSL verification; this is proportionate to its network fetch behavior.
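As a rough sketch of what using certifi for SSL verification can look like with `urllib`, the example below builds a verifying SSL context from certifi's CA bundle and passes it to `urllib.request.urlopen`. The function names `build_ssl_context` and `fetch`, the timeout, the User-Agent string, and the fallback to the system trust store are assumptions for illustration; the actual script may differ.

```python
import ssl
import urllib.request

try:
    import certifi  # third-party package listed in the skill's dependencies
except ImportError:
    certifi = None  # assumed fallback: use the system trust store instead


def build_ssl_context():
    """Return a certificate-verifying SSL context, preferring certifi's CA bundle."""
    if certifi is not None:
        return ssl.create_default_context(cafile=certifi.where())
    return ssl.create_default_context()


def fetch(url, timeout=10):
    """Fetch a URL over HTTP(S) and return the decoded response body."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(
        request, timeout=timeout, context=build_ssl_context()
    ) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

Both branches of `build_ssl_context` keep certificate and hostname verification enabled, which is the default for `ssl.create_default_context`.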

✓ Persistence & Privilege

Skill is not always-enabled and uses normal autonomous invocation defaults. It does not modify other skills or request persistent system-wide privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install cn-readability-extractor
After installation, invoke the skill by name or use /cn-readability-extractor
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.1.1

- Improved README clarity and formatting.
- Updated skill name and description for conciseness.
- Added emoji metadata.
- Listed package dependencies and minimum Python version.
- Clarified usage instructions.

v1.1.0

v1.1.0: Fixed SKILL.md formatting and the missing script

v1.0.0

v1.0.0: initial release - extract clean article text from any URL

Metadata

Slug cn-readability-extractor

Version 1.1.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 3

Frequently Asked Questions

What is 网页内容提取?

A web content extractor: give it a URL and it extracts the main body text, stripping ads and navigation. It is an AI Agent Skill for Claude Code / OpenClaw, with 156 downloads so far.

How do I install 网页内容提取?

Run "/install cn-readability-extractor" in the OpenClaw or Claude Code chat to install it in one step; no extra setup is required.

Is 网页内容提取 free?

Yes, 网页内容提取 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 网页内容提取 support?

网页内容提取 is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created 网页内容提取?

It is built and maintained by freedompixels (@freedompixels); the current version is v1.1.1.
