← 返回 Skills 市场
网页内容提取小助手
作者
shuishouxinboda
· GitHub ↗
· v1.0.3
· MIT-0
134
总下载
1
收藏
0
当前安装
4
版本数
在 OpenClaw 中安装
/install jiayinclaw-12345
功能描述
从网页URL中提取标题、正文、图片链接等内容
使用说明 (SKILL.md)
网页内容提取器
这是一个实用的网页内容提取技能,可以从任意网页中提取结构化信息。
功能特点
- 自动提取网页标题和元数据
- 提取正文内容并清理HTML标签
- 提取所有图片链接
- 提取所有外链
- 支持指定提取元素
- 输出格式化JSON结果
使用方法
基本用法
技能输入:https://example.com
技能输出:{"title": "...", "content": "...", "images": [...], "links": [...]}
高级用法
- 指定提取特定元素
- 设置内容长度限制
- 自定义输出格式
技术规格
- 编程语言:Python 3
- 依赖库:requests, beautifulsoup4
- 网络要求:需要互联网连接
安全使用建议
This appears to be a straightforward web scraper. Before installing: (1) run it in a sandboxed or virtualenv environment and review/inspect scripts (the code is short and readable); (2) only pass URLs you trust — do not use it on internal dashboards or pages containing secrets; (3) respect robots.txt and site terms; (4) install dependencies via pip in an isolated environment; (5) if you need stronger guarantees, run it with network egress controls so it can only reach target sites.
功能分析
Type: OpenClaw Skill
Name: jiayinclaw-12345
Version: 1.0.3
The skill bundle is a standard web content extraction tool that uses the requests and BeautifulSoup4 libraries to scrape titles, text, and images from user-provided URLs. The code in scripts/extractor.py follows its stated purpose without any signs of data exfiltration, malicious execution, or prompt injection attempts.
能力评估
Purpose & Capability
Name/description (extract titles, content, images, links) match the included script and SKILL.md. Required libraries (requests, BeautifulSoup) are appropriate for the stated purpose and no unrelated binaries or credentials are requested.
Instruction Scope
SKILL.md and the script instruct only to fetch the target URL and parse its HTML. The runtime behavior is limited to requesting the provided URL, parsing content, and returning structured data; it does not read local files, access environment variables, or POST data to external endpoints other than the target site.
Install Mechanism
There is no automated install spec (no downloads or installers), which lowers risk. The package includes a Python script and requirements.txt that expect dependencies to be installed via pip; users should ensure dependencies are installed in a controlled environment (virtualenv) before running.
Credentials
The skill requires no environment variables, credentials, or config paths. The permissions indicated (network) are proportional and necessary for fetching webpages.
Persistence & Privilege
The skill does not request always:true, does not modify other skills or system-wide settings, and does not store credentials. Autonomous invocation is allowed by default but presents no additional incoherence here.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install jiayinclaw-12345 - 安装完成后,直接呼叫该 Skill 的名称或使用
/jiayinclaw-12345触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.3
Version 1.0.3
- No file changes detected in this release.
- Functionality and documentation remain unchanged from the previous version.
v1.0.2
- Version bump to 1.0.2 with no file or documentation changes.
- No functional updates or changes in this release.
v1.0.1
Version 1.0.1
- No changes detected from the previous version.
- All documentation, code, and configuration remain the same.
v1.0.0
Initial release of web-content-extractor:
- Extracts structured information (title, metadata, main text, images, links) from any web page URL.
- Supports extraction of specific elements and output in formatted JSON.
- Cleans HTML tags from the main content.
- Allows setting content length limits and customizing output format.
- Requires Python 3, requests, and beautifulsoup4; needs internet access.
元数据
常见问题
网页内容提取小助手 是什么?
从网页URL中提取标题、正文、图片链接等内容. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 134 次。
如何安装 网页内容提取小助手?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install jiayinclaw-12345」即可一键安装,无需额外配置。
网页内容提取小助手 是免费的吗?
是的,网页内容提取小助手 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
网页内容提取小助手 支持哪些平台?
网页内容提取小助手 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 网页内容提取小助手?
由 shuishouxinboda(@shuishouxinboda)开发并维护,当前版本 v1.0.3。
推荐 Skills