← Back to Skills Marketplace
josephxie1

web-to-markdown

by josephxie1 · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
326
Downloads
1
Stars
6
Active Installs
1
Versions
Install in OpenClaw
/install web-to-markdown
Description
网页内容抓取与图片提取技能。支持:(1) 将网页转换为 Markdown 格式阅读 (2) 从任意网站提取图片 URL (3) 批量下载网页图片。当需要抓取网页内容、读取文章、提取或下载网站图片时使用此技能。支持 markdown.new、defuddle.md、r.jina.ai 等转换服务,自动降级确保成功。...
Usage Guidance
This skill appears to implement the advertised scraping and image‑downloading features, but note two issues before installing: (1) the scripts require the third‑party Python package 'requests' (and scrape.py needs 'scrapling') despite a claim they only use the standard library — install those packages or the scripts will fail; (2) the tool sends page URLs/content to external conversion/fetch services (r.jina.ai, markdown.new, defuddle.md) — any content you fetch will be visible to those services, so avoid scraping sensitive/internal URLs. Also review and confirm the output directory (~/.openclaw/images) is acceptable, and be mindful of site terms and server load when bulk downloading.
Capability Analysis
Type: OpenClaw Skill Name: web-to-markdown Version: 1.0.0 The skill bundle provides legitimate web scraping and image extraction functionality using a combination of third-party conversion services (like r.jina.ai) and local Python scripts (download_images.py, extract_images.py, scrape.py). The code uses standard libraries like requests and scrapling to fetch and process web content, with no evidence of data exfiltration, unauthorized execution, or malicious prompt injection.
Capability Assessment
Purpose & Capability
Name/description match the included scripts and instructions: converting pages to Markdown and extracting/downloading images. However, SKILL.md incorrectly states that extract_images.py and download_images.py use only the Python standard library, while both scripts import the third‑party 'requests' package (and scrape.py requires 'scrapling'). This is an incoherence between claimed dependencies and actual requirements.
Instruction Scope
Instructions are narrowly scoped to web scraping, URL extraction, and downloading images. They explicitly instruct using external conversion services (markdown.new, defuddle.md, r.jina.ai). This means page URLs and page contents will be sent to those remote services — a privacy/ data‑sharing implication that is declared but should be highlighted to users. The scripts write files to a default local directory (~/.openclaw/images), which is expected for a downloader.
Install Mechanism
There is no install spec (instruction-only), so nothing is auto-downloaded/installed by the platform. The included scripts, however, depend on third‑party Python packages: 'requests' (used by extract_images.py and download_images.py) and 'scrapling' (required by scrape.py). SKILL.md partly documents scrapling installation but incorrectly claims the first two scripts need only the standard library. This mismatch could cause runtime failures and surprises for users.
Credentials
The skill requests no environment variables or credentials and does not access system config paths beyond creating a per‑user output directory. There are no undeclared credential requirements.
Persistence & Privilege
always is false, the skill is user-invocable and does not request persistent/privileged presence or modify other skill configurations. It only writes files to its output directory.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install web-to-markdown
  3. After installation, invoke the skill by name or use /web-to-markdown
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of web-to-markdown skill. - Convert any web page to clean Markdown for easier reading. - Extract all image URLs from any website with simple commands or scripts. - Batch-download images from web pages, with options for file type, size, and destination folder. - Supports multiple conversion services (markdown.new, defuddle.md, r.jina.ai) with automatic fallback for reliability. - Special handling for sites like Pinterest; works with dynamic and lazy-loaded images.
Metadata
Slug web-to-markdown
Version 1.0.0
License
All-time Installs 6
Active Installs 6
Total Versions 1
Frequently Asked Questions

What is web-to-markdown?

网页内容抓取与图片提取技能。支持:(1) 将网页转换为 Markdown 格式阅读 (2) 从任意网站提取图片 URL (3) 批量下载网页图片。当需要抓取网页内容、读取文章、提取或下载网站图片时使用此技能。支持 markdown.new、defuddle.md、r.jina.ai 等转换服务,自动降级确保成功。... It is an AI Agent Skill for Claude Code / OpenClaw, with 326 downloads so far.

How do I install web-to-markdown?

Run "/install web-to-markdown" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is web-to-markdown free?

Yes, web-to-markdown is completely free (open-source). You can download, install and use it at no cost.

Which platforms does web-to-markdown support?

web-to-markdown is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created web-to-markdown?

It is built and maintained by josephxie1 (@josephxie1); the current version is v1.0.0.

💬 Comments