← Back to Skills Marketplace
mengzi53

xiaofei自用-WeChat Article Scraper

by mengzi53 · GitHub ↗ · v2.0.0 · MIT-0
cross-platform ⚠ suspicious
119
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install xiaofei-ziyong-wechat-article-scraper
Description
微信公众号文章抓取工具。从 mp.weixin.qq.com 抓取公开文章(文字+图片+视频),解析内容块顺序,下载图片,按原顺序写入飞书知识库。
Usage Guidance
This skill reliably implements scraping and local image download using headless Chrome, but its advertised Feishu import feature is not implemented in the provided code and no Feishu credentials or CLI are requested. Before installing or running: - Expect the tool to only scrape and produce local Markdown + images unless you separately provide a Feishu integration (a CLI or API glue that supplies credentials). The current package does not contain Feishu API calls. - Do not assume it will post data to Feishu automatically—verify how you or your environment will supply Feishu tokens if you want automated import. If there is an external feishu_create_doc tool you plan to use, ensure it is trustworthy. - Be aware it uses headless Chrome with anti-detection flags to bypass WeChat protection; scraping mp.weixin.qq.com may violate the site’s terms of service or local laws—consider legal/ethical implications. - Run the scripts in an isolated environment (e.g., disposable VM or container) until you confirm behavior. Check the cache directory (/tmp or supplied cache-dir) for downloaded images and remove sensitive files afterwards. - If you need automated Feishu import, request the author to either (a) include explicit Feishu integration code and declare required env vars (tokens) or (b) document the exact external CLI/tools required. Without that, the skill is incomplete and its description is misleading.
Capability Analysis
Type: OpenClaw Skill Name: xiaofei-ziyong-wechat-article-scraper Version: 2.0.0 The skill bundle is a legitimate tool for scraping WeChat articles and preparing them for import into Feishu (Lark). It uses headless Google Chrome via subprocess calls to bypass anti-bot protections and extracts content blocks (text and images) as described in the documentation. The code in `scrape.py` and `scrape_and_import.py` follows safe practices for command execution and restricts image downloads to the WeChat CDN (mmbiz.qpic.cn), showing no signs of data exfiltration, persistence, or malicious intent.
Capability Assessment
Purpose & Capability
The skill's name and description claim both scraping mp.weixin.qq.com and writing the article into Feishu (飞书) knowledge base. The included Python scripts implement scraping, HTML parsing, and local image downloading, but do not contain any Feishu API calls or logic to authenticate/write to Feishu. SKILL.md references commands like feishu_create_doc / feishu_update_doc / feishu_doc_media that are not provided and are not declared as required binaries or env vars. Requiring only python3 and google-chrome is appropriate for scraping but insufficient for the claimed Feishu import capability.
Instruction Scope
Runtime instructions and scripts are narrowly scoped to fetching pages with headless Chrome, parsing HTML, extracting content blocks, and downloading images to a local cache directory; they do not read unrelated system files or environment variables. The scripts use aggressive anti-detection Chrome flags to bypass WeChat anti-scraping, which is expected for this purpose but may have legal/ToS implications. The SKILL.md asks the user to manually re-locate images in Feishu due to API limitations—consistent with the code producing Markdown and local images.
Install Mechanism
There is no install specification (instruction-only skill plus included scripts). No external downloads or archive extraction occur. The only runtime dependency declared is google-chrome and python3, which matches the code's use of subprocess to invoke Chrome and Python execution.
Credentials
The skill requests no environment variables or credentials, yet advertises writing to Feishu. Writing to Feishu would normally require API credentials (app_id/app_secret or access token) or a feishu CLI binary; none are declared. SKILL.md references Feishu API actions but the package neither asks for nor provides a means to supply Feishu credentials or a Feishu client. This is a clear mismatch between claimed capabilities and required permissions/configuration.
Persistence & Privilege
The skill does not request 'always: true' and does not modify other skills or global agent settings. It writes downloaded images to a local cache directory (default /tmp/wechat_article_<timestamp>) and otherwise prints outputs; this is normal for a scraping tool and does not escalate privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install xiaofei-ziyong-wechat-article-scraper
  3. After installation, invoke the skill by name or use /xiaofei-ziyong-wechat-article-scraper
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
首次发布:微信公众号文章抓取+飞书导入,支持Chrome headless绕过反爬、完整内容块解析、图片末尾追加
Metadata
Slug xiaofei-ziyong-wechat-article-scraper
Version 2.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is xiaofei自用-WeChat Article Scraper?

微信公众号文章抓取工具。从 mp.weixin.qq.com 抓取公开文章(文字+图片+视频),解析内容块顺序,下载图片,按原顺序写入飞书知识库。 It is an AI Agent Skill for Claude Code / OpenClaw, with 119 downloads so far.

How do I install xiaofei自用-WeChat Article Scraper?

Run "/install xiaofei-ziyong-wechat-article-scraper" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is xiaofei自用-WeChat Article Scraper free?

Yes, xiaofei自用-WeChat Article Scraper is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does xiaofei自用-WeChat Article Scraper support?

xiaofei自用-WeChat Article Scraper is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created xiaofei自用-WeChat Article Scraper?

It is built and maintained by mengzi53 (@mengzi53); the current version is v2.0.0.

💬 Comments