joannaxing

WeChat Article Extract

by JoannaXing · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 40 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install wechat-article-extract
Description
Extract public WeChat Official Account articles from mp.weixin.qq.com links or saved HTML into clean Markdown or structured JSON, including title, account na...
README (SKILL.md)

WeChat Article Extract

Use this skill to extract a public WeChat Official Account article into portable Markdown or JSON. It is intentionally local and generic: it does not require the user's Feishu, knowledge-base profile, database, or API keys.

Quick Start

Run the bundled script from the skill directory:

python3 scripts/extract_wechat_article.py "https://mp.weixin.qq.com/s/..." --format markdown --output article.md
python3 scripts/extract_wechat_article.py "https://mp.weixin.qq.com/s/..." --format json --output article.json

For an HTML file already saved from a browser:

python3 scripts/extract_wechat_article.py --html-file article.html --source-url "https://mp.weixin.qq.com/s/..." --format markdown

Workflow

  1. Confirm the input is a public https://mp.weixin.qq.com/s/... article URL or a saved HTML file. Private drafts, logged-in backend pages, and non-WeChat URLs are out of scope.
  2. Extract with scripts/extract_wechat_article.py.
  3. If network fetching fails because WeChat blocks the request, ask the user to save the article HTML from a browser and rerun with --html-file.
  4. Use Markdown for human-readable archives and JSON for downstream import pipelines.
  5. Respect copyright boundaries: summarize or transform extracted content when sharing it externally, and do not republish full articles unless the user has the rights to do so.
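
The scope check in step 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name `is_public_wechat_article_url` is hypothetical and is not part of the bundled script, and it accepts only the `https://mp.weixin.qq.com/s/...` shape named above. It cannot tell whether an article is actually public; it only rejects URLs that are clearly out of scope.

```python
from urllib.parse import urlparse


def is_public_wechat_article_url(url: str) -> bool:
    """Hypothetical helper: accept only https://mp.weixin.qq.com/s/<something> URLs."""
    parts = urlparse(url)
    return (
        parts.scheme == "https"
        and parts.hostname == "mp.weixin.qq.com"
        and parts.path.startswith("/s/")
        and len(parts.path) > len("/s/")  # require a non-empty article key
    )


if __name__ == "__main__":
    for candidate in (
        "https://mp.weixin.qq.com/s/abc123",
        "http://mp.weixin.qq.com/s/abc123",
        "https://example.com/s/abc123",
    ):
        print(candidate, is_public_wechat_article_url(candidate))
```

A check like this is best run before any network request, so that non-WeChat URLs are refused early and the saved-HTML fallback is reserved for genuine fetch failures.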

Outputs

Markdown output contains:

  • article title, account name, publish time, source URL, and image count
  • full text with blank-line paragraph separation
  • tables converted to Markdown tables when possible
  • inline image placeholders like [[WECHAT_IMAGE_1]]
  • image URL list at the end
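
If the placeholders need to become real Markdown images later, a post-processing step along these lines may work. It is a sketch, not the skill's own behavior: the function name `inline_images` is hypothetical, and it assumes that marker number N refers to the N-th entry (counting from 1) of the image URL list, which the README does not state explicitly.

```python
import re

# Matches placeholders such as [[WECHAT_IMAGE_1]]
MARKER = re.compile(r"\[\[WECHAT_IMAGE_(\d+)\]\]")


def inline_images(text: str, image_urls: list[str]) -> str:
    """Replace each marker with a Markdown image link (assumed 1-based indexing)."""

    def replace(match: re.Match) -> str:
        index = int(match.group(1)) - 1
        if 0 <= index < len(image_urls):
            return f"![image {index + 1}]({image_urls[index]})"
        return match.group(0)  # leave unknown markers untouched

    return MARKER.sub(replace, text)


if __name__ == "__main__":
    body = "Intro\n\n[[WECHAT_IMAGE_1]]\n\nOutro"
    print(inline_images(body, ["https://example.com/1.png"]))
```

Markers with no matching URL are left as-is so that nothing is silently dropped.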

JSON output contains:

  • articleId
  • title
  • author
  • publishTime
  • sourceUrl
  • content
  • contentWithImageMarkers
  • imageEntries
  • imageUrls
  • imageCount
  • coverImageUrl
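
A downstream import pipeline might verify these fields before using the file. The following is a minimal sketch that assumes the listed names are top-level keys of a single JSON object; the README lists the fields but not their nesting or types, and `missing_keys` is a hypothetical helper rather than part of the skill.

```python
import json

# Field names as listed in the README; top-level placement is an assumption.
EXPECTED_KEYS = {
    "articleId", "title", "author", "publishTime", "sourceUrl",
    "content", "contentWithImageMarkers", "imageEntries",
    "imageUrls", "imageCount", "coverImageUrl",
}


def missing_keys(raw_json: str) -> list[str]:
    """Return the expected field names that are absent from the JSON object."""
    data = json.loads(raw_json)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object at the top level")
    return sorted(EXPECTED_KEYS - set(data))


if __name__ == "__main__":
    sample = json.dumps({"title": "Example", "content": "Body text"})
    print(missing_keys(sample))
```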

Notes

  • The script uses only the Python standard library.
  • It preserves image positions with markers but does not download images by default.
  • Add --download-images <dir> when the user explicitly wants local image files.
  • WeChat article pages change over time; if live extraction fails, saved HTML is the most reliable fallback.
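
For agents that assemble the command programmatically, the flags shown in this README can be combined as in the sketch below. `build_command` is a hypothetical helper, and it assumes that `--output` and `--download-images` may be used together with either input mode, which the README does not explicitly confirm. It only builds the argument list and does not run the script.

```python
from typing import Optional


def build_command(
    url: Optional[str] = None,
    html_file: Optional[str] = None,
    fmt: str = "markdown",
    output: Optional[str] = None,
    download_images_dir: Optional[str] = None,
) -> list[str]:
    """Build an argument list using only the flags documented in the README."""
    if not url and not html_file:
        raise ValueError("provide an article URL or a saved HTML file")
    cmd = ["python3", "scripts/extract_wechat_article.py"]
    if html_file:
        # Saved-HTML fallback: the URL, if known, is passed as --source-url.
        cmd += ["--html-file", html_file]
        if url:
            cmd += ["--source-url", url]
    else:
        cmd.append(url)
    cmd += ["--format", fmt]
    if output:
        cmd += ["--output", output]
    if download_images_dir:
        # Only add when the user explicitly asked for local image files.
        cmd += ["--download-images", download_images_dir]
    return cmd


if __name__ == "__main__":
    print(build_command(html_file="article.html", fmt="json", output="article.json"))
```

Returning a list rather than a shell string keeps URLs containing `?` or `&` from being reinterpreted by a shell when the list is passed to something like `subprocess.run`.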
Usage Guidance
Install if you want an agent to fetch or parse public WeChat articles and save extracted output. Review the source URL and output path before running, and use image downloading only for articles and destinations you trust.
Capability Assessment
Purpose & Capability
The artifacts consistently describe extracting public mp.weixin.qq.com articles or user-supplied saved HTML into Markdown or JSON, with optional image downloading only when requested.
Instruction Scope
The trigger text uses several generic verbs, but it is still tied to WeChat/Weixin public article content and the workflow explicitly excludes private drafts, logged-in backend pages, and non-WeChat URLs.
Install Mechanism
Installation consists of a skill document, a small OpenAI agent YAML, tests, and a bundled Python standard-library script; there are no package installs, startup hooks, or hidden setup steps.
Credentials
Network access to public WeChat article URLs, reading a user-provided HTML file, and writing requested output files are proportionate to the stated extraction/archive workflow.
Persistence & Privilege
No background process, credential access, privilege escalation, profile/session use, or persistent agent behavior was found; output files and optional downloaded images are user-directed artifacts.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install wechat-article-extract
  3. After installation, invoke the skill by name or use /wechat-article-extract
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: extract public WeChat Official Account articles to Markdown or JSON without Feishu or knowledge-base dependencies.
Metadata
Slug wechat-article-extract
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is WeChat Article Extract?

Extract public WeChat Official Account articles from mp.weixin.qq.com links or saved HTML into clean Markdown or structured JSON, including title, account na... It is an AI Agent Skill for Claude Code / OpenClaw, with 40 downloads so far.

How do I install WeChat Article Extract?

Run "/install wechat-article-extract" in the OpenClaw or Claude Code chat to install it in one step. No extra setup is required.

Is WeChat Article Extract free?

Yes, WeChat Article Extract is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does WeChat Article Extract support?

WeChat Article Extract is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created WeChat Article Extract?

It is built and maintained by JoannaXing (@joannaxing); the current version is v1.0.0.
