content-extraction
Content Extraction — Executable Skill
This skill is the local executable version. It keeps the source-aware routing design and restores a concrete extraction workflow.
What it does
- Detects the input source
- Selects the best extraction channel
- Produces clean Markdown
- Saves long content locally when needed
- Explains fallback failures instead of hiding them
Main entrypoints
- scripts/extract_router.py: classify input and build a route plan
- scripts/extract.py: generate an executable extraction spec
Route priorities
- WeChat → browser chain
- Feishu doc/wiki → Feishu tools
- YouTube → transcript chain
- Generic URL → r.jina.ai → defuddle.md → web_fetch → browser fallback
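The route priorities above could be sketched roughly as follows. This is not the actual contents of scripts/extract_router.py; the function name `plan_route`, the host patterns (`mp.weixin.qq.com`, `feishu.cn`, `larksuite.com`, `youtube.com`, `youtu.be`), and the channel labels are assumptions made for illustration.

```python
from urllib.parse import urlparse

# Hypothetical host patterns and channel labels; the real router may differ.
ROUTES = [
    ("wechat", ("mp.weixin.qq.com",), ["browser"]),
    ("feishu", ("feishu.cn", "larksuite.com"), ["feishu_tools"]),
    ("youtube", ("youtube.com", "youtu.be"), ["transcript"]),
]

# Generic URLs walk this chain in order, falling back layer by layer.
GENERIC_CHAIN = ["r.jina.ai", "defuddle.md", "web_fetch", "browser"]


def plan_route(url: str) -> dict:
    """Classify a URL by hostname and return its ordered extraction chain."""
    host = (urlparse(url).hostname or "").lower()
    for name, suffixes, chain in ROUTES:
        # Match the domain itself or any subdomain of it.
        if any(host == s or host.endswith("." + s) for s in suffixes):
            return {"source": name, "chain": list(chain)}
    return {"source": "generic", "chain": list(GENERIC_CHAIN)}
```

Matching on the hostname suffix rather than a substring avoids misrouting look-alike domains, and keeping the generic chain as an ordered list makes the fallback order explicit.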
Output contract
Always return:
- title
- author when available
- source
- url
- summary
- Markdown body
- save path when content is long
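One way to picture this contract is as a small record type. The sketch below is an assumption, not the skill's real schema: the class name `ExtractionResult`, the field names, and the `LONG_CONTENT_CHARS` threshold are all hypothetical, since the source does not say what counts as "long".

```python
from dataclasses import dataclass, asdict
from typing import Optional

# Hypothetical cutoff; the skill does not document its actual threshold.
LONG_CONTENT_CHARS = 8000


@dataclass
class ExtractionResult:
    title: str
    source: str
    url: str
    summary: str
    markdown: str
    author: Optional[str] = None      # only when available
    save_path: Optional[str] = None   # only when content is long


def needs_save(result: ExtractionResult) -> bool:
    """Return True when the Markdown body is long enough to save locally."""
    return len(result.markdown) > LONG_CONTENT_CHARS


example = ExtractionResult(
    title="Example page",
    source="generic",
    url="https://example.com/a",
    summary="A short placeholder summary.",
    markdown="# Example page\n\nBody text.",
)
payload = asdict(example)  # plain dict, ready to serialize
```

Optional fields default to `None`, so a caller can tell "no author found" apart from an empty string.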
Fallback rule
Never claim success when extraction is partial. If a layer fails, report:
- where it failed
- why it failed
- what fallback was tried next
Notes
- The ClawHub abstracted package stays abstract.
- This local version restores the executable workflow for OpenClaw use and ClawDex publishing.
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install content-extraction
- After installation, invoke the skill by name or use /content-extraction
- Provide required inputs per the skill's parameter spec and get structured output
What is content-extraction?
OpenClaw-native executable content extraction skill for URLs, Feishu, YouTube, and web pages. It is an AI Agent Skill for Claude Code / OpenClaw, with 155 downloads so far.
How do I install content-extraction?
Run "/install content-extraction" in the OpenClaw or Claude Code chat to install it in one step; no extra setup is required.
Is content-extraction free?
Yes, content-extraction is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does content-extraction support?
content-extraction is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created content-extraction?
It is built and maintained by halfmoon82 (@halfmoon82); the current version is v1.1.0.