/install html-analysis
HTML Analysis
Analyze and extract structured content from HTML files (local files or remote HTML file URLs) using MinerU. The output is Markdown that preserves the document structure. For live web page URLs, use mineru-open-api crawl.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Analyze a local HTML file (requires token)
mineru-open-api extract page.html -o ./out/
# Analyze a remote HTML file by URL (requires token)
mineru-open-api extract https://example.com/page.html -o ./out/
# Crawl a live web page (requires token)
mineru-open-api crawl https://example.com/article -o ./out/
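To process a whole folder of saved pages, the extract command above can be wrapped in a loop. The following is a minimal sketch, not part of the CLI: extract_all is a hypothetical helper name, and it assumes that mineru-open-api exits with a non-zero status when an extraction fails.

```shell
#!/bin/sh
# Sketch: run `mineru-open-api extract` on every .html file in a directory.
# extract_all is a hypothetical helper name, not a CLI feature.
# Assumption: the CLI returns a non-zero exit status on failure.
extract_all() {
    src_dir=$1
    out_dir=$2
    failed=0
    for file in "$src_dir"/*.html; do
        # Skip the unexpanded pattern when the directory has no .html files.
        [ -e "$file" ] || continue
        name=$(basename "$file" .html)
        # One output directory per input keeps results from overwriting each other.
        mkdir -p "$out_dir/$name"
        if ! mineru-open-api extract "$file" -o "$out_dir/$name/"; then
            echo "extract failed: $file" >&2
            failed=$((failed + 1))
        fi
    done
    # Succeed only if every file was extracted.
    [ "$failed" -eq 0 ]
}

# Example: extract_all ./pages ./out
```

Each input gets its own output directory, so pages/a.html ends up under out/a/ and a failure on one file does not stop the rest.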
Authentication
Token required:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
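In scripts or CI jobs it can help to fail early when the token is missing instead of waiting for the CLI to reject the request. A minimal sketch follows; require_token is a hypothetical helper, and it only inspects the MINERU_TOKEN environment variable, so it will not notice a token saved through mineru-open-api auth.

```shell
#!/bin/sh
# Sketch: stop before calling the CLI when no token is in the environment.
# require_token is a hypothetical helper name. It checks MINERU_TOKEN only
# and cannot see a token stored by `mineru-open-api auth`.
require_token() {
    if [ -z "${MINERU_TOKEN:-}" ]; then
        echo "MINERU_TOKEN is not set; run 'mineru-open-api auth' or export it" >&2
        return 1
    fi
}

# Example: require_token && mineru-open-api extract page.html -o ./out/
```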
Capabilities
- Supported input: local .html file or remote HTML URL
- HTML input requires extract (token required); it is not supported by flash-extract
- For live web pages (rendered JS content), use mineru-open-api crawl
- Language hint with --language (default: ch; use en for English)
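The choice between extract and crawl described in the list above could be automated with a small wrapper. This is a sketch under stated assumptions: to_markdown is a hypothetical function, the rule "a URL ending in .html or .htm is a static file, any other URL is a live page" is a heuristic of this example rather than CLI behavior, and the space-separated form --language en is assumed. The language hint is not passed to crawl because the source does not say whether crawl accepts it.

```shell
#!/bin/sh
# Sketch: pick a subcommand from the shape of the input.
# to_markdown is a hypothetical wrapper; the routing rule is an assumption.
to_markdown() {
    input=$1
    out_dir=$2
    lang=${3:-ch}   # ch is the documented default language hint
    case $input in
        http://*.html|https://*.html|http://*.htm|https://*.htm)
            # URL that points at a static HTML file
            mineru-open-api extract "$input" -o "$out_dir" --language "$lang" ;;
        http://*|https://*)
            # Any other URL is treated as a live, possibly JS-rendered page
            mineru-open-api crawl "$input" -o "$out_dir" ;;
        *)
            # Anything else is treated as a local file path
            mineru-open-api extract "$input" -o "$out_dir" --language "$lang" ;;
    esac
}

# Example: to_markdown page.html ./out/ en
```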
Notes
- HTML is NOT supported by flash-extract; use extract with a token
- For web page crawling, use mineru-open-api crawl <URL> instead of extract
- Output goes to stdout by default; use -o <dir> to save to a file or directory
- All progress/status messages go to stderr; document content goes to stdout
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install html-analysis
- After installation, invoke the skill by name or use /html-analysis
- Provide required inputs per the skill's parameter spec and get structured output
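Because document content goes to stdout and status messages go to stderr, as the notes state, the two streams can be redirected separately when -o is omitted. A minimal sketch follows; extract_to_file is a hypothetical helper name and the file names are placeholders.

```shell
#!/bin/sh
# Sketch: capture the Markdown from stdout and keep status messages in a log.
# extract_to_file is a hypothetical helper name, not a CLI feature.
extract_to_file() {
    input=$1
    markdown_file=$2
    log_file=$3
    # No -o flag: the document is written to stdout, progress to stderr.
    mineru-open-api extract "$input" > "$markdown_file" 2> "$log_file"
}

# Example: extract_to_file page.html page.md extract.log
```

The same separation allows piping the Markdown straight into another tool without progress messages mixing into the content.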
What is HTML Analysis?
Analyze the structure and content of HTML documents using MinerU. Returns structured Markdown with layout information, headings, and content hierarchy preser... It is an AI Agent Skill for Claude Code / OpenClaw, with 176 downloads so far.
How do I install HTML Analysis?
Run "/install html-analysis" in the OpenClaw or Claude Code chat to install the skill in one step. Running it also requires the mineru-open-api CLI and a MinerU token (see Install and Authentication above).
Is HTML Analysis free?
Yes, HTML Analysis is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does HTML Analysis support?
HTML Analysis is cross-platform and runs anywhere OpenClaw or Claude Code is available.
Who created HTML Analysis?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.