HTML Parse
Parse local HTML files into structured Markdown using MinerU. Preserves document hierarchy. For live web pages, use mineru-open-api crawl.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Parse a local HTML file (requires token)
mineru-open-api extract page.html -o ./out/
# Parse a remote HTML URL (requires token)
mineru-open-api extract https://example.com/page.html -o ./out/
# Parse a live web page (requires token)
mineru-open-api crawl https://example.com/article -o ./out/
Authentication
Token required:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
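Because every command here needs a token, a script can check for it before calling the CLI. The sketch below is a minimal guard and rests on assumptions: the function name require_mineru_token is hypothetical and not part of mineru-open-api, and it only inspects the MINERU_TOKEN environment variable, so a token saved through the interactive mineru-open-api auth setup would not be detected.

```shell
# Hypothetical helper (not part of mineru-open-api): fail early when
# MINERU_TOKEN is missing from the environment.
require_mineru_token() {
  if [ -z "${MINERU_TOKEN:-}" ]; then
    # Status messages go to stderr so stdout stays clean for document content.
    echo "MINERU_TOKEN is not set; run 'mineru-open-api auth' or export it" >&2
    return 1
  fi
}
```

One possible use is to chain it in front of a parse: require_mineru_token && mineru-open-api extract page.html -o ./out/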
Capabilities
- Supported input: local .html file or remote HTML URL
- HTML requires extract or crawl (token required)
- HTML is NOT supported by flash-extract
- Language hint with --language (default: ch, use en for English)
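The choice between extract and crawl can be scripted when inputs are mixed. The helper below is a rough heuristic under stated assumptions, not the tool's own logic: mineru_subcommand is a hypothetical name, and the rule that URLs ending in .html or .htm are static files while all other URLs are live pages is a guess that will misclassify some sites.

```shell
# Hypothetical helper: print the subcommand to try for a given input.
# Assumed rule: local paths and URLs ending in .html/.htm -> extract,
# any other http(s) URL -> crawl.
mineru_subcommand() {
  case "$1" in
    http://*.html|https://*.html|http://*.htm|https://*.htm) echo extract ;;
    http://*|https://*) echo crawl ;;
    *) echo extract ;;
  esac
}
```

A caller might then run: mineru-open-api "$(mineru_subcommand "$input")" "$input" -o ./out/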
Notes
- HTML is NOT supported by flash-extract; use extract or crawl
- For live web pages with dynamic content, use crawl instead of extract
- Output goes to stdout by default; use -o <dir> to save to a file or directory
- All progress/status messages go to stderr; document content goes to stdout
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
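Since document content goes to stdout and status messages go to stderr, the two streams can be captured separately with ordinary shell redirection. The wrapper below is a sketch under assumptions: extract_to_file is a hypothetical name, and MINERU_BIN is an assumed override used only so the wrapper can be pointed at a different binary; it is not a variable the CLI documents.

```shell
# Hypothetical wrapper: MINERU_BIN is an assumed override, not a documented variable.
extract_to_file() {
  # $1 = input HTML file or URL, $2 = Markdown output path, $3 = log path
  # stdout (document content) goes to $2, stderr (progress/status) goes to $3.
  "${MINERU_BIN:-mineru-open-api}" extract "$1" > "$2" 2> "$3"
}
```

For example, extract_to_file page.html page.md extract.log would leave the Markdown in page.md and the progress messages in extract.log.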
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install html-parse
- After installation, invoke the skill by name or use /html-parse
- Provide required inputs per the skill's parameter spec and get structured output
What is HTML Parse?
Parse HTML documents into structured Markdown using MinerU. Analyzes HTML structure and converts it into well-organized Markdown preserving hierarchy and for... It is an AI Agent Skill for Claude Code / OpenClaw, with 190 downloads so far.
How do I install HTML Parse?
Run "/install html-parse" in the OpenClaw or Claude Code chat to install the skill in one step. Running it still requires the mineru-open-api CLI and a token, as described under Install and Authentication.
Is HTML Parse free?
Yes, HTML Parse is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does HTML Parse support?
HTML Parse is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created HTML Parse?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.