/install html-to-html
HTML to HTML
Fetch a remote web page or local HTML file and convert it to clean structured HTML using MinerU. Strips noise and preserves semantic content.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Crawl a web page and output clean HTML (requires token)
mineru-open-api crawl https://example.com/article -f html -o ./out/
# Re-extract a local HTML file to clean HTML (requires token)
mineru-open-api extract page.html -f html -o ./out/
# Batch crawl multiple URLs to HTML (requires token)
mineru-open-api crawl url1 url2 -f html -o ./pages/
Authentication
Token required:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
Capabilities
- Input: remote web page URL or local .html file
- Output: clean structured HTML (
-f html) - For remote URLs: use
crawl -f html - For local HTML files: use
extract -f html - Requires token — not available in
flash-extract
Notes
- HTML output (
-f html) requires token; not available inflash-extract crawlsupports output formats: md, html, jsonextractsupports output formats: md, html, latex, docx, json- Output goes to stdout by default; use
-o \x3Cdir>to save to a file or directory - All progress/status messages go to stderr; document content goes to stdout
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install html-to-html - After installation, invoke the skill by name or use
/html-to-html - Provide required inputs per the skill's parameter spec and get structured output
What is HTML to HTML?
Clean and restructure HTML documents using MinerU. Takes messy or complex HTML and produces clean, well-formatted HTML output with proper structure preserved... It is an AI Agent Skill for Claude Code / OpenClaw, with 167 downloads so far.
How do I install HTML to HTML?
Run "/install html-to-html" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is HTML to HTML free?
Yes, HTML to HTML is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does HTML to HTML support?
HTML to HTML is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created HTML to HTML?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.