/install html-to-text
HTML to Text
Extract plain readable text from HTML files or web pages using MinerU. MinerU outputs Markdown as the closest format to plain text.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Extract text from a local HTML file (requires token)
mineru-open-api extract page.html -o ./out/
# Extract text from a web page (requires token)
mineru-open-api crawl https://example.com/article
# JSON output contains text fields (requires token)
mineru-open-api extract page.html -f json -o ./out/
Authentication
Token required:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
Capabilities
- Supported input: local .html file or web page URL
- HTML requires
extractorcrawl(token required) — not supported byflash-extract - MinerU does not have a
-f textoption; Markdown is the closest plain-text output - For truly plain text: use
extract -f jsonand read the text fields from JSON output - Language hint with
--language(default:ch, useenfor English)
Notes
- MinerU has no
-f textformat; use Markdown output or-f jsonfor text fields - HTML is NOT supported by
flash-extract - Output goes to stdout by default; use
-o \x3Cdir>to save to a file or directory - All progress/status messages go to stderr; document content goes to stdout
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install html-to-text - After installation, invoke the skill by name or use
/html-to-text - Provide required inputs per the skill's parameter spec and get structured output
What is HTML to Text?
Convert HTML to plain readable text using MinerU. Strips HTML markup and extracts clean text content from web pages and HTML files. Features: HTML to text co... It is an AI Agent Skill for Claude Code / OpenClaw, with 159 downloads so far.
How do I install HTML to Text?
Run "/install html-to-text" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is HTML to Text free?
Yes, HTML to Text is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does HTML to Text support?
HTML to Text is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created HTML to Text?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.