HTML Markdown

by mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0
cross-platform ✓ Security Clean
170 Downloads · 0 Stars · 0 Active Installs · 5 Versions
Install in OpenClaw
/install html-markdown
Description
Convert HTML to Markdown using MinerU's document processing engine. Transforms HTML pages and files into clean, readable Markdown format. Features: HTML to M...
README (SKILL.md)

HTML Markdown

Convert HTML files to clean Markdown using MinerU. For local HTML files use extract; for live web pages use crawl.

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Convert local HTML file to Markdown (requires token)
mineru-open-api extract page.html -o ./out/

# Convert remote HTML file to Markdown (requires token)
mineru-open-api extract https://example.com/doc.html -o ./out/

# Convert live web page to Markdown via crawl (requires token)
mineru-open-api crawl https://example.com/article -o ./out/

Authentication

Token required:

mineru-open-api auth             # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token
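
Because every HTML conversion needs a token, a script can check for it before calling the CLI. The following is a minimal sketch, not part of the skill: the helper name require_mineru_token is hypothetical, and it only inspects the MINERU_TOKEN environment variable (it cannot see a token stored by the interactive auth command).

```shell
# Hypothetical helper: fail early when MINERU_TOKEN is unset or empty.
# Assumption: the token is supplied via the environment, not via `mineru-open-api auth`.
require_mineru_token() {
  if [ -z "${MINERU_TOKEN:-}" ]; then
    echo "MINERU_TOKEN is not set; run 'mineru-open-api auth' or export it" >&2
    return 1
  fi
  return 0
}

# Example use (commented out so the sketch has no side effects):
# require_mineru_token && mineru-open-api extract page.html -o ./out/
```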

Capabilities

  • Supported input: local .html file or HTTP/HTTPS URL
  • Local .html file: use extract (token required)
  • Live web page URL: use crawl (token required)
  • HTML is NOT supported by flash-extract
  • Language hint with --language (default: ch, use en for English)
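
The extract/crawl rule above can be sketched as a small dispatcher. This is an illustration only: pick_subcommand is a hypothetical helper, and treating a URL that ends in .html or .htm as a static file (rather than a live page) is an assumption, not something the skill specifies.

```shell
# Hypothetical helper: print the subcommand suggested by the rules above.
# Assumption: a URL ending in .html/.htm is a static HTML file; any other URL is a live page.
pick_subcommand() {
  case "$1" in
    http://*.html|https://*.html|http://*.htm|https://*.htm) echo extract ;;
    http://*|https://*) echo crawl ;;
    *) echo extract ;;   # anything else is treated as a local file path
  esac
}

# Example use (commented out so the sketch does not call the CLI):
# src="https://example.com/article"
# mineru-open-api "$(pick_subcommand "$src")" "$src" -o ./out/
```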

Notes

  • HTML input always requires token (no flash-extract support)
  • For live web pages with JavaScript rendering, prefer crawl
  • Output goes to stdout by default; use -o <dir> to save to a file or directory
  • All progress/status messages go to stderr; document content goes to stdout
  • MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
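
Since document content goes to stdout and status messages go to stderr, the two streams can be redirected separately. A minimal sketch, assuming the stream behavior described in the notes above; html_to_md and the MINERU_CLI override variable are hypothetical names introduced here for illustration.

```shell
# Hypothetical wrapper: write Markdown to one file and progress messages to another.
# Assumption: MINERU_CLI is an illustrative override for the binary name (useful for testing).
html_to_md() {
  h2m_src="$1"; h2m_out="$2"; h2m_log="$3"
  h2m_cli="${MINERU_CLI:-mineru-open-api}"
  # stdout carries the converted document; stderr carries progress/status output.
  "$h2m_cli" extract "$h2m_src" > "$h2m_out" 2> "$h2m_log"
}

# Example use (commented out so the sketch does not call the CLI):
# html_to_md page.html page.md extract.log
```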
Usage Guidance
This skill appears coherent: it calls the MinerU CLI and needs only MINERU_TOKEN. Before installing, verify the npm package and GitHub repo (https://github.com/opendatalab/MinerU-Ecosystem) to ensure you trust the upstream code. Treat MINERU_TOKEN like any secret: do not reuse high-privilege tokens, restrict its scope if possible, and run the CLI in a controlled environment when processing untrusted HTML. If you cannot or will not install third-party binaries, do not grant the skill access to an environment where mineru-open-api is available.
Capability Analysis
Type: OpenClaw Skill · Name: html-markdown · Version: 0.4.0
The html-markdown skill is a legitimate wrapper for the MinerU document processing engine (OpenDataLab/Shanghai AI Lab). It facilitates HTML-to-Markdown conversion using the 'mineru-open-api' CLI tool. The SKILL.md file provides standard usage instructions, authentication requirements (MINERU_TOKEN), and installation steps via npm or Go that align with the official MinerU ecosystem. No indicators of data exfiltration, malicious execution, or prompt injection were found.
Capability Assessment
Purpose & Capability
Name/description match the requested artifacts: the skill requires the mineru-open-api binary and a MINERU_TOKEN, both of which are expected for a CLI that calls MinerU's service.
Instruction Scope
SKILL.md instructs the agent to run mineru-open-api extract/crawl and to set MINERU_TOKEN. It does not request other environment variables, read unrelated system files, or send data to unexpected endpoints beyond MinerU's service.
Install Mechanism
Installers are npm (mineru-open-api) and a go install from a GitHub repo — common and traceable mechanisms. As with any third-party package, installing a global npm package or go binary executes upstream code; review the npm package and GitHub repo before installing.
Credentials
Only MINERU_TOKEN is required and declared as the primary credential. That is proportionate for a hosted document-processing API. No unrelated credentials or config paths are requested.
Persistence & Privilege
always is false and the skill does not request system-wide configuration changes. It is an instruction-only skill that invokes an external CLI at runtime.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install html-markdown
  3. After installation, invoke the skill by name or use /html-markdown
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.0
SEO optimization v0.2.0
v1.0.1
Fix: declare MINERU_TOKEN credential in metadata
v1.0.0
HTML to Markdown - convert local HTML files or web pages to clean Markdown using MinerU. Use html-ex
Metadata
Slug html-markdown
Version 0.4.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 5
Frequently Asked Questions

What is HTML Markdown?

HTML Markdown is an AI Agent Skill for Claude Code / OpenClaw that converts HTML pages and files to clean, readable Markdown using MinerU's document processing engine. It has 170 downloads so far.

How do I install HTML Markdown?

Run "/install html-markdown" in the OpenClaw or Claude Code chat to install the skill. The skill itself also needs the mineru-open-api CLI and a MINERU_TOKEN, as described in the README above.

Is HTML Markdown free?

Yes, HTML Markdown is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does HTML Markdown support?

HTML Markdown is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created HTML Markdown?

It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.
