mzlzyca

Doc To HTML

by mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0
cross-platform ✓ Security Clean
218
Downloads
0
Stars
0
Active Installs
6
Versions
Install in OpenClaw
/install doc-to-html
Description
Convert Word documents (.doc, .docx) to HTML using MinerU's document processing engine. Produces clean HTML output preserving document structure and formatti...
README (SKILL.md)

Doc To HTML

Convert Word (.doc/.docx) documents to HTML using MinerU.

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Convert .docx to HTML (requires token)
mineru-open-api extract report.docx -f html -o ./out/

# Convert .doc to HTML (requires token)
mineru-open-api extract report.doc -f html -o ./out/

# With language hint
mineru-open-api extract report.docx -f html --language en -o ./out/
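
To convert several documents in one pass, the single-file command above can be wrapped in a loop. The following is a minimal sketch, not part of the skill itself: `convert_all` is a hypothetical helper name, and it relies only on the `extract <file> -f html -o <dir>` form shown in the Quick Start. It assumes the CLI exits with a non-zero status on failure.

```shell
# Hypothetical helper: convert every .doc/.docx file in a directory to HTML.
# Uses only the documented form: mineru-open-api extract <file> -f html -o <dir>
convert_all() {
  src_dir=$1
  out_dir=$2
  mkdir -p "$out_dir" || return 1
  failed=0
  for f in "$src_dir"/*.doc "$src_dir"/*.docx; do
    [ -e "$f" ] || continue   # skip glob patterns that matched nothing
    # Assumption: the CLI signals failure with a non-zero exit status.
    if ! mineru-open-api extract "$f" -f html -o "$out_dir"; then
      echo "conversion failed: $f" >&2
      failed=1
    fi
  done
  return "$failed"
}
```

A call such as `convert_all ./docs ./out` would process each Word file in `./docs` in turn and return non-zero if any conversion failed.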

Authentication

Token required:

mineru-open-api auth             # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token
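
When the environment-variable route is used, a script can fail early if the token is missing instead of waiting for the CLI to reject the request. This is a small sketch under that assumption; `require_token` is a hypothetical name. It only inspects `MINERU_TOKEN`, so a token saved through the interactive `mineru-open-api auth` setup would not be detected by this check.

```shell
# Hypothetical pre-flight check for the environment-variable route only.
require_token() {
  if [ -z "${MINERU_TOKEN:-}" ]; then
    echo "MINERU_TOKEN is not set; export it or run 'mineru-open-api auth'" >&2
    return 1
  fi
}
```

In a script this could be chained in front of a conversion, for example `require_token && mineru-open-api extract report.docx -f html -o ./out/`.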

Capabilities

  • Supported input: .doc, .docx (local file or URL)
  • Output format: HTML (-f html)
  • HTML output requires extract with token — not available in flash-extract
  • Language hint with --language (default: ch, use en for English)

Notes

  • HTML output (-f html) is only available via extract with token
  • Output goes to stdout by default; use -o <dir> to save to a file
  • All progress/status messages go to stderr; document content goes to stdout
  • MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
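
Because document content goes to stdout and progress messages go to stderr, the two streams can be captured separately with ordinary shell redirection. The wrapper below is a sketch; `convert_to_file` and its three arguments are hypothetical names chosen for illustration.

```shell
# Hypothetical wrapper: HTML (stdout) goes to one file, progress (stderr) to another.
convert_to_file() {
  input=$1
  html_out=$2
  log_out=$3
  # No -o flag here, so the converted HTML is written to stdout.
  mineru-open-api extract "$input" -f html > "$html_out" 2> "$log_out"
}
```

For example, `convert_to_file report.docx report.html convert.log` would leave the HTML in `report.html` and the status messages in `convert.log`.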
Usage Guidance
This skill is coherent: it invokes the MinerU CLI and needs one API token. Before installing, verify the mineru-open-api package and the referenced GitHub repo (opendatalab) to ensure they are the legitimate project, and confirm the npm package owner matches MinerU/OpenDataLab. Be aware that using the token typically uploads documents to the MinerU service for processing—do not send sensitive or private documents unless you trust the service and understand token scopes. Prefer creating an ephemeral or least-privilege token for the skill and rotate/revoke it when no longer needed.
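
One way to limit token exposure, in the spirit of the guidance above, is to set `MINERU_TOKEN` only for the single command that needs it rather than exporting it for the whole session. The sketch below assumes the token is kept in a local file readable only by your user; `with_token` and the token file location are hypothetical.

```shell
# Hypothetical helper: run one command with MINERU_TOKEN set only for that command.
# Assumption: the token is stored in a file you have restricted (e.g. chmod 600).
with_token() {
  token_file=$1
  shift
  [ -r "$token_file" ] || { echo "cannot read token file: $token_file" >&2; return 1; }
  # The assignment prefix applies to this one command, not to the calling shell.
  MINERU_TOKEN=$(cat "$token_file") "$@"
}
```

A call could look like `with_token ~/.mineru-token mineru-open-api extract report.docx -f html -o ./out/`, after which the variable is not left behind in the shell.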
Capability Analysis
Type: OpenClaw Skill
Name: doc-to-html
Version: 0.4.0
The doc-to-html skill is a legitimate wrapper for the MinerU document processing engine (OpenDataLab). It facilitates Word-to-HTML conversion using the 'mineru-open-api' CLI tool and requires a standard API token (MINERU_TOKEN). No evidence of data exfiltration, malicious execution, or prompt injection was found in SKILL.md or _meta.json.
Capability Assessment
Purpose & Capability
Name/description, required binary (mineru-open-api), and required env var (MINERU_TOKEN) are consistent with a document-conversion CLI front-end. The requested items are expected for a MinerU-based converter and there are no unrelated credentials or binaries.
Instruction Scope
SKILL.md directs the agent to run the mineru-open-api CLI to convert local files (or URLs) and to authenticate with a token. It does not instruct arbitrary file reads, environment inspection beyond MINERU_TOKEN, or exfiltration to unexpected endpoints; output behavior (stdout/stderr) is documented.
Install Mechanism
Installers are standard: npm package mineru-open-api and a Go 'go install' from a GitHub path (opendatalab). This is proportional for a CLI tool, but the skill source is marked 'unknown' and global installs will place a binary on the system—verify the npm package identity and GitHub repository before installing.
Credentials
Only a single credential (MINERU_TOKEN) is required and declared as primaryEnv; that aligns with the CLI's documented need for an API token. No unrelated secrets or config paths are requested.
Persistence & Privilege
always is false and the skill does not request system-wide configuration changes or persistent elevated privileges. Installing the CLI creates a user-accessible binary, which is expected for this functionality.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install doc-to-html
  3. After installation, invoke the skill by name or use /doc-to-html
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.0
SEO optimization: expanded description with rich keywords, trigger phrases, and bilingual content for better ClawHub vector search ranking.
v1.1.0
Update to v1.1.0
v1.0.1
Fix: declare MINERU_TOKEN credential in metadata
v1.0.0
Doc to HTML - convert Word (.doc/.docx) documents to HTML using MinerU. Use when you need HTML outpu
Metadata
Slug doc-to-html
Version 0.4.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 6
Frequently Asked Questions

What is Doc To HTML?

Convert Word documents (.doc, .docx) to HTML using MinerU's document processing engine. Produces clean HTML output preserving document structure and formatti... It is an AI Agent Skill for Claude Code / OpenClaw, with 218 downloads so far.

How do I install Doc To HTML?

Run "/install doc-to-html" in the OpenClaw or Claude Code chat to install the skill. Conversions also require the mineru-open-api CLI and a MinerU token, as described under Install and Authentication.

Is Doc To HTML free?

Yes, Doc To HTML is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Doc To HTML support?

Doc To HTML is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Doc To HTML?

It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.
