Doc Analysis

by mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 213 · Stars: 0 · Active Installs: 0 · Versions: 6
Install in OpenClaw
/install doc-analysis
Description
Analyze the structure, layout, and content of Word documents (.doc, .docx) using MinerU. Returns structured Markdown with headings, paragraphs, tables, and l...
README (SKILL.md)

Doc Analysis

Analyze and extract structured content from Word (.doc/.docx) files using MinerU. Returns Markdown with layout, headings, and structure preserved.

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Analyze a .docx file (requires token)
mineru-open-api extract report.docx -o ./out/

# Analyze a .doc file (requires token)
mineru-open-api extract report.doc -o ./out/

# Specify language
mineru-open-api extract report.docx --language en -o ./out/
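
For a folder of documents, the extract command above can be wrapped in a loop. The following is a minimal sketch and not part of the skill: extract_all and the MINERU_BIN override are hypothetical names introduced here, and it assumes the CLI exits non-zero on failure, which this page does not confirm. Each document gets its own output subdirectory because the way the CLI names files inside -o is not documented here.

```shell
#!/usr/bin/env bash
# Sketch: run `extract` on every .docx in a directory.
# MINERU_BIN is a hypothetical override (not a CLI feature) so the helper
# can be pointed at another binary name or at a stub for testing.

extract_all() {
  local src_dir="$1" out_dir="$2"
  local bin="${MINERU_BIN:-mineru-open-api}"
  local failed=0 f name

  for f in "$src_dir"/*.docx; do
    [ -e "$f" ] || continue            # the glob matched nothing
    name="$(basename "$f" .docx)"
    mkdir -p "$out_dir/$name"
    # Assumption: a non-zero exit status means the extraction failed.
    if ! "$bin" extract "$f" -o "$out_dir/$name/"; then
      echo "failed: $f" >&2
      failed=$((failed + 1))
    fi
  done

  # Succeed only if every file was processed without error.
  [ "$failed" -eq 0 ]
}
```

Calling extract_all ./reports ./out would process ./reports/*.docx and leave one subdirectory per document under ./out.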

Authentication

Token required:

mineru-open-api auth             # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token
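
A script that calls extract can check its prerequisites first. The sketch below is illustrative only: mineru_preflight and the MINERU_BIN override are hypothetical names, and the check covers only the environment-variable route. A token stored by the interactive auth command would not be detected, since this page does not say where that command keeps it.

```shell
#!/usr/bin/env bash
# Sketch: verify the CLI and token before calling `extract`.
# MINERU_BIN is a hypothetical override, not part of the CLI.

mineru_preflight() {
  local bin="${MINERU_BIN:-mineru-open-api}"

  # The CLI must be resolvable as a command.
  if ! command -v "$bin" >/dev/null 2>&1; then
    echo "missing CLI: $bin (see Install)" >&2
    return 1
  fi

  # Only the MINERU_TOKEN environment variable is checked here.
  if [ -z "${MINERU_TOKEN:-}" ]; then
    echo "MINERU_TOKEN is not set; export it or run 'mineru-open-api auth'" >&2
    return 2
  fi

  return 0
}
```

A caller might guard the real command with it, for example: mineru_preflight && mineru-open-api extract report.docx -o ./out/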

Capabilities

  • Supported input: .doc, .docx (local file or URL)
  • Preserves document structure: headings, paragraphs, lists, tables
  • Requires token (mineru-open-api auth or MINERU_TOKEN env)
  • Language hint with --language (default: ch, use en for English)

Notes

  • .doc (legacy Word format) is only supported by extract (requires token)
  • .docx supports both flash-extract (no token, quick) and extract (full features)
  • Output goes to stdout by default; use -o <dir> to save to a file or directory
  • All progress/status messages go to stderr; document content goes to stdout
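
The notes above can be combined into a small helper that picks a subcommand by file type and keeps document content separate from status messages. This is a sketch under stated assumptions: pick_mode, to_markdown, and the MINERU_BIN override are hypothetical names; a set MINERU_TOKEN is used as a rough signal for choosing extract over flash-extract; and flash-extract is assumed to write to stdout like extract does. Verify flash-extract's behavior in your environment before relying on it.

```shell
#!/usr/bin/env bash
# Sketch: choose a subcommand per the Notes, then split stdout and stderr.
# MINERU_BIN is a hypothetical override, not part of the CLI.

pick_mode() {
  local name
  name="$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')"
  case "$name" in
    *.docx)
      # .docx supports both modes; use extract only when a token is set.
      if [ -n "${MINERU_TOKEN:-}" ]; then echo extract; else echo flash-extract; fi
      ;;
    *.doc)
      # Legacy .doc is only supported by extract (token required).
      echo extract
      ;;
    *)
      echo "unsupported file type: $1" >&2
      return 1
      ;;
  esac
}

to_markdown() {
  local input="$1" md_out="$2" log_out="$3" mode
  local bin="${MINERU_BIN:-mineru-open-api}"
  mode="$(pick_mode "$input")" || return 1
  # Document content arrives on stdout; progress and status on stderr.
  "$bin" "$mode" "$input" > "$md_out" 2> "$log_out"
}
```

For example, to_markdown report.docx report.md extract.log would leave the Markdown in report.md and the progress messages in extract.log.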
  • MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
Usage Guidance
This skill appears coherent and does what it claims: it runs the MinerU CLI against .doc/.docx files and uses a MINERU_TOKEN for authenticated extracts. Before installing:
  1. Confirm you trust the npm package name and the GitHub repo (inspect the repo/source if you can).
  2. Understand that documents processed by the CLI may be sent to MinerU servers when using the authenticated 'extract' mode; avoid sending highly sensitive documents unless you've verified the service's privacy/security.
  3. If you only need quick, local/no-token extraction, check whether 'flash-extract' actually operates without a token in your environment.
  4. Prefer installing in a sandbox or container first and verify behavior and network activity if you have strict security requirements.
Capability Analysis
Type: OpenClaw Skill · Name: doc-analysis · Version: 0.4.0
The skill bundle provides documentation and configuration for using the MinerU document intelligence engine to analyze Word files. It specifies the installation of the 'mineru-open-api' CLI tool via npm or Go and requires a 'MINERU_TOKEN' for API access. The instructions in SKILL.md are consistent with the stated purpose of document analysis and do not contain any malicious commands, data exfiltration patterns, or prompt injection attempts.
Capability Assessment
Purpose & Capability
Name/description (Word document analysis) match the declared binary (mineru-open-api) and the single required env var (MINERU_TOKEN). Requiring a MinerU CLI and token is expected for a hosted/open-source document analysis service.
Instruction Scope
Runtime instructions stick to running the mineru-open-api CLI on local files or URLs and handling stdout/stderr. One minor inconsistency: SKILL.md notes a 'flash-extract' mode that requires no token while metadata marks MINERU_TOKEN as required; this is likely an over-assertion in metadata rather than malicious scope creep.
Install Mechanism
Installers are standard: npm package and a Go 'go install' from a GitHub repo. No download-from-untrusted-URL or archive extraction steps are present. These are moderate-risk (npm/GitHub) but appropriate for a CLI tool.
Credentials
Only one credential is requested (MINERU_TOKEN) which is proportional for a remote MinerU service. The SKILL.md's mention that some quick extraction works without a token suggests the token may not be strictly required for all operations; metadata requiring it unconditionally is slightly overbroad but not a strong red flag.
Persistence & Privilege
The 'always' flag is false, the skill is user-invocable, and it does not request to modify other skills or system-wide configs. It only requires the mineru-open-api binary to be present.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install doc-analysis
  3. After installation, invoke the skill by name or use /doc-analysis
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.0
SEO optimization: expanded description with rich keywords, trigger phrases, and bilingual content for better ClawHub vector search ranking.
v1.1.0
Update to v1.1.0
v1.0.1
Fix: declare MINERU_TOKEN credential in metadata
v1.0.0
Doc Analysis - analyze the structure and content of Word (.doc/.docx) documents using MinerU. Return
Metadata
Slug doc-analysis
Version 0.4.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 6
Frequently Asked Questions

What is Doc Analysis?

Analyze the structure, layout, and content of Word documents (.doc, .docx) using MinerU. Returns structured Markdown with headings, paragraphs, tables, and l... It is an AI Agent Skill for Claude Code / OpenClaw, with 213 downloads so far.

How do I install Doc Analysis?

Run "/install doc-analysis" in the OpenClaw or Claude Code chat to install the skill. The skill also needs the mineru-open-api CLI (see Install) and, for the extract command, a MinerU token.

Is Doc Analysis free?

Yes, Doc Analysis is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Doc Analysis support?

Doc Analysis is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Doc Analysis?

It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.
