← 返回 Skills 市场
mzlzyca

Formula Ocr

作者 mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0
cross-platform ✓ 安全检测通过
182
总下载
0
收藏
0
当前安装
6
版本数
在 OpenClaw 中安装
/install formula-ocr
功能描述
OCR and recognize mathematical formulas from PDFs and images using MinerU. Converts printed or handwritten equations into structured LaTeX or text representa...
使用说明 (SKILL.md)

Formula Ocr

Convert and extract content from .pdf / images (.png/.jpg/.jpeg/.webp) using MinerU (mineru-open-api).

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Extract formulas from PDF (requires token)
mineru-open-api extract paper.pdf -o ./out/

# With VLM for better formula accuracy
mineru-open-api extract paper.pdf --model vlm -o ./out/

Authentication

Token required for extract and crawl:

mineru-open-api auth            # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token

Capabilities

  • Supports local files and URLs
  • Requires token (mineru-open-api auth or MINERU_TOKEN env)
  • Supported input: .pdf / images (.png/.jpg/.jpeg/.webp)
  • Language hint with --language (default: ch, use en for English)
  • Page range with --pages (where applicable)

Notes

  • Formula recognition requires extract with token. The --formula flag is enabled by default.
  • Output goes to stdout by default; use -o \x3Cdir> to save to file
  • Binary formats (docx) require -o flag (cannot stream to stdout)
  • All progress/status messages go to stderr
  • MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
安全使用建议
This skill appears coherent, but check a few practical points before installing: 1) Verify the mineru-open-api npm package or the GitHub repo are the official MinerU distribution and review their source if you have security concerns (npm installs can run install-time scripts). 2) Be aware that using the skill will send PDFs/images to MinerU's service (mineru.net) — do not upload sensitive or confidential documents unless you trust the service and its privacy/retention policies. 3) MINERU_TOKEN grants access to your account; treat it like any API secret (store it securely and revoke/regenerate it if compromised). 4) If you require higher assurance, run the CLI in an isolated environment or review the repository code before installing. Overall the requests and instructions match the stated OCR purpose.
功能分析
Type: OpenClaw Skill Name: formula-ocr Version: 0.4.0 The skill bundle provides instructions and metadata for integrating the MinerU OCR tool (by OpenDataLab) to recognize mathematical formulas. It correctly identifies its dependencies (mineru-open-api), required environment variables (MINERU_TOKEN), and provides standard usage examples without any evidence of malicious intent, data exfiltration, or prompt injection attacks in SKILL.md.
能力评估
Purpose & Capability
Name/description request OCR of mathematical formulas. The skill only requires the mineru-open-api CLI and a MINERU_TOKEN, which are directly relevant to invoking MinerU's service—no unrelated credentials or binaries are requested.
Instruction Scope
SKILL.md instructs the agent to run mineru-open-api commands (extract, auth) on local files or URLs and to provide MINERU_TOKEN. It does not ask the agent to read unrelated files, other environment variables, or exfiltrate data to unexpected endpoints; the only external endpoint referenced is MinerU (mineru.net).
Install Mechanism
Install paths are standard: npm package (mineru-open-api) or go install from the project's GitHub repo. This is appropriate for a CLI-only integration. Note: npm packages (and go installs from remote repos) can run code at install time, so you should review the package/repo if you require a higher trust level.
Credentials
Only MINERU_TOKEN is required and declared as the primary credential. That single API token is proportional to a service that requires authentication. No other secrets or unrelated environment variables are requested.
Persistence & Privilege
The skill does not request always:true or any elevated persistent presence, and it does not indicate modifying system or other-skill configuration. Autonomous invocation (default) is allowed but not exceptional here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install formula-ocr
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /formula-ocr 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.1
SEO optimization v0.2.1
v0.2.0
SEO optimization v0.2.0
v1.0.1
Fix: declare MINERU_TOKEN credential in metadata
v1.0.0
Formula OCR - recognize and extract mathematical formulas from PDFs or images using MinerU. Use for
元数据
Slug formula-ocr
版本 0.4.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 6
常见问题

Formula Ocr 是什么?

OCR and recognize mathematical formulas from PDFs and images using MinerU. Converts printed or handwritten equations into structured LaTeX or text representa... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 182 次。

如何安装 Formula Ocr?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install formula-ocr」即可一键安装,无需额外配置。

Formula Ocr 是免费的吗?

是的,Formula Ocr 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Formula Ocr 支持哪些平台?

Formula Ocr 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Formula Ocr?

由 mzlzyCA(@mzlzyca)开发并维护,当前版本 v0.4.0。

💬 留言讨论