← 返回 Skills 市场
mzlzyca

Extract Formulas From Pdf

作者 mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0
cross-platform ✓ 安全检测通过
173
总下载
0
收藏
0
当前安装
6
版本数
在 OpenClaw 中安装
/install extract-formulas-from-pdf
功能描述
Extract mathematical formulas and equations from PDF documents using MinerU. Identifies and converts formula content from academic papers, textbooks, and tec...
使用说明 (SKILL.md)

Extract Formulas From Pdf

Convert and extract content from .pdf using MinerU (mineru-open-api).

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Extract formulas from PDF (requires token)
mineru-open-api extract paper.pdf -o ./out/

# With VLM for better formula accuracy
mineru-open-api extract paper.pdf --model vlm -o ./out/

Authentication

Token required for extract and crawl:

mineru-open-api auth            # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token

Capabilities

  • Supports local files and URLs
  • Requires token (mineru-open-api auth or MINERU_TOKEN env)
  • Supported input: .pdf
  • Language hint with --language (default: ch, use en for English)
  • Page range with --pages (where applicable)

Notes

  • Formula recognition requires extract with token. Use --formula flag (enabled by default). Output is Markdown with LaTeX inline.
  • Output goes to stdout by default; use -o \x3Cdir> to save to file
  • Binary formats (docx) require -o flag (cannot stream to stdout)
  • All progress/status messages go to stderr
  • MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
安全使用建议
This skill appears to do what it says: it calls the mineru-open-api CLI and requires a MINERU_TOKEN. Before installing, verify the npm package and the GitHub repo are the official MinerU sources, and prefer installing in a sandboxed environment if you are unsure. Understand that using the service will likely upload PDF contents to MinerU's servers — avoid sending confidential documents unless you trust the vendor and have reviewed their privacy/security policy. Keep the MINERU_TOKEN secret, revoke it if compromised, and consider limiting global installs (use a local or containerized install) and pinning to a specific release.
功能分析
Type: OpenClaw Skill Name: extract-formulas-from-pdf Version: 0.4.0 The skill is a legitimate wrapper for the MinerU document intelligence engine by OpenDataLab. It facilitates PDF formula extraction via the 'mineru-open-api' tool, requiring a standard API token (MINERU_TOKEN). The installation instructions and usage patterns in SKILL.md align with the stated purpose and point to official repositories (GitHub/npm), with no evidence of malicious intent, data exfiltration, or prompt injection.
能力评估
Purpose & Capability
Name/description, required binary (mineru-open-api), and required env var (MINERU_TOKEN) all align with a MinerU-based PDF formula extraction tool. Nothing requested is unrelated to the stated purpose.
Instruction Scope
SKILL.md only instructs using the mineru-open-api CLI (extract/auth), exporting MINERU_TOKEN, and CLI flags. It does not ask the agent to read unrelated files, other environment variables, or modify system configuration.
Install Mechanism
Install uses npm (mineru-open-api) or go install from the GitHub repo — both are common and expected for a CLI. Recommend verifying the npm package and GitHub repo are the official MinerU releases before installing globally.
Credentials
Only MINERU_TOKEN is required and is the primary credential — this is proportionate. However, the token grants access to the MinerU service and the CLI likely uploads PDF content to that service for processing, so treat the token and any uploaded documents as sensitive.
Persistence & Privilege
Skill does not request always:true, does not require system config paths, and is not installing any persistent agent-wide configuration. Autonomous invocation is allowed (platform default) but not a special privilege here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install extract-formulas-from-pdf
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /extract-formulas-from-pdf 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.1
SEO optimization v0.2.1
v0.2.0
SEO optimization v0.2.0
v1.0.1
Fix: declare MINERU_TOKEN credential in metadata
v1.0.0
Extract Formulas from PDF - extract mathematical formulas from PDF documents using MinerU. Use when
元数据
Slug extract-formulas-from-pdf
版本 0.4.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 6
常见问题

Extract Formulas From Pdf 是什么?

Extract mathematical formulas and equations from PDF documents using MinerU. Identifies and converts formula content from academic papers, textbooks, and tec... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 173 次。

如何安装 Extract Formulas From Pdf?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install extract-formulas-from-pdf」即可一键安装,无需额外配置。

Extract Formulas From Pdf 是免费的吗?

是的,Extract Formulas From Pdf 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Extract Formulas From Pdf 支持哪些平台?

Extract Formulas From Pdf 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Extract Formulas From Pdf?

由 mzlzyCA(@mzlzyca)开发并维护,当前版本 v0.4.0。

💬 留言讨论