← Back to Skills Marketplace
mzlzyca

Extract Formulas From Pdf

by mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0
cross-platform ✓ Security Clean
173
Downloads
0
Stars
0
Active Installs
6
Versions
Install in OpenClaw
/install extract-formulas-from-pdf
Description
Extract mathematical formulas and equations from PDF documents using MinerU. Identifies and converts formula content from academic papers, textbooks, and tec...
README (SKILL.md)

Extract Formulas From Pdf

Convert and extract content from .pdf using MinerU (mineru-open-api).

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Extract formulas from PDF (requires token)
mineru-open-api extract paper.pdf -o ./out/

# With VLM for better formula accuracy
mineru-open-api extract paper.pdf --model vlm -o ./out/

Authentication

Token required for extract and crawl:

mineru-open-api auth            # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token

Capabilities

  • Supports local files and URLs
  • Requires token (mineru-open-api auth or MINERU_TOKEN env)
  • Supported input: .pdf
  • Language hint with --language (default: ch, use en for English)
  • Page range with --pages (where applicable)

Notes

  • Formula recognition requires extract with token. Use --formula flag (enabled by default). Output is Markdown with LaTeX inline.
  • Output goes to stdout by default; use -o \x3Cdir> to save to file
  • Binary formats (docx) require -o flag (cannot stream to stdout)
  • All progress/status messages go to stderr
  • MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
Usage Guidance
This skill appears to do what it says: it calls the mineru-open-api CLI and requires a MINERU_TOKEN. Before installing, verify the npm package and the GitHub repo are the official MinerU sources, and prefer installing in a sandboxed environment if you are unsure. Understand that using the service will likely upload PDF contents to MinerU's servers — avoid sending confidential documents unless you trust the vendor and have reviewed their privacy/security policy. Keep the MINERU_TOKEN secret, revoke it if compromised, and consider limiting global installs (use a local or containerized install) and pinning to a specific release.
Capability Analysis
Type: OpenClaw Skill Name: extract-formulas-from-pdf Version: 0.4.0 The skill is a legitimate wrapper for the MinerU document intelligence engine by OpenDataLab. It facilitates PDF formula extraction via the 'mineru-open-api' tool, requiring a standard API token (MINERU_TOKEN). The installation instructions and usage patterns in SKILL.md align with the stated purpose and point to official repositories (GitHub/npm), with no evidence of malicious intent, data exfiltration, or prompt injection.
Capability Assessment
Purpose & Capability
Name/description, required binary (mineru-open-api), and required env var (MINERU_TOKEN) all align with a MinerU-based PDF formula extraction tool. Nothing requested is unrelated to the stated purpose.
Instruction Scope
SKILL.md only instructs using the mineru-open-api CLI (extract/auth), exporting MINERU_TOKEN, and CLI flags. It does not ask the agent to read unrelated files, other environment variables, or modify system configuration.
Install Mechanism
Install uses npm (mineru-open-api) or go install from the GitHub repo — both are common and expected for a CLI. Recommend verifying the npm package and GitHub repo are the official MinerU releases before installing globally.
Credentials
Only MINERU_TOKEN is required and is the primary credential — this is proportionate. However, the token grants access to the MinerU service and the CLI likely uploads PDF content to that service for processing, so treat the token and any uploaded documents as sensitive.
Persistence & Privilege
Skill does not request always:true, does not require system config paths, and is not installing any persistent agent-wide configuration. Autonomous invocation is allowed (platform default) but not a special privilege here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install extract-formulas-from-pdf
  3. After installation, invoke the skill by name or use /extract-formulas-from-pdf
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.1
SEO optimization v0.2.1
v0.2.0
SEO optimization v0.2.0
v1.0.1
Fix: declare MINERU_TOKEN credential in metadata
v1.0.0
Extract Formulas from PDF - extract mathematical formulas from PDF documents using MinerU. Use when
Metadata
Slug extract-formulas-from-pdf
Version 0.4.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 6
Frequently Asked Questions

What is Extract Formulas From Pdf?

Extract mathematical formulas and equations from PDF documents using MinerU. Identifies and converts formula content from academic papers, textbooks, and tec... It is an AI Agent Skill for Claude Code / OpenClaw, with 173 downloads so far.

How do I install Extract Formulas From Pdf?

Run "/install extract-formulas-from-pdf" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Extract Formulas From Pdf free?

Yes, Extract Formulas From Pdf is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Extract Formulas From Pdf support?

Extract Formulas From Pdf is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Extract Formulas From Pdf?

It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.

💬 Comments