← 返回 Skills 市场
177
总下载
0
收藏
0
当前安装
2
版本数
在 OpenClaw 中安装
/install mineru-document-explorer
功能描述
REQUIRED for any task involving reading or understanding PDF contents. Use when a user asks about a .pdf file — reading pages, answering questions, extractin...
安全使用建议
What to consider before installing/running this skill:
- Functionally this looks like a legitimate PDF explorer CLI + optional local server. It will install a Python package and many dependencies from the bundled source; run the included scripts/setup.sh only inside an isolated Python virtual environment or container to avoid touching system Python packages (the script uses --break-system-packages which can alter system site-packages).
- The skill can run a FastAPI server (default host 127.0.0.1) and accepts uploads and networked OCR/embedding requests. Before running the server, set a non-empty server_api_key in the config or provide a strong API key via env var, and keep the server bound to localhost unless you intentionally want external access.
- Optional API keys (MinerU / PageIndex / embedder / reranker / extractor) enable cloud features and must be provided by you. Only supply keys you trust the code to use; keys will be written into the skill's config.yaml if you configure them via the setup prompt. If you do not need cloud OCR or external models, skip entering those keys to keep processing local.
- The skill will download PDFs when given http/https URLs and store files under its cache root inside the skill directory. Treat that like any code that downloads files: avoid pointing it at untrusted URLs, and inspect downloaded PDFs before reusing them.
- Review the bundled requirements.txt and code (server.py, pdf_utils.py, config.py) if you have concerns about network calls, data retention, or where cached files are stored. If you want minimal risk, run the tool in a disposable VM or container, with no external API keys provided.
If you'd like, I can point out the exact lines that: (a) read MINERU_TOKEN and other env vars, (b) perform downloads of remote PDFs, and (c) start the FastAPI server, or produce a short checklist for safely running setup.sh.
功能分析
Type: OpenClaw Skill
Name: mineru-document-explorer
Version: 1.0.1
The MinerU Document Explorer is a comprehensive PDF analysis toolkit that provides OCR, semantic search, and hierarchical document outlining. The bundle includes a CLI tool (`doc-search`), a FastAPI server, and various backends for integrating with MinerU and OpenAI-compatible APIs. While the skill requires network access and handles sensitive credentials (API keys), its behavior is entirely consistent with its stated purpose of document exploration. The `setup.sh` script performs standard installation tasks, and the `SKILL.md` instructions guide the agent to use the tool efficiently without any evidence of malicious prompt injection or unauthorized data exfiltration. The default external endpoint (staging.mineru.org.cn) is aligned with the MinerU service branding.
能力评估
Purpose & Capability
The name/description promise a PDF reading/explorer utility and the repository contains a complete 'doc-search' CLI, FastAPI server, OCR/embedding/reranker/backends, client, and utilities that match that purpose. Optional integrations (MinerU OCR, PageIndex, embedder, reranker, AgenticOCR) are expected for a full-featured document explorer.
Instruction Scope
SKILL.md and code instruct the agent to run a bundled setup script, optionally launch a local FastAPI server, download PDFs from arbitrary URLs, and write user-provided API keys into the bundled config.yaml. It also mandates appending short notes to references/tips.md after tasks (i.e., modifying files in the skill bundle). These behaviors are coherent with a local CLI/server tool, but they involve filesystem writes and optional network calls that you should be aware of.
Install Mechanism
There's no formal registry 'install' spec but a provided scripts/setup.sh installs the bundled package via pip -e with the flag --break-system-packages. That will install many third-party dependencies (fastapi, uvicorn, openai, mineru-open-sdk, etc.) from requirements.txt into the Python environment; using --break-system-packages and editable installs can affect the system Python environment. The install is local-source based (no remote arbitrary URL downloads), which is expected, but you should run it in an isolated venv/container.
Credentials
Registry metadata declared no required env vars, but SKILL.md lists several optional credentials (DOC_SEARCH_PAGEINDEX_API_KEY, DOC_SEARCH_PAGEINDEX_BASE_URL, DOC_SEARCH_EMBEDDING_BASE_URL, DOC_SEARCH_RERANKER_BASE_URL, MINERU_TOKEN) and the code reads DOC_SEARCH_* env vars and MINERU_TOKEN. These are relevant to the optional networked features. The mismatch between 'no required env vars' in registry metadata and the SKILL.md's optional credentials is worth noting but not necessarily malicious.
Persistence & Privilege
The skill is not always-enabled and does not request platform-wide privileges. It will write state/config files into its own SKILL_DIR (config-state.json, config.yaml copy, caches) and can run a local HTTP server by design (default host is 127.0.0.1). Those are normal for a local CLI/server tool, but if you run the server you should ensure it is bound to localhost and protected by server_api_key before exposing beyond your machine.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install mineru-document-explorer - 安装完成后,直接呼叫该 Skill 的名称或使用
/mineru-document-explorer触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
- Added optional credential fields for external services (PageIndex, embedding/reranker APIs, MinerU cloud).
- Enabled optional network capabilities with external API calls and FastAPI server support; all network features are opt-in and disabled by default.
- Updated documentation to clarify network functionality and new credential options.
- No code or file structure changes.
v1.0.0
mineru-document-explorer 1.0.0
- Initial release of the PDF reading toolkit using the `doc-search` CLI.
- Designed for any task involving reading or understanding PDF contents (search, QA, extraction, topic location).
- Strictly requires searching to narrow down content before reading—never full-document scans.
- Provides setup instructions for optional PageIndex configuration to auto-generate outlines.
- Mandates checking included reference docs before running any command or handling uncertainties.
- Summarizes all CLI commands and key usage reminders, emphasizing best practices and lessons capture.
元数据
常见问题
MinerU Document Explorer 是什么?
REQUIRED for any task involving reading or understanding PDF contents. Use when a user asks about a .pdf file — reading pages, answering questions, extractin... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 177 次。
如何安装 MinerU Document Explorer?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install mineru-document-explorer」即可一键安装,无需额外配置。
MinerU Document Explorer 是免费的吗?
是的,MinerU Document Explorer 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
MinerU Document Explorer 支持哪些平台?
MinerU Document Explorer 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 MinerU Document Explorer?
由 Rise-1210(@rise-1210)开发并维护,当前版本 v1.0.1。
推荐 Skills