← 返回 Skills 市场
cjboy007

Product Doc Reader

作者 Jaden's built a claw · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
144
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install ssa-product-doc-reader
功能描述
产品工程图纸结构化提取器 v5.0。pdftotext 优先 + Vision 兜底,支持软连字符清理/跨行关联/数据校验。专为 Farreach 线材产品图纸设计。
安全使用建议
What to consider before installing/running: - Expect this to run code: the package includes multiple Python scripts (extract_hybrid.py, extract_vision.py, batch_process_drive.py). Review those scripts before running. Search them for network calls (requests, urllib, openai/openrouter SDKs, googleapiclient, sockets) and any hardcoded endpoints. - Credentials mismatch: the docs mention Vision (Gemini/OpenRouter) and Google Drive operations but the manifest does not declare required API keys. If you run the scripts, they may look for API keys in your environment, local config, or attempt to prompt/use CLI tools — be cautious about providing secrets. - Sandbox first: run the tool offline or in a VM/container without network access (or with network blocked) to confirm local-only behavior when you only want pdftotext-based extraction. Then enable network only after you’ve inspected code and configured minimal, scoped credentials. - Inspect batch_process_drive.py carefully: it may upload or delete files on Drive. If you use it, ensure it targets a test Drive account or that you understand exactly what it will read/write. - If you need Vision fallback, prefer creating a dedicated, limited-scope API key/account and monitor outgoing traffic. Consider replacing or instrumenting the script calls to OpenRouter/OpenAI to log endpoints and data before sending. - If you’re not comfortable auditing Python code, ask the maintainer for a short summary of what external endpoints the scripts call, and for a minimal configuration example that shows required env vars and where outputs are sent. If you provide the content of scripts/extract_hybrid.py and batch_process_drive.py I can do a targeted review and point to exact lines that perform network or Drive/OpenRouter calls.
功能分析
Type: OpenClaw Skill Name: ssa-product-doc-reader Version: 1.0.0 The skill bundle contains significant security vulnerabilities, most notably a hardcoded OpenRouter API key in `scripts/extract_hybrid.py` and `scripts/extract_vision.py`, which is a critical credential leak. Furthermore, several scripts (including `scripts/extract_drawing.py` and `scripts/batch_599.py`) utilize `subprocess.run` to execute system commands with unsanitized file paths, posing a high risk of shell injection. While the code's functionality appears consistent with its stated purpose of extracting data from engineering drawings, the combination of hardcoded secrets, poor input sanitization, and the transmission of document data to third-party APIs (OpenRouter and Bailian) represents a high risk to the user's environment.
能力评估
Purpose & Capability
Name/description (PDF → structured JSON/MD) aligns with the included Python scripts and examples. Expectation of pdftotext / pdftoppm / optional Tesseract is coherent for the stated extraction job. However the skill also documents and includes code/notes that reference Vision API (Gemini via OpenRouter) and Google Drive batch processing — those capabilities legitimately require API access and credentials, yet no environment variables or credentials are declared in the metadata.
Instruction Scope
SKILL.md and DEVELOPMENT_SUMMARY explicitly mention Vision API fallback (Gemini 2.5 Flash / OpenRouter) and Drive search / batch upload logic. The runtime instructions show only running python3 scripts locally, but the scripts (e.g., batch_process_drive.py, extract_vision.py, extract_hybrid.py) almost certainly make network requests or call external services. SKILL.md does not specify where API keys should come from, nor does it constrain network behavior — that grants broad discretion to the bundled scripts and is scope-creeping compared to a purely local pdftotext-based extractor.
Install Mechanism
No install spec — skill is delivered as files + instructions and expects system binaries (python3, pdftoppm). That is lower-risk than an installer that downloads/extracts arbitrary code. The presence of several Python scripts means code will run when invoked; no remote installer URLs or archive extraction steps were provided in the manifest.
Credentials
The skill declares no required env vars, but multiple parts of the documentation and examples reference external services: Vision/OpenRouter and Google Drive. Those services require API keys or OAuth tokens (not declared as requires.env or primaryEnv). This is an inconsistency: the code likely expects credentials in the environment or local config but the manifest gives no explicit, least-privileged credential requirements. That increases the chance of accidental credential exposure or of the scripts attempting to prompt for/use credentials from unrelated places.
Persistence & Privilege
always is false and the skill is user-invocable. There is no request for permanent platform-wide presence. Nothing in the provided metadata indicates the skill will modify other skills or agent-wide settings. That is proportionate.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ssa-product-doc-reader
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ssa-product-doc-reader 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: Extract product specs from documents
元数据
Slug ssa-product-doc-reader
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Product Doc Reader 是什么?

产品工程图纸结构化提取器 v5.0。pdftotext 优先 + Vision 兜底,支持软连字符清理/跨行关联/数据校验。专为 Farreach 线材产品图纸设计。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 144 次。

如何安装 Product Doc Reader?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ssa-product-doc-reader」即可一键安装,无需额外配置。

Product Doc Reader 是免费的吗?

是的,Product Doc Reader 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Product Doc Reader 支持哪些平台?

Product Doc Reader 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Product Doc Reader?

由 Jaden's built a claw(@cjboy007)开发并维护,当前版本 v1.0.0。

💬 留言讨论