PDF Processor

by reaperchen · GitHub ↗ · v2.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 271 · Stars: 0 · Active Installs: 4 · Versions: 1
Install in OpenClaw
/install pdf-processor
Description
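Academic PDF processing: extracts text, detects the language, translates (English → Chinese), and generates a 200-character summary written entirely in Chinese. It uses a local Ollama model and consumes no online API quota. Suited to academic papers, research reports, and similar PDF files. Use it when the user says "处理PDF" (process PDF), "翻译论文" (translate paper), or "生成论文概述" (generate paper summary), or when the user places a PDF in the paper-processing directory.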
Usage Guidance
This skill appears to do what it claims: it extracts text from PDFs, calls a local Ollama model to translate and summarize, and organizes results under ~/Documents/论文处理. Before running:
  1. Confirm that Ollama and the qwen2.5:7b model are installed locally and that you are comfortable starting a local Ollama process.
  2. Inspect scripts/process_pdf.py and scripts/generate_index.py (both are included), and consider running them on non-sensitive PDFs first.
  3. Note the metadata mismatch: package.json lists python3, pdfplumber, and ollama, while the registry metadata lists none. Install the required Python packages (pdfplumber, requests) in a controlled environment (virtualenv or container) to avoid affecting the system Python.
  4. Back up any sensitive PDFs you do not want moved or deleted, because the script moves PDF files and deletes temporary extraction files.
  5. Run the tool locally (the scripts make no network calls other than to localhost), and consider limiting its filesystem access by running it in a dedicated directory or container if you want extra containment.
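The first check above (Ollama running locally with the qwen2.5:7b model pulled) can be done before touching any PDF. The following is a minimal sketch, not part of the skill: it assumes Ollama's default local port and its /api/tags model-listing endpoint, and the function names model_available and check_ollama are hypothetical. It uses only the Python standard library so it can run outside the skill's virtualenv.

```python
import json
import urllib.request

# Assumption: Ollama is listening on its default local port.
OLLAMA_TAGS_URL = "http://localhost:11434/api/tags"
REQUIRED_MODEL = "qwen2.5:7b"


def model_available(tags_payload: dict, model: str) -> bool:
    """Return True if `model` is listed in an Ollama /api/tags response."""
    names = {entry.get("name") for entry in tags_payload.get("models", [])}
    return model in names


def check_ollama(url: str = OLLAMA_TAGS_URL, model: str = REQUIRED_MODEL) -> bool:
    """Ask the local Ollama server whether `model` has been pulled."""
    try:
        with urllib.request.urlopen(url, timeout=5) as resp:
            payload = json.load(resp)
    except (OSError, ValueError):
        # Connection refused, timeout, or a non-JSON reply:
        # treat all of these as "Ollama is not ready".
        return False
    return model_available(payload, model)
```

Calling check_ollama() returns False both when the server is not running and when the model is missing, so a False result means "run ollama serve and ollama pull qwen2.5:7b, then retry".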
Capability Analysis
Type: OpenClaw Skill
Name: pdf-processor
Version: 2.0.0
The pdf-processor skill bundle is a legitimate tool designed for local academic PDF processing using the Ollama framework. It provides a complete workflow for text extraction, language detection, translation, and summary generation, as evidenced in scripts/process_pdf.py and scripts/generate_index.py. The code utilizes standard libraries (pdfplumber, requests) and performs file management tasks (moving processed files, managing temporary progress JSONs) that are strictly aligned with its stated purpose. No evidence of data exfiltration, unauthorized network access, or malicious prompt injection was found.
Capability Assessment
Purpose & Capability
Overall coherent: the name and description (PDF extraction, translation, and summarization using local Ollama) match the included scripts, which call a local Ollama HTTP API and read/write files under a user Documents directory. Minor inconsistency: the registry metadata at the top lists no required binaries/env, while package.json's openclaw.requires lists 'python3' and 'pdfplumber' and the service 'ollama'. Note that 'pdfplumber' is a Python package (not a system binary), and package.json also includes install steps. This appears to be a packaging/metadata mismatch rather than a functional red flag.
Instruction Scope
SKILL.md and scripts limit activity to PDF text extraction, local model calls (http://localhost:11434), progress files, and moving files within ~/Documents/论文处理; instructions do not attempt to read unrelated system paths or exfiltrate data to remote servers. The scripts do start 'ollama serve' locally and make HTTP requests to localhost only.
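To make the "localhost only" property concrete, here is a hedged sketch of how a call to the local model could be built so that it refuses any non-local endpoint. It is not the code in scripts/process_pdf.py (which, per the analysis, uses requests); it uses the standard library instead, assumes Ollama's /api/generate endpoint with a non-streaming request, and the names is_local, build_request, and generate are hypothetical.

```python
import json
import urllib.request
from urllib.parse import urlparse

# Assumption: Ollama's default local generate endpoint.
OLLAMA_URL = "http://localhost:11434/api/generate"
LOCAL_HOSTS = {"localhost", "127.0.0.1", "::1"}


def is_local(url: str) -> bool:
    """Return True only if the URL's host is a loopback name or address."""
    return urlparse(url).hostname in LOCAL_HOSTS


def build_request(prompt: str, model: str = "qwen2.5:7b",
                  url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a POST request for the local model, rejecting remote hosts."""
    if not is_local(url):
        raise ValueError(f"refusing non-local endpoint: {url}")
    body = json.dumps({"model": model, "prompt": prompt, "stream": False})
    return urllib.request.Request(
        url,
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "qwen2.5:7b") -> str:
    """Send the prompt to the local Ollama server and return its text reply."""
    req = build_request(prompt, model)
    with urllib.request.urlopen(req, timeout=300) as resp:
        return json.load(resp)["response"]
```

When auditing the bundled scripts, the equivalent thing to look for is every URL passed to requests: each one should resolve to localhost:11434 and nothing else.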
Install Mechanism
This is instruction-only in the skill bundle (no remote downloads). package.json documents Python dependency installation and manual instructions to install Ollama from the official site — no suspicious external URLs or automatic archive downloads are present.
Credentials
The skill requests no environment variables or credentials. The scripts operate against a local HTTP service and the user's home Documents directories only, which is proportionate to the stated purpose.
Persistence & Privilege
always is false and the skill does not request elevated privileges or attempt to alter other skills or global agent settings. It creates and deletes progress files only within its working directory.
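The progress files mentioned here are what make resumable processing possible. The sketch below illustrates the general pattern only: the file layout (a JSON object with done_chunks and translated) and the function names are assumptions for illustration, not the schema used by scripts/process_pdf.py.

```python
import json
from pathlib import Path


def load_progress(path: Path) -> dict:
    """Load saved progress, or start fresh if no progress file exists."""
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    return {"done_chunks": 0, "translated": []}


def save_progress(path: Path, progress: dict) -> None:
    """Write progress to a temp file, then rename it over the real one."""
    tmp = path.with_suffix(path.suffix + ".tmp")
    tmp.write_text(json.dumps(progress, ensure_ascii=False), encoding="utf-8")
    tmp.replace(path)


def process(chunks: list, translate, progress_path: Path) -> list:
    """Translate chunks one at a time, resuming after the last saved chunk."""
    progress = load_progress(progress_path)
    for i in range(progress["done_chunks"], len(chunks)):
        progress["translated"].append(translate(chunks[i]))
        progress["done_chunks"] = i + 1
        save_progress(progress_path, progress)
    # All chunks finished: the progress file is no longer needed.
    progress_path.unlink(missing_ok=True)
    return progress["translated"]
```

Saving after every chunk means an interrupted run loses at most one chunk of work, and deleting the file on success matches the behavior described above, where progress files are created and removed only inside the working directory.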
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install pdf-processor
  3. After installation, invoke the skill by name or use /pdf-processor
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
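New in v2.0: real-time progress display, resumable processing after interruption, index generation, and serial translation optimization.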
Metadata
Slug pdf-processor
Version 2.0.0
License MIT-0
All-time Installs 4
Active Installs 4
Total Versions 1
Frequently Asked Questions

What is PDF Processor?

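PDF Processor handles academic PDFs: it extracts text, detects the language, translates English into Chinese, and generates a 200-character summary written entirely in Chinese. It uses a local Ollama model and consumes no online API quota, and it is suited to academic papers, research reports, and similar PDF files. It is an AI Agent Skill for Claude Code / OpenClaw, with 271 downloads so far.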

How do I install PDF Processor?

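Run "/install pdf-processor" in the OpenClaw or Claude Code chat to install the skill in one step. To actually process PDFs, you also need a local Ollama installation with the qwen2.5:7b model and the Python packages pdfplumber and requests.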

Is PDF Processor free?

Yes, PDF Processor is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does PDF Processor support?

PDF Processor is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created PDF Processor?

It is built and maintained by reaperchen (@reaperchen); the current version is v2.0.0.
