← Back to Skills Marketplace

Pdf Vocab Audio

Name: Pdf Vocab Audio
Author: effeceee

by effeceee · GitHub ↗ · v2.0.0 · MIT-0

cross-platform ⚠ suspicious

144

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install pdf-vocab-audio

Description

从 PDF 提取词汇生成朗读音频，每个词组读两遍

Usage Guidance

This skill is mostly coherent for generating TTS from a PDF, but two issues deserve attention before installing: (1) The code uses a hardcoded fallback directory (/root/.openclaw/media/inbound) when you don't pass a PDF path — if you don't want the skill to read agent/internal files, always invoke it with an explicit PDF path. (2) The extraction regex does not actually strip Chinese translations as the documentation states, which may produce mixed-language audio. If you plan to use it, run it in an isolated environment first, verify the outputs on sample PDFs, and consider patching the extraction logic (e.g., capture only [A-Za-z ]+ at line start or split on non‑ASCII) and changing the default inbound path to a safer location. Also ensure edge-tts and ffmpeg are installed from trusted sources.

Capability Analysis

Type: OpenClaw Skill Name: pdf-vocab-audio Version: 2.0.0 The skill bundle is a legitimate utility for extracting English vocabulary from PDF files and generating audio recordings using edge-tts and ffmpeg. The Python script (pdf_vocab_audio.py) follows security best practices by using parameterized subprocess calls to prevent shell injection, validating dependencies with shutil.which, and utilizing temporary directories for intermediate file processing. No evidence of data exfiltration, malicious execution, or prompt injection was found.

Capability Assessment

ℹ Purpose & Capability

Dependencies (pymupdf for PDF parsing, edge-tts for TTS, ffmpeg for silence/concat) are appropriate for the stated purpose. Requiring edge-tts and ffmpeg is coherent. The SKILL.md install metadata includes pip packages that match runtime needs.

⚠ Instruction Scope

SKILL.md promises extracting "only the English part" from each line, but the implemented regex (re.match(r'^([a-zA-Z].*)$')) will capture the entire line starting with a letter — including trailing Chinese — so the code does not actually strip translations as described. Also the script falls back to a hardcoded directory '/root/.openclaw/media/inbound' when no PDF path is supplied; that is an internal path and may give the skill access to files placed in agent/system-owned locations. These two issues are scope mismatches between the spec and the implementation and can lead to surprising behavior or data exposure.

✓ Install Mechanism

This is an instruction-only skill with no external download URLs or extract steps. The SKILL.md lists pip packages (edge-tts, pymupdf) which align with the code. No high-risk installers or remote archives are used.

ℹ Credentials

The skill requests no environment variables or credentials, which is appropriate. However, it defaults to reading from a specific internal path (/root/.openclaw/media/inbound) if no PDF is provided — this implicit file access could expose internal files if users don't explicitly pass a path.

✓ Persistence & Privilege

The skill does not request permanent/autonomous enrollment (always:false) and does not modify other skills or system settings. It runs as a one-off script and uses temporary directories for intermediate work; final outputs are written to /tmp by default.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install pdf-vocab-audio
After installation, invoke the skill by name or use /pdf-vocab-audio
Provide required inputs per the skill's parameter spec and get structured output

Version History

v2.0.0

v2.0

v1.2.0

v1.2.0: 修复concat_list.txt路径安全问题

v1.1.0

v1.1.0安全性修复: 添加依赖检查、输入验证、超时保护

v1.0.0

Initial release - PDF词汇朗读音频生成器，支持Apache 2.0 License

Metadata

Slug pdf-vocab-audio

Version 2.0.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 4

Frequently Asked Questions

What is Pdf Vocab Audio?

从 PDF 提取词汇生成朗读音频，每个词组读两遍. It is an AI Agent Skill for Claude Code / OpenClaw, with 144 downloads so far.

How do I install Pdf Vocab Audio?

Run "/install pdf-vocab-audio" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Pdf Vocab Audio free?

Yes, Pdf Vocab Audio is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Pdf Vocab Audio support?

Pdf Vocab Audio is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Pdf Vocab Audio?

It is built and maintained by effeceee (@effeceee); the current version is v2.0.0.

More Skills