ckchzh

Pdf To Markdown

Author: BytesAgain2 · GitHub · v3.0.1 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 947
Favorites: 0
Current installs: 9
Versions: 15
Install in OpenClaw
/install pdf-to-markdown
Description
Convert PDF files to Markdown with table recognition. Use when extracting text, converting tables, summarizing documents, fixing formatting issues.
Security Recommendations
This skill appears to be what it says: a local PDF→Markdown converter. Before installing or invoking it, review and accept that it will create ~/.pdf-to-markdown/, write output files and a history.log, and may log filenames/operations. It may require system tools (pdftotext/pdfinfo) or Python packages (PyPDF2, pdfminer.six) to be installed — the scripts print helpful error messages if missing. Note the SKILL.md examples are terse: you typically need to pass a PDF filename to commands (or set PDF_INPUT for some helpers). If you process sensitive documents, run the tool in a controlled environment, inspect the scripts yourself, and/or ensure the output/history location is secure or changed via PDF_TO_MARKDOWN_DIR.
Functional Analysis
Type: OpenClaw Skill
Name: pdf-to-markdown
Version: 3.0.1
The skill bundle provides a comprehensive set of tools for converting PDF files to Markdown, including table and metadata extraction. However, it is classified as suspicious because of multiple command injection vulnerabilities in `scripts/script.sh`. Specifically, functions such as `_extract_with_python`, `_get_page_count`, and `cmd_info` embed Python code in heredocs with an unquoted delimiter (`<<PYEOF`), so the shell expands the `$file` variable directly into the Python source without sanitization. A maliciously crafted filename could therefore lead to arbitrary code execution. These appear to be unintentional security flaws rather than deliberate malice, but they pose a significant risk to the environment where the agent operates.
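One common way to avoid this class of bug is to quote the heredoc delimiter and pass the filename as a command-line argument, so the shell never splices it into the Python source. The following is a minimal sketch under that assumption; the function name `py_echo_path` is hypothetical and is not taken from `scripts/script.sh`.

```shell
#!/usr/bin/env bash
# Hypothetical helper for illustration; not the skill's actual code.
# Quoting the delimiter ('PYEOF') disables shell expansion inside the
# heredoc, and the filename reaches Python only through sys.argv.
py_echo_path() {
  local file="$1"
  python3 - "$file" <<'PYEOF'
import sys

path = sys.argv[1]  # plain data; never parsed as Python source
print(path)
PYEOF
}

# A name that would terminate a string literal if it were expanded
# into the Python source comes back unchanged.
py_echo_path 'report"); import os; os.system("id"); ("x.pdf'
```

With an unquoted `<<PYEOF`, the same filename would become part of the Python program text, and any `$(...)` in it would also be run by the shell before Python starts.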
Capability Assessment
Purpose & Capability
The name/description (PDF → Markdown with table recognition) align with the shipped scripts: they extract PDF text (pdftotext or Python fallbacks), convert/format text, and write Markdown. Required resources are local files and optional Python libs; nothing requests unrelated cloud credentials or unrelated platform access.
Instruction Scope
SKILL.md directs the agent to run scripts/script.sh commands which map to the implemented operations. Minor inconsistency: the SKILL.md command list omits file/argument examples for most commands (script functions expect PDF file args or PDF_INPUT env var), and the repo contains an extra scripts/pdf.sh with its own CLI that also expects PDF_INPUT in some Python fragments — this creates ambiguity about how input should be provided but does not indicate malicious behavior.
Install Mechanism
No install spec — instruction-only with local shell scripts. The code may rely on system tools (pdftotext, pdfinfo) or Python packages (PyPDF2, pdfminer.six) but does not download arbitrary code or contact external URLs.
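Because nothing is installed automatically, it can help to check for the optional dependencies before running the skill. The sketch below is an assumption about how one might do that, not part of the skill; the function names `missing_tools` and `missing_py_modules` are hypothetical. `PyPDF2` and `pdfminer` are the import names of the PyPDF2 and pdfminer.six packages.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check; not shipped with the skill.

# Print each named command-line tool that is not found on PATH.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Print each named Python module that cannot be located.
# The module name is passed as an argument, not pasted into the code.
missing_py_modules() {
  local mod
  for mod in "$@"; do
    python3 -c 'import importlib.util, sys
sys.exit(0 if importlib.util.find_spec(sys.argv[1]) else 1)' "$mod" \
      >/dev/null 2>&1 || printf '%s\n' "$mod"
  done
}

missing_tools pdftotext pdfinfo
missing_py_modules PyPDF2 pdfminer
```

An empty result means everything listed was found; any printed name is a candidate for installation through the system package manager or `pip`.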
Credentials
No credentials or external environment variables are required; a single optional PDF_TO_MARKDOWN_DIR controls storage. The tool creates ~/.pdf-to-markdown, writes output, a config file, and a history log — users should note logs/config may contain file names or metadata from processed documents.
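For sensitive documents, one option is to redirect the skill's storage to a private, throwaway directory. This is a minimal sketch that assumes the scripts honor `PDF_TO_MARKDOWN_DIR` as described in the review; the variable name `workdir` is illustrative, and how the scripts actually read the setting has not been verified here.

```shell
#!/usr/bin/env bash
# Create a temporary directory readable only by the current user
# and point the skill's storage location at it.
workdir="$(mktemp -d)"
chmod 700 "$workdir"   # mktemp -d is already private; made explicit here
export PDF_TO_MARKDOWN_DIR="$workdir"

# Assumption: output, config, and history.log then land here
# instead of ~/.pdf-to-markdown.
echo "Storage directory: $PDF_TO_MARKDOWN_DIR"
```

Deleting the directory after the session (`rm -rf "$PDF_TO_MARKDOWN_DIR"`) also removes any logged filenames.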
Persistence & Privilege
The skill is not configured always:true and does not modify other skills. It persists only its own config/history under the user's home directory, which is normal for a CLI utility.
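How to Use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install pdf-to-markdown
  3. Once installation finishes, invoke the Skill by name or trigger it with /pdf-to-markdown
  4. Provide the required inputs according to the Skill's parameter documentation to get structured output.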
Version History
v3.0.1
v3.0.1: SKILL.md from script help output
v3.0.0
v3.0.0: Real PDF to Markdown converter with pdftotext.
v2.0.1
update
v2.0.0
v2.5 standard: Use-when desc, homepage, source, security fix
v2.3.6
yaml-fix+quality
v2.3.5
yaml-fix+quality
v2.3.4
Quality upgrade
v2.3.3
Quality upgrade: custom functionality
v2.3.2
Standards compliance: unique content, no template text
v2.3.1
Quality update: docs, examples, standards compliance
v2.3.0
Quality fixes: removed third-party references, aligned docs with implementation
v2.2.0
Enhanced descriptions for better AI triggering
v1.0.2
Runtime fix
v1.0.1
Add runtime declaration + security compliance
v1.0.0
Initial release of PDF to Markdown:
- Convert PDF text to Markdown format, making it easy to copy and edit.
- Automatically recognize and convert tables from PDFs into Markdown.
- Extract key information and summarize PDF documents.
- Repair and enhance formatting during conversion.
- Compare content between different PDF documents.
Metadata
Slug pdf-to-markdown
Version 3.0.1
License MIT-0
Total installs 9
Current installs 9
Number of versions 15
FAQ

What is Pdf To Markdown?

Convert PDF files to Markdown with table recognition. Use when extracting text, converting tables, summarizing documents, fixing formatting issues. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 947 total downloads to date.

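How do I install Pdf To Markdown?

Run the command "/install pdf-to-markdown" in the OpenClaw or Claude Code chat box to install it in one step. The skill itself needs no extra configuration, although its scripts may rely on system tools (pdftotext, pdfinfo) or Python packages (PyPDF2, pdfminer.six) being present.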

Is Pdf To Markdown free?

Yes. Pdf To Markdown is completely free and released under the MIT-0 license; it can be downloaded, installed, and used freely.

Which platforms does Pdf To Markdown support?

Pdf To Markdown is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Pdf To Markdown?

It is developed and maintained by BytesAgain2 (@ckchzh); the current version is v3.0.1.
