← Back to Skills Marketplace
Pdf To Markdown
by
BytesAgain2
· GitHub ↗
· v3.0.1
· MIT-0
947
Downloads
0
Stars
9
Active Installs
15
Versions
Install in OpenClaw
/install pdf-to-markdown
Description
Convert PDF files to Markdown with table recognition. Use when extracting text, converting tables, summarizing documents, fixing formatting issues.
Usage Guidance
This skill appears to be what it says: a local PDF→Markdown converter. Before installing or invoking it, review and accept that it will create ~/.pdf-to-markdown/, write output files and a history.log, and may log filenames/operations. It may require system tools (pdftotext/pdfinfo) or Python packages (PyPDF2, pdfminer.six) to be installed — the scripts print helpful error messages if missing. Note the SKILL.md examples are terse: you typically need to pass a PDF filename to commands (or set PDF_INPUT for some helpers). If you process sensitive documents, run the tool in a controlled environment, inspect the scripts yourself, and/or ensure the output/history location is secure or changed via PDF_TO_MARKDOWN_DIR.
Capability Analysis
Type: OpenClaw Skill
Name: pdf-to-markdown
Version: 3.0.1
The skill bundle provides a comprehensive set of tools for converting PDF files to Markdown, including table and metadata extraction. However, it is classified as suspicious due to multiple command injection vulnerabilities in `scripts/script.sh`. Specifically, functions like `_extract_with_python`, `_get_page_count`, and `cmd_info` expand the `$file` shell variable directly into Python heredocs without sanitization (using unquoted `<<PYEOF`), which could allow arbitrary code execution via maliciously crafted filenames. While these appear to be unintentional security flaws rather than deliberate malice, they pose a significant risk to the environment where the agent operates.
Capability Assessment
Purpose & Capability
The name/description (PDF → Markdown with table recognition) align with the shipped scripts: they extract PDF text (pdftotext or Python fallbacks), convert/format text, and write Markdown. Required resources are local files and optional Python libs; nothing requests unrelated cloud credentials or unrelated platform access.
Instruction Scope
SKILL.md directs the agent to run scripts/script.sh commands which map to the implemented operations. Minor inconsistency: the SKILL.md command list omits file/argument examples for most commands (script functions expect PDF file args or PDF_INPUT env var), and the repo contains an extra scripts/pdf.sh with its own CLI that also expects PDF_INPUT in some Python fragments — this creates ambiguity about how input should be provided but does not indicate malicious behavior.
Install Mechanism
No install spec — instruction-only with local shell scripts. The code may rely on system tools (pdftotext, pdfinfo) or Python packages (PyPDF2, pdfminer.six) but does not download arbitrary code or contact external URLs.
Credentials
No credentials or external environment variables are required; a single optional PDF_TO_MARKDOWN_DIR controls storage. The tool creates ~/.pdf-to-markdown, writes output, a config file, and a history log — users should note logs/config may contain file names or metadata from processed documents.
Persistence & Privilege
The skill is not configured always:true and does not modify other skills. It persists only its own config/history under the user's home directory, which is normal for a CLI utility.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install pdf-to-markdown - After installation, invoke the skill by name or use
/pdf-to-markdown - Provide required inputs per the skill's parameter spec and get structured output
Version History
v3.0.1
v3.0.1: SKILL.md from script help output
v3.0.0
v3.0.0: Real PDF to Markdown converter with pdftotext.
v2.0.1
update
v2.0.0
v2.5 standard: Use-when desc, homepage, source, security fix
v2.3.6
yaml-fix+quality
v2.3.5
yaml-fix+quality
v2.3.4
Quality upgrade
v2.3.3
Quality upgrade: custom functionality
v2.3.2
Standards compliance: unique content, no template text
v2.3.1
Quality update: docs, examples, standards compliance
v2.3.0
Quality fixes: removed third-party references, aligned docs with implementation
v2.2.0
Enhanced descriptions for better AI triggering
v1.0.2
Runtime fix
v1.0.1
Add runtime declaration + security compliance
v1.0.0
Initial release of PDF to Markdown:
- Convert PDF text to Markdown format, making it easy to copy and edit.
- Automatically recognize and convert tables from PDFs into Markdown.
- Extract key information and summarize PDF documents.
- Repair and enhance formatting during conversion.
- Compare content between different PDF documents.
Metadata
Frequently Asked Questions
What is Pdf To Markdown?
Convert PDF files to Markdown with table recognition. Use when extracting text, converting tables, summarizing documents, fixing formatting issues. It is an AI Agent Skill for Claude Code / OpenClaw, with 947 downloads so far.
How do I install Pdf To Markdown?
Run "/install pdf-to-markdown" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Pdf To Markdown free?
Yes, Pdf To Markdown is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Pdf To Markdown support?
Pdf To Markdown is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Pdf To Markdown?
It is built and maintained by BytesAgain2 (@ckchzh); the current version is v3.0.1.
More Skills