← 返回 Skills 市场
wu-uk

marker

作者 wu-uk · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ✓ 安全检测通过
80
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install latex-formula-extraction-marker
功能描述
Convert PDF documents to Markdown using marker_single. Use when Claude needs to extract text content from PDFs while preserving LaTeX formulas, equations, an...
使用说明 (SKILL.md)

Marker PDF-to-Markdown Converter

Convert PDFs to Markdown while preserving LaTeX formulas and document structure. Uses the marker_single CLI from the marker-pdf package.

Dependencies

  • marker_single on PATH (pip install marker-pdf if missing)
  • Python 3.10+ (available in the task image)

Quick Start

from scripts.marker_to_markdown import pdf_to_markdown

markdown_text = pdf_to_markdown("paper.pdf")
print(markdown_text)

Python API

  • pdf_to_markdown(pdf_path, *, timeout=600, cleanup=True) -> str
    • Runs marker_single --output_format markdown --disable_image_extraction
    • cleanup=True: use a temp directory and delete after reading the Markdown
    • cleanup=False: keep outputs in \x3Cpdf_stem>_marker/ next to the PDF
    • Exceptions: FileNotFoundError if the PDF is missing, RuntimeError for marker failures, TimeoutError if it exceeds the timeout
  • Tips: bump timeout for large PDFs; set cleanup=False to inspect intermediate files

Command-Line Usage

# Basic conversion (prints markdown to stdout)
python scripts/marker_to_markdown.py paper.pdf

# Keep temporary files
python scripts/marker_to_markdown.py paper.pdf --keep-temp

# Custom timeout
python scripts/marker_to_markdown.py paper.pdf --timeout 600

Output Locations

  • cleanup=True: outputs stored in a temporary directory and removed automatically
  • cleanup=False: outputs saved to \x3Cpdf_stem>_marker/; markdown lives at \x3Cpdf_stem>_marker/\x3Cpdf_stem>/\x3Cpdf_stem>.md when present (otherwise the first .md file is used)

Troubleshooting

  • marker_single not found: install marker-pdf or ensure the CLI is on PATH
  • No Markdown output: re-run with --keep-temp/cleanup=False and check stdout/stderr saved in the output folder
安全使用建议
This skill is a thin wrapper around the external marker_single CLI (marker-pdf). The included Python is straightforward and only runs marker_single on a local PDF and reads resulting .md files. Before using: (1) ensure you trust the marker-pdf package you install (inspect its source or install from a reputable PyPI package), since the external CLI will be executed on your machine; (2) run it on non-sensitive documents first or in a sandbox if you have concerns about third-party tools; and (3) remember cleanup=False will leave output files next to the PDF. No credentials or network endpoints are required by the skill itself.
功能分析
Type: OpenClaw Skill Name: latex-formula-extraction-marker Version: 0.1.0 The skill is a straightforward wrapper for the legitimate 'marker-pdf' library to convert PDF documents into Markdown. The implementation in 'scripts/marker_to_markdown.py' follows security best practices by using list-based subprocess calls to prevent shell injection and standard temporary directory handling for cleanup, with no evidence of malicious intent or data exfiltration.
能力评估
Purpose & Capability
Name/description match the included code and instructions. The script only requires the marker_single CLI (marker-pdf) and operates on local PDF files; there are no unrelated dependencies or credentials.
Instruction Scope
SKILL.md and the script limit operations to running marker_single, reading its output, and optionally writing output folders next to the input PDF or a temp dir. The instructions do not ask for unrelated files, system config, or network endpoints.
Install Mechanism
No install spec is provided (instruction-only). The README suggests installing marker-pdf via pip, which is proportional. Nothing is fetched or extracted by the skill itself.
Credentials
The skill requires no environment variables, credentials, or config paths. Its access model (local filesystem for input PDF and optional output directory) is appropriate for its purpose.
Persistence & Privilege
The skill does not request always:true, does not modify other skills or global agent settings, and does not persist credentials. It only writes output files when requested or when cleanup=False.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install latex-formula-extraction-marker
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /latex-formula-extraction-marker 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.0
Bulk publish from all-task-skills-dedup
元数据
Slug latex-formula-extraction-marker
版本 0.1.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

marker 是什么?

Convert PDF documents to Markdown using marker_single. Use when Claude needs to extract text content from PDFs while preserving LaTeX formulas, equations, an... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 80 次。

如何安装 marker?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install latex-formula-extraction-marker」即可一键安装,无需额外配置。

marker 是免费的吗?

是的,marker 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

marker 支持哪些平台?

marker 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 marker?

由 wu-uk(@wu-uk)开发并维护,当前版本 v0.1.0。

💬 留言讨论