/install docling
Docling - Document & Web Content Extraction
CLI tool for parsing documents and web pages into clean, structured text. Uses GPU acceleration for OCR and ML models.
Prerequisites
doclingCLI must be installed (e.g., viapipx install docling)- For GPU support: NVIDIA GPU with CUDA drivers
When to Use
- Extract content from a URL → Use docling (not web_fetch)
- Search for information → Use web_search (Brave)
- Parse PDFs, DOCX, PPTX → Use docling
- OCR on images → Use docling
Quick Commands
Web Page → Markdown (default)
docling "\x3CURL>" --from html --to md
Output: creates a .md file in current directory (or use --output)
Web Page → Plain Text
docling "\x3CURL>" --from html --to text --output /tmp/docling_out
PDF with OCR
docling "/path/to/file.pdf" --ocr --device cuda --output /tmp/docling_out
Key Options
| Option | Values | Description |
|---|---|---|
--from |
html, pdf, docx, pptx, image, md, csv, xlsx | Input format |
--to |
md, text, json, yaml, html | Output format |
--device |
auto, cuda, cpu | Accelerator (default: auto) |
--output |
path | Output directory (recommended: use controlled temp dir) |
--ocr |
flag | Enable OCR for images/scanned PDFs |
--tables |
flag | Extract tables (default: on) |
Security Notes
⚠️ Avoid these flags unless you trust the source:
--enable-remote-services- can send data to remote endpoints--allow-external-plugins- loads third-party code- Custom
--headerswith untrusted values - can redirect requests
Workflow
- For web content extraction: Use
docling "\x3CURL>" --from html --to text --output /tmp/docling_out - Read the output file from the specified output directory
- Clean up the output directory after reading
GPU Support
Docling supports GPU acceleration via CUDA (NVIDIA). Verify CUDA is available:
python -c "import torch; print(torch.cuda.is_available())"
Full CLI Reference
See references/cli-reference.md for complete option list.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install docling - 安装完成后,直接呼叫该 Skill 的名称或使用
/docling触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Docling 是什么?
Extract and parse content from web pages, PDFs, documents (docx, pptx), and images using the docling CLI with GPU acceleration. Use INSTEAD of web_fetch for extracting content from specific URLs when you need clean, structured text. Use Brave (web_search) for searching/discovering pages. Use docling when you HAVE a URL and need its content parsed. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1367 次。
如何安装 Docling?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install docling」即可一键安装,无需额外配置。
Docling 是免费的吗?
是的,Docling 完全免费(开源免费),可自由下载、安装和使用。
Docling 支持哪些平台?
Docling 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Docling?
由 Er3mit4(@er3mit4)开发并维护,当前版本 v1.0.2。