Pilot Document Processing Setup
/install pilot-document-processing-setup
Document Processing Setup
Deploy 3 agents that automate document ingestion, data extraction, and search indexing.
Roles
| Role | Hostname | Skills | Purpose |
|---|---|---|---|
| ingester | \x3Cprefix>-ingester |
pilot-stream-data, pilot-share, pilot-archive | Accepts documents, converts to processable format |
| extractor | \x3Cprefix>-extractor |
pilot-task-router, pilot-dataset, pilot-receipt | Extracts structured data — tables, entities, amounts |
| indexer | \x3Cprefix>-indexer |
pilot-webhook-bridge, pilot-announce, pilot-metrics | Indexes data for search, publishes to downstream systems |
Setup Procedure
Step 1: Ask the user which role this agent should play and what prefix to use.
Step 2: Install the skills for the chosen role:
# ingester:
clawhub install pilot-stream-data pilot-share pilot-archive
# extractor:
clawhub install pilot-task-router pilot-dataset pilot-receipt
# indexer:
clawhub install pilot-webhook-bridge pilot-announce pilot-metrics
Step 3: Set the hostname:
pilotctl --json set-hostname \x3Cprefix>-\x3Crole>
Step 4: Write the setup manifest:
mkdir -p ~/.pilot/setups
cat > ~/.pilot/setups/document-processing.json \x3C\x3C 'MANIFEST'
\x3CUSE ROLE TEMPLATE BELOW>
MANIFEST
Step 5: Tell the user to initiate handshakes with direct communication peers.
Manifest Templates Per Role
ingester
{"setup":"document-processing","setup_name":"Document Processing","role":"ingester","role_name":"Document Ingester","hostname":"\x3Cprefix>-ingester","description":"Accepts documents (PDF, DOCX, images) via upload or webhook, converts to processable format.","skills":{"pilot-stream-data":"Stream raw document bytes to extractor for processing.","pilot-share":"Share converted document files with extractor.","pilot-archive":"Archive original documents for audit and reprocessing."},"peers":[{"role":"extractor","hostname":"\x3Cprefix>-extractor","description":"Receives raw documents for data extraction"}],"data_flows":[{"direction":"send","peer":"\x3Cprefix>-extractor","port":1002,"topic":"raw-document","description":"Raw documents in processable format"}],"handshakes_needed":["\x3Cprefix>-extractor"]}
extractor
{"setup":"document-processing","setup_name":"Document Processing","role":"extractor","role_name":"Data Extractor","hostname":"\x3Cprefix>-extractor","description":"Pulls structured data from documents — tables, key-value pairs, entities, dates, amounts.","skills":{"pilot-task-router":"Route documents to specialized extractors by type (invoice, contract, form).","pilot-dataset":"Store extraction results and training data for accuracy improvement.","pilot-receipt":"Confirm document receipt and report extraction status."},"peers":[{"role":"ingester","hostname":"\x3Cprefix>-ingester","description":"Sends raw documents"},{"role":"indexer","hostname":"\x3Cprefix>-indexer","description":"Receives extracted structured data"}],"data_flows":[{"direction":"receive","peer":"\x3Cprefix>-ingester","port":1002,"topic":"raw-document","description":"Raw documents in processable format"},{"direction":"send","peer":"\x3Cprefix>-indexer","port":1002,"topic":"extracted-data","description":"Extracted structured data as JSON"}],"handshakes_needed":["\x3Cprefix>-ingester","\x3Cprefix>-indexer"]}
indexer
{"setup":"document-processing","setup_name":"Document Processing","role":"indexer","role_name":"Search Indexer","hostname":"\x3Cprefix>-indexer","description":"Indexes extracted data for search, builds document summaries, publishes to downstream systems.","skills":{"pilot-webhook-bridge":"Push index events and summaries to downstream APIs and search engines.","pilot-announce":"Broadcast new document availability to interested subscribers.","pilot-metrics":"Track indexing throughput, search latency, and document counts."},"peers":[{"role":"extractor","hostname":"\x3Cprefix>-extractor","description":"Sends extracted structured data"}],"data_flows":[{"direction":"receive","peer":"\x3Cprefix>-extractor","port":1002,"topic":"extracted-data","description":"Extracted structured data as JSON"},{"direction":"send","peer":"external","port":443,"topic":"index-notification","description":"Index notifications to downstream systems"}],"handshakes_needed":["\x3Cprefix>-extractor"]}
Data Flows
ingester -> extractor: raw-document events (port 1002)extractor -> indexer: extracted-data events (port 1002)indexer -> downstream: index notifications via webhook (port 443)
Handshakes
# ingester \x3C-> extractor:
pilotctl --json handshake \x3Cprefix>-extractor "setup: document-processing"
pilotctl --json handshake \x3Cprefix>-ingester "setup: document-processing"
# extractor \x3C-> indexer:
pilotctl --json handshake \x3Cprefix>-indexer "setup: document-processing"
pilotctl --json handshake \x3Cprefix>-extractor "setup: document-processing"
Workflow Example
# On extractor — subscribe to raw documents:
pilotctl --json subscribe \x3Cprefix>-ingester raw-document
# On indexer — subscribe to extracted data:
pilotctl --json subscribe \x3Cprefix>-extractor extracted-data
# On ingester — publish a document:
pilotctl --json publish \x3Cprefix>-extractor raw-document '{"filename":"invoice-2024-003.pdf","type":"pdf","pages":2}'
# On extractor — publish extracted data:
pilotctl --json publish \x3Cprefix>-indexer extracted-data '{"filename":"invoice-2024-003.pdf","vendor":"Acme Corp","amount":12500.00}'
Dependencies
Requires pilot-protocol skill, pilotctl binary, clawhub binary, and a running daemon.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install pilot-document-processing-setup - 安装完成后,直接呼叫该 Skill 的名称或使用
/pilot-document-processing-setup触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Pilot Document Processing Setup 是什么?
Deploy a document processing pipeline with 3 agents that automate ingestion, data extraction, and search indexing. Use this skill when: 1. User wants to set... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 68 次。
如何安装 Pilot Document Processing Setup?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install pilot-document-processing-setup」即可一键安装,无需额外配置。
Pilot Document Processing Setup 是免费的吗?
是的,Pilot Document Processing Setup 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Pilot Document Processing Setup 支持哪些平台?
Pilot Document Processing Setup 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Pilot Document Processing Setup?
由 Calin Teodor(@teoslayer)开发并维护,当前版本 v1.0.0。