← Back to Skills Marketplace
jeane-li

laiye-doc-processing

by Jeane-li · GitHub ↗ · v1.10.3 · MIT-0
cross-platform ⚠ suspicious
531
Downloads
4
Stars
0
Active Installs
5
Versions
Install in OpenClaw
/install laiye-doc-processing
Description
Enables AI-powered parsing and key information extraction from high-frequency documents including invoices, orders, receipts, long texts, and common Chinese...
README (SKILL.md)

\r

agentic-doc-parse-and-extract Skill\r

\r agentic-doc-parse-and-extract is an official command-line tool released by Laiye Technology's ADP (Agentic Document Processing) product, enabling both humans and AI agents to invoke ADP capabilities in the terminal for document parsing and extraction.\r \r

Quick Start Guide for AI Agents\r

\r

Core Workflow\r

  1. Install dependencies: On first execution, install the ADP CLI tool and dependencies by following the instructions in references/examples.md.\r
  2. Discover commands: Run adp schema to get the machine-readable JSON spec of all commands, parameters, types, and defaults.\r
  3. Authentication: On first execution, run adp config get to verify credentials. If no valid configuration exists, prompt the user to provide an API Key.\r
  4. Check Application: On first execution, retrieve the application list via adp app-id list. For subsequent executions, prioritize adp app-id cache (cached in context). If the cache is unavailable, refresh it by calling adp app-id list again.\r
  5. Execute: Run adp extract url \x3CURL> --app-id \x3CID> or adp parse url \x3CURL> --app-id \x3CID>.\r
  6. Query: Check results asynchronously with adp extract query \x3Ctask_id> or adp parse query \x3Ctask_id>.\r
  7. Error handling: When a command fails, parse the stderr JSON to determine error type and recovery action. See references/error-handling.md.\r \r

Common Scenarios → Command Mapping\r

| User Intent | Recommended Command | Handling Rules |\r | :---------- | :------------------ | :------------- |\r | - Read full document content\x3Cbr>- Parse layout & structure\x3Cbr>- Convert document to text\x3Cbr>- Process / analyze full document | adp parse | - Sync processing for small files\x3Cbr>- Async processing (--async parameter) for files >20MB or >200 pages |\r | - Extract key fields (amount, date, name, ID, etc.)\x3Cbr>- Output structured results (JSON/table) | adp extract | - Use Extract directly, no need to parse first\x3Cbr>- Use matched existing app\x3Cbr>- Create a custom extraction app if the document type is not in the known app list |\r | Batch processing of local files | adp extract local \x3Cfolder path> \x3Cbr> adp parse local \x3Cfolder path> | Batch processing can accept files from the local folder. |\r | Batch processing of URL files | adp extract url \x3CURL list file path> \x3Cbr> adp parse url \x3CURL list file path> | If you need to process multiple URLs in a batch, you can first save the list of URLs in a text file, and then input the corresponding URL of this text file to achieve batch processing at once. |\r \r Note:\r

  • The adp extract command has built-in document parsing capabilities. After ADP automatically parses the document, it performs structured extraction. Therefore, when users need to extract the structured content of the document, there is no need to use apd for parsing.\r
  • URL list file format: A plain text file where each line is a URL pointing to a document to be processed.\r \r \r

Quick Reference for Common Commands\r

\r

# Command Discovery (for Agent introspection)\r
adp schema\r
\r
# Configuration Check\r
adp config get\r
\r
# Query Applications (First Use)\r
adp app-id list\r
\r
# Document Extraction (Invoice/Receipt)\r
adp extract url \x3Cfile URL> --app-id \x3Capp_id>\r
\r
# Document Parsing (Long Document)\r
adp parse url \x3Cfile URL> --app-id \x3Capp_id>\r
\r
# Base64 Input\r
adp extract base64 \x3Cbase64_string> --app-id \x3Capp_id> --file-name invoice.pdf\r
adp parse base64 \x3Cbase64_string> --app-id \x3Capp_id> --file-name document.pdf\r
\r
# Asynchronous Query\r
adp extract query \x3Ctask_id>\r
adp parse query \x3Ctask_id>\r
adp parse query \x3Ctask_id1> \x3Ctask_id2> --watch  # batch query with auto-poll\r
\r
# Batch Processing\r
adp extract local \x3Cfolder path> --app-id \x3Capp_id> --export \x3Cfolder path> --concurrency 2\r
adp parse local \x3Cfolder path> --app-id \x3Capp_id> --export \x3Cfolder path> --concurrency 2\r
```\r
\r
## Performance Optimization Suggestions\r
- **Reuse APP_ID**: Cache it in the context after one query to avoid calling `app-id list` every time.\r
- **Sync First**: For small files (\x3C20MB), prioritize using synchronous calls to avoid asynchronous polling.\r
- **Batch Processing**: Processes multiple documents via `url \x3CURL list file path>` or `local \x3Cfolder path>` in a single run, without looped invocations. Default `--concurrency 2`.\r
- **Local Cache**: Store commonly used APP_IDs in environment variables or configuration files.\r
- **Priority Extraction**: If only key information needs to be extracted, use `extract` instead of `parse` (faster).\r
- **Use --retry for batch**: Set `--retry 2` for batch processing to auto-recover from transient failures.\r
- **Use --timeout for large files**: Increase `--timeout` for files >20MB. Default is 900s.\r
\r
---\r
\r
## Detailed Product Introduction\r
\r
### Core Function Definition\r
- **parse**: Parses the entire document to retrieve full text, layout, structure, and content.\r
- **extract**: Extracts specific structured fields from the document, such as amount, date, company name, and order number.\r
\r
### Application Scenarios\r
- **Long Document Parsing**: Efficiently process long documents with fast parsing speed, accurately extract multiple elements such as text, tables and images, replace manual extraction, and improve efficiency.\r
- **Structured Extraction for Scanned/Photographed Documents**: For scanned documents and photos, complete structured extraction in reading order, generate clear and editable electronic documents, and eliminate manual entry errors.\r
- **Intelligent Invoice Extraction**: After uploading invoice images/documents, AI automatically invokes preset applications to accurately extract 10+ key fields such as invoice number and amount, suitable for financial filing scenarios.\r
- **Intelligent Order Extraction**: Support batch upload of orders from multiple distributors, AI extracts 10+ key fields such as order number and buyer-seller information, automatically identifies currencies, and reduces manual verification costs.\r
- **Domestic ID Document Extraction**: Process in seconds, supporting the identification and extraction of more than 10 common types of documents in China; for example, core information such as name and ID number can be quickly extracted from ID card scans.\r
- **Automatic Splitting and Extraction of Mixed Documents**: Batch upload mixed documents such as contracts and invoices, AI automatically classifies, splits and completes structured extraction to improve processing efficiency.\r
- **Batch Document Processing**: Support batch upload of various business documents, extract information and output standardized structured data, reducing repetitive manual operations.\r
\r
## Detailed Usage Steps\r
\r
### Step 1: Obtain the Installation Package\r
For details, see [references/examples.md](references/examples.md)\r
\r
### Step 2: Obtain and Configure API Key\r
\r
   #### 1. Access the ADP Portal to Obtain Credentials\r
\r
   We provide independent Public Cloud access addresses for domestic and international users, which need to be configured separately by region. Accessing nearby can better ensure high-speed and stable calls across the network.\r
\r
   | Region | Login Address | API Base URL |\r
   |-----|----------|--------------|\r
   | Chinese Mainland | [https://adp.laiye.com/](https://adp.laiye.com/?utm_source=clawhub) | `https://adp.laiye.com/` |\r
   | Overseas Region | [https://adp-global.laiye.com/](https://adp-global.laiye.com/?utm_source=clawhub) | `https://adp-global.laiye.com/` |\r
\r
   #### 2. Get API Key after registration/login\r
   New users need to register an ADP account first, and after registration, they can get 100 free credits/month\r
   - After logging in, click on the personal avatar, and you can directly access the `API_Key` entry.\r
   \r
   #### 3. Complete the authentication configuration\r
   For details, see [references/examples.md](references/examples.md)\r
   \r
   #### 4. Verify the configuration\r
   For details, see [references/examples.md](references/examples.md)\r
   \r
   **Notes**:\r
   1. If API Key and API Base URL have been configured, the configuration information needs to be stored in environment variables to avoid uploading configuration items every time they are used.\r
   2. If API Key and API Base URL have not been configured yet, they need to be configured according to the above steps.\r
\r
### Step 3: Upload Documents\r
After completing the authentication of the API Key, guide the user to upload local files or specify the file URL. After the user uploads the document, they can query the supported application scope of ADP and select the appropriate application for document parsing and extraction. If no suitable application is found, they can choose to create a custom extraction application, configure exclusive fields and parsing modes to meet the personalized document processing requirements.\r
\r
### Step 4: Query Available Applications \r
This function is used to query the built-in applications under the user's account (such as invoices/receipts, orders, common cards and certificates in China region, etc. which are standardized documents). Based on the `app-label`, you can assist in filtering the suitable application IDs. If no suitable application is found, you can choose to create a custom extraction application, configure specific fields and parsing modes to meet the personalized document processing requirements.\r
\r
**Notes**:\r
1. For the first execution, use `adp app-id list`. For subsequent executions, prefer to use `adp app-id cache` (cache the application ID in the context). If the cache becomes invalid or there are no suitable applications in the cache, call `adp app-id list` again to update the cache.\r
\r
For detailed examples of commands and responses, see [references/examples.md](references/examples.md).\r
\r
\r
### Step 5: Add custom extraction application\r
\r
Support creating custom extraction applications, and independently add business-specific extraction fields as needed, and improve the detailed description of each field; the system will accurately identify the document content based on the configured fields and definitions, and complete customized information extraction for personalized documents and non-standard forms.\r
\r
For example commands, responses, and detailed parameter descriptions, please refer to [references/examples.md](references/examples.md) \r
\r
### Step 6: Execute Document Processing\r
\r
### Single Document Parsing\r
\r
Perform document parsing based on the selected application ID, which will return a formatted JSON result containing information such as document content, element position coordinates, OCR Confidence Level, etc.\r
\r
For examples of commands and responses, please refer to [references/examples.md](references/examples.md)\r
\r
### Single Document Extraction\r
\r
Perform document extraction based on the selected application ID, which will return a formatted JSON result containing information such as extraction fields, extraction results, and Confidence Level.\r
\r
For examples of commands and responses, please refer to [references/examples.md](references/examples.md)\r
\r
\r
### Batch Document Processing\r
\r
ADP supports batch processing capabilities. Users can upload multiple file URLs or local folder paths at once, and the system will automatically identify each document type and match the most suitable application for processing, greatly improving the efficiency of batch document processing.\r
\r
For detailed command examples, see [references/examples.md](references/examples.md)\r
\r
**Note**: The number of concurrent requests is limited to 1 for free users, while enterprise users can adjust it according to their needs, with a maximum support of 2.\r
\r
### Asynchronous Processing (Suitable for Large Documents)\r
\r
ADP provides asynchronous processing capabilities, allowing users to choose asynchronous mode to perform document parsing and extraction. The system will return a task ID, and users can periodically query the task status and results through the query interface, which is suitable for processing complex documents or batch documents with long processing times. If the document uploaded by the user is larger than 20MB or contains more than 200 pages, it is recommended to use the asynchronous processing mode.\r
\r
For examples of commands and responses, see [references/examples.md](references/examples.md)\r
\r
---\r
\r
## Complete Command List\r
\r
For a complete list of all available commands with full parameter specs, see [references/commands.md](references/commands.md)\r
\r
## Response Schema Reference\r
\r
For the output structure of each command (including batch processing output mechanism), see [references/response-schema.md](references/response-schema.md)\r
\r
## Error Handling Guide\r
\r
For error codes, types, and Agent auto-recovery strategies, see [references/error-handling.md](references/error-handling.md)\r
\r
---\r
\r
## Precautions\r
\r
When using ADP output, always present the returned data as-is. Do not modify, add, or remove any fields during extraction or parsing to ensure data integrity.\r
\r
1. **API Key Security**: Please keep your API Key secure and avoid disclosing it to unauthorized third parties.\r
2. **API Base URL Configuration**: Select the corresponding address based on the region. For Chinese Mainland, use `https://adp.laiye.com/`, and for overseas regions, use `https://adp-global.laiye.com/`\r
3. **File Size Limit**: The maximum size of a single file is 50MB\r
4. **Supported Formats**: .jpg, .jpeg, .png, .bmp, .tiff, .tif, .pdf, .doc, .docx, .xls, .xlsx\r
5. **Free Quota**: New users receive 100 free credits per month, which are reset at the beginning of each month. Credits can be used for document parsing and extraction processing.\r
6. **Check Balance**: Run `adp credit` to check the current account's credit balance.\r
7. **Billing Rules**:\r
   - Document parsing: 0.5 credits per page\r
   - Invoice/receipt extraction: 1.5 credits per page\r
   - Order extraction: 1.5 credits per page\r
   - Custom extraction: 1 credit per page\r
8. **App ID Reuse**: The app ID used by the user can be remembered for direct use next time, eliminating the need to enter the app_id after each query. The app ID under each user is unique and fixed; unless the user deletes the app, the app_id will not change, and the previously queried app_id can be directly used for document processing calls.\r
\r
---\r
\r
## Related Resources\r
- **CLI Documentation**: [ADP CLI User Guide](https://laiye-tech.feishu.cn/wiki/YIaawiK2DimisZk5KfDc8a8cnLh)\r
- **API Documentation**: [OpenAPI User Guide](https://laiye-tech.feishu.cn/wiki/S1t2wYR04ivndKkMDxxcp2SFnKd)\r
- **User Guide**: [Public Cloud Operation Manual](https://laiye-tech.feishu.cn/wiki/OfexwgVUQiOpEek4kO7c7NEJnAe)\r
- **Problem Feedback**: [GitHub Issues](https://github.com/laiye-ai/adp-cli/issues) | [email protected]\r
- **Official Website**: [Laiye Technology](https://laiye.com)\r
\r
---\r
\r
Copyright © 2026 [Laiye Technology (Beijing) Co., Ltd.] All rights reserved.\r
Usage Guidance
This skill appears to be a legitimate CLI wrapper for Laiye's ADP service and is internally consistent. Before installing or running it: 1) Verify the publisher/source (the package and scripts reference GitHub and Laiye domains) and inspect the npm package and the raw install script instead of blindly piping to a shell. 2) Understand that documents you process will be uploaded to Laiye's cloud (adp.laiye.com/adp-global.laiye.com) and consumption uses billable credits — review privacy/terms and limit sensitive documents if you need on-premises guarantees. 3) Prefer installing in a sandbox or container, and avoid running global npm installs or remote scripts on critical hosts until you review their contents. 4) Limit the API Key scope if possible, rotate keys after testing, and be mindful that the CLI caches app IDs and writes output files to local directories — remove or secure those files if they contain sensitive data.
Capability Analysis
Type: OpenClaw Skill Name: laiye-doc-processing Version: 1.10.3 The skill bundle provides instructions for an AI agent to install and operate the Laiye ADP CLI tool, which involves high-risk behaviors such as executing remote installation scripts via 'curl|bash' and 'npm' (references/examples.md). The instructions in SKILL.md guide the agent to manage sensitive API keys and upload local documents to remote endpoints (adp.laiye.com) for parsing and extraction. While these capabilities are aligned with the stated purpose of document processing, the automated installation of external binaries and the broad file/network access permissions represent a significant security risk and attack surface.
Capability Tags
cryptocan-make-purchasesrequires-sensitive-credentials
Capability Assessment
Purpose & Capability
Name/description (document parsing & extraction) match the SKILL.md and reference docs: commands, app IDs, API Key workflows, batch/local/URL processing, and response schemas are all consistent with a CLI that talks to a cloud ADP service.
Instruction Scope
Instructions tell an agent to install and invoke a CLI, read/write local folders (batch mode), read CLI stdout/stderr and output_dir files, cache APP_IDs, and prompt user for an API Key. These actions are in-scope for a document-processing CLI but do entail reading local files and uploading document content to Laiye's cloud endpoints (adp.laiye.com / adp-global.laiye.com) — expected for this product but privacy-sensitive.
Install Mechanism
Install methods include npm package install and executing remote install scripts via curl | bash or PowerShell that download from raw.githubusercontent.com. GitHub raw and npm registry are common publication vectors, but piping remote scripts to a shell and global npm installs carry moderate risk (postinstall hooks, arbitrary script execution). Recommend inspecting the install script and package before running in production.
Credentials
The skill requests no built-in env vars, but runtime requires an ADP API Key (user-provided) and optionally allows storing APP_IDs in env/config. That credential is proportional to its cloud-based purpose. Be aware that documents processed are sent to the service and usage consumes billable credits; no unexplained or unrelated credentials are requested.
Persistence & Privilege
Skill is not always-enabled and allows model invocation (normal). The CLI persists an application list cache (docs state cache is 'permanent and does not expire') and writes results to local output directories (absolute paths). These are reasonable for a CLI but mean identifiers and outputs may persist on disk — consider cleanup and scope the CLI's access.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install laiye-doc-processing
  3. After installation, invoke the skill by name or use /laiye-doc-processing
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.10.3
# agentic-doc-parse-and-extract 1.10.3 Changelog - Renamed the skill from `laiye-doc-processing` to `agentic-doc-parse-and-extract`. - Completely revised and expanded documentation with clearer workflows, command-line usage examples, and agent requirements. - Added dedicated files for README in Chinese, license, command references, error handling, example usage, and response schema. - Deprecated legacy config metadata files (`_meta.json`, `package.json`). - Documented best practices for batch processing, performance, and error recovery. - Clarified distinction between parsing full-document content vs. extracting structured key fields.
v1.3.0
License: Published under MIT-0 (ClawHub platform mandatory). This skill only provides API calling logic; the underlying service (API access) requires a valid API Key from our official web platform. Commercial license and usage credits are managed via our web portal (https://adp-global.laiye.com/?utm_source=clawhub). New users receive 100 free monthly credits (applicable only to valid API Key holders).
v1.2.0
- Update version number
v1.0.1
- Added a comprehensive README.md guide for quick start, setup, usage examples, security, and best practices. - Expanded and clarified supported use cases, field extraction examples, and supported formats. - Improved step-by-step instructions for credential setup and file extraction (including file URL and base64 methods). - Enhanced security guidance and detailed information on API response structure. - Organized instructions for both synchronous and asynchronous extraction, plus advanced options like scale and config version. - Updated use case sections and best practice recommendations for enterprise document processing.
v1.0.0
Initial release of laiye-doc-processing: Enterprise-grade document extraction API. - Extracts key fields and line items from various business documents across 10+ file formats (PDF, PNG, JPG, DOCX, XLSX, etc.) to structured JSON or Excel with field-level confidence scores. - Offers synchronous and asynchronous (task-based) extraction, supporting both file URLs and base64 uploads. - Access VLM document recognition results and tailor extraction with custom model parameters. - Simple zero-configuration integration and comprehensive API credential security guidance. - Professionally optimized for invoices, receipts, orders, contracts, and more; includes sample curl requests and best practices. - Each user receives 100 free credits monthly; commercial license required for continued use.
Metadata
Slug laiye-doc-processing
Version 1.10.3
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 5
Frequently Asked Questions

What is laiye-doc-processing?

Enables AI-powered parsing and key information extraction from high-frequency documents including invoices, orders, receipts, long texts, and common Chinese... It is an AI Agent Skill for Claude Code / OpenClaw, with 531 downloads so far.

How do I install laiye-doc-processing?

Run "/install laiye-doc-processing" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is laiye-doc-processing free?

Yes, laiye-doc-processing is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does laiye-doc-processing support?

laiye-doc-processing is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created laiye-doc-processing?

It is built and maintained by Jeane-li (@jeane-li); the current version is v1.10.3.

💬 Comments