
aiparse-ocr

Name: aiparse-ocr
Author: do0388309

by do0388309 · GitHub ↗ · v1.0.2 · MIT-0

cross-platform ✓ Security Clean

170 Downloads

Install in OpenClaw

/install aiparse-ocr

Description

Parse PDF files using LLM. **No registration required - free trial available!** Extract information from PDF files and return results in JSON or Markdown for...

README (SKILL.md)

# AI Parse

A skill for parsing PDF files using Large Language Models.

## Capabilities

- Extract information from PDF files
- Return results in JSON or Markdown format
- Resume processing from existing task ID
- Save task ID information to JSON file for reference

## Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| pdf_path | string | required | Path to the PDF file to process |
| result_path | string | required | Path to save the parsing result |
| format | string | required | Output format: "json" or "md" |
| task_id_path | string | required | Path to save task ID information (JSON format) |
| --task-id | string | optional | Existing task ID to resume processing |

## Usage Examples

### Normal Upload Mode

```bash
python handler.py <pdf_path> <result_path> <format> <task_id_path>
```

### Resume from Existing Task or Check Status

```bash
python handler.py --task-id <task_id> <result_path> <format>
```
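For agents or scripts that call the handler programmatically, the two command shapes above can be assembled as argument lists. This is a minimal sketch, assuming `handler.py` sits in the working directory and accepts exactly the arguments documented here. `build_upload_command` and `build_resume_command` are hypothetical helper names, not part of the skill.

```python
import sys

VALID_FORMATS = ("json", "md")


def _check_format(fmt):
    # The README documents only "json" and "md" as output formats.
    if fmt not in VALID_FORMATS:
        raise ValueError("format must be 'json' or 'md', got %r" % (fmt,))


def build_upload_command(pdf_path, result_path, fmt, task_id_path,
                         handler="handler.py"):
    """Argument list for normal upload mode (hypothetical helper)."""
    _check_format(fmt)
    return [sys.executable, handler, pdf_path, result_path, fmt, task_id_path]


def build_resume_command(task_id, result_path, fmt, handler="handler.py"):
    """Argument list for resuming or checking an existing task."""
    _check_format(fmt)
    return [sys.executable, handler, "--task-id", task_id, result_path, fmt]
```

Either list can be handed to `subprocess.run(cmd)`. Passing a list rather than a shell string avoids quoting problems when a path contains spaces.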

## Task ID File Format

When using normal upload mode, a task ID file will be created at `task_id_path` with the following JSON structure:

```json
{
  "task_id": "AAFXKO",
  "pdf_path": "test.pdf",
  "submit_time": "2026-04-04 00:33:27"
}
```

This file can be used to:
- Track the submitted task
- Retrieve the task ID later for status checking
- Resume processing if interrupted
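Reading the task ID back is a plain JSON load. The sketch below assumes the file has exactly the three fields shown in the example and that `submit_time` always follows the `YYYY-MM-DD HH:MM:SS` layout; the time zone is not documented, so the timestamp is returned as a naive `datetime`. `load_task_info` is a hypothetical helper name.

```python
import json
from datetime import datetime


def load_task_info(task_id_path):
    """Read a task ID file and return (task_id, pdf_path, submit_time)."""
    with open(task_id_path, "r", encoding="utf-8") as fh:
        info = json.load(fh)
    # Timestamp layout inferred from the example file; an assumption.
    submitted = datetime.strptime(info["submit_time"], "%Y-%m-%d %H:%M:%S")
    return info["task_id"], info["pdf_path"], submitted
```

The returned `task_id` is the value to pass to `--task-id` when resuming or checking status.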

## Implementation

Implemented by `handler.py` which:
- Uploads PDF files to the processing service
- Polls for processing completion
- Downloads and saves results in the requested format
- Supports resuming from existing task IDs
- Saves task ID information to JSON file
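The polling step can be illustrated in isolation. The README does not document the service's endpoints or status values, so this sketch takes a caller-supplied `fetch_status` function instead of making network calls, and the status strings `"done"` and `"failed"` are placeholders. It is not a copy of what `handler.py` does.

```python
import time


def poll_until_done(fetch_status, interval=5.0, timeout=600.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call fetch_status() until it reports a terminal state or time runs out.

    fetch_status is a caller-supplied function returning a status string.
    "done" and "failed" are placeholder values, not the service's documented
    vocabulary.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status in ("done", "failed"):
            return status
        if clock() >= deadline:
            # A timeout is not a failure: the task may still be running.
            return "timeout"
        sleep(interval)
```

Returning `"timeout"` as a distinct value keeps a slow job from being reported as a failed one, which matters for long-running PDFs.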

## Environment Requirements

- Python 3.6+
- requests library
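A quick preflight check for these two requirements might look like the following. `check_environment` is a hypothetical helper; it only tests whether `requests` can be located and does not import it or check its version.

```python
import importlib.util
import sys


def check_environment():
    """Report whether the documented requirements appear to be met."""
    return {
        "python_ok": sys.version_info >= (3, 6),
        "requests_ok": importlib.util.find_spec("requests") is not None,
    }
```

If `requests_ok` comes back false, `pip install requests` is the usual remedy.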

## Return Value

The parsed result will be saved to the specified `result_path` in the requested format:
- **JSON format:** Structured JSON with task details and extracted content
- **Markdown format:** Formatted Markdown with page-by-page content
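Downstream code can load the saved file according to the format that was requested. The JSON schema is not documented here, so this sketch returns whatever `json.load` produces without assuming any field names. UTF-8 encoding is also an assumption, and `load_result` is a hypothetical helper name.

```python
import json


def load_result(result_path, fmt):
    """Return parsed JSON for "json" output, or raw text for "md" output."""
    if fmt not in ("json", "md"):
        raise ValueError("format must be 'json' or 'md', got %r" % (fmt,))
    # UTF-8 is assumed; the README does not state the output encoding.
    with open(result_path, "r", encoding="utf-8") as fh:
        return json.load(fh) if fmt == "json" else fh.read()
```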

## Notes

- For large PDF files, processing may take multiple minutes
- Free users can process 30 PDF pages - visit https://api.pinocch.com/index for extra trial credits
- The `--task-id` parameter can be used to resume processing if interrupted
- Check the console output for processing progress and status updates
- The task ID file is created immediately after successful upload
- **IMPORTANT FOR AGENTS:** Before declaring a task as failed, always use the task ID to check the current status of the task. Use the `--task-id` parameter to resume or verify the task status. The task may still be processing or have completed successfully.
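The guidance for agents can be sketched as a small wrapper: if the upload run does not succeed, look for the task ID file and rerun in resume mode before reporting failure. Treating a non-zero exit code as failure is an assumption, since the README does not document the exit codes of `handler.py`. `run_with_status_check` and its `run` parameter are hypothetical names.

```python
import json
import os
import sys


def run_with_status_check(run, pdf_path, result_path, fmt, task_id_path,
                          handler="handler.py"):
    """Run an upload; on a non-zero exit, re-check via --task-id before failing.

    run is a caller-supplied function that takes an argument list and returns
    an exit code, for example: lambda cmd: subprocess.call(cmd).
    """
    upload = [sys.executable, handler, pdf_path, result_path, fmt, task_id_path]
    if run(upload) == 0:
        return True
    # The task ID file is written right after a successful upload, so its
    # presence means the service may still be working on the task.
    if not os.path.exists(task_id_path):
        return False
    with open(task_id_path, "r", encoding="utf-8") as fh:
        task_id = json.load(fh)["task_id"]
    resume = [sys.executable, handler, "--task-id", task_id, result_path, fmt]
    return run(resume) == 0
```

A task ID file left over from an earlier run would also trigger the resume path, so using a fresh `task_id_path` per upload keeps the check meaningful.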

Usage Guidance

This skill uploads any PDF you give it to https://api.pinocch.com for processing. If your PDFs contain sensitive or confidential information, do not send them to an untrusted third party. The code allows optional username/api_token authentication but the SKILL.md does not document supplying those credentials — if you have an account, review handler.py to see how to supply credentials, or test in trial mode with non‑sensitive documents first. Review the handler.py file yourself (or have a developer do so) to confirm no unexpected network endpoints or behavior, and restrict use to documents you are comfortable sharing with the external service.

Capability Analysis

Type: OpenClaw Skill
Name: aiparse-ocr
Version: 1.0.2

The aiparse-ocr skill is a legitimate tool for extracting data from PDF files using a third-party API (api.pinocch.com). The handler.py script implements standard file upload and polling logic using the requests library, and the SKILL.md instructions are focused on ensuring the AI agent correctly manages long-running tasks. No evidence of data exfiltration, malicious execution, or persistence was found.

Capability Tags

crypto
can-make-purchases

Capability Assessment

✓ Purpose & Capability

The skill name/description (PDF parsing / OCR using an LLM-backed service) matches the implementation: handler.py uploads PDFs to api.pinocch.com, polls for results, and saves parsed output. Required capabilities are consistent with a remote parsing service.

ℹ Instruction Scope

SKILL.md instructs the agent to run handler.py to upload PDFs, poll status, and save results — which is exactly what the code does. Important scope note: the skill uploads the user's PDF files to an external domain (api.pinocch.com). There are no instructions to read unrelated local files, but any PDF passed will be transmitted off‑device.

✓ Install Mechanism

No install spec is present (instruction-only with an included handler.py). This reduces install-time risk; the code will run when invoked and performs network calls. No unusual download/install operations are performed by the skill itself.

ℹ Credentials

The registry metadata declares no required environment variables or credentials, and the skill works in 'trial mode' with no auth. The code, however, supports optional username and api_token headers (Authorization: Bearer ...) even though SKILL.md does not document how to provide them — minor documentation inconsistency. No unrelated secrets or system credentials are requested.

✓ Persistence & Privilege

The skill does not request persistent/system privileges (always:false). It writes only task ID and result files in paths supplied by the user and does not modify other skills or global agent settings.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install aiparse-ocr
After installation, invoke the skill by name or use /aiparse-ocr
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.2

- Removed secret.txt file from the repository.
- Updated documentation: No registration required and free trial mode highlighted.
- Authentication parameters (username, secret) removed from documentation and usage instructions.
- Clarified free page limit for unregistered users (30 pages).
- Streamlined usage examples for simpler, credential-free command structure.

v1.0.1

Major update: Improved task management, resume functionality, and output tracking for PDF parsing.
- Added support for resuming processing using task IDs and checking task status.
- Task ID is saved to a JSON file, enabling easier tracking and recovery.
- New parameters: task_id_path (required) and --task-id (optional) for managing ongoing or interrupted tasks.
- Enhanced usage documentation with authenticated/trial modes and examples for resuming tasks.
- Updated important notes: Agents should always check task status via task ID before declaring failure.
- Revised environment requirements, keywords, return value details, and general guidance for improved clarity.

v1.0.0

- Initial public release of aiparse-ocr skill.
- Provides parsing of PDF files using large language models.
- Extracts structured information from PDFs and outputs results in JSON or Markdown format.
- Supports optional authentication for usage beyond trial mode.
- Allows result file output for processed data.
- Includes clear usage instructions and parameter documentation.

Metadata

Slug aiparse-ocr

Version 1.0.2

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 3

Frequently Asked Questions

What is aiparse-ocr?

Parse PDF files using LLM. **No registration required - free trial available!** Extract information from PDF files and return results in JSON or Markdown for... It is an AI Agent Skill for Claude Code / OpenClaw, with 170 downloads so far.

How do I install aiparse-ocr?

Run "/install aiparse-ocr" in the OpenClaw or Claude Code chat to install it in one step. Running the skill also requires Python 3.6+ and the requests library.

Is aiparse-ocr free?

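Yes, the skill itself is free and licensed under MIT-0, so you can download, install and use it at no cost. The parsing service it calls is separate: free users can process 30 PDF pages, and extra trial credits are available at https://api.pinocch.com/index.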

Which platforms does aiparse-ocr support?

aiparse-ocr is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created aiparse-ocr?

It is built and maintained by do0388309 (@do0388309); the current version is v1.0.2.
