← Back to Skills Marketplace
149
Downloads
0
Stars
0
Active Installs
3
Versions
Install in OpenClaw
/install online-deepseek-ocr
Description
Perform cloud-based OCR on PNG, JPG, JPEG, and WEBP images using SiliconFlow's DeepSeek-OCR with automatic image preprocessing for enhanced recognition.
Usage Guidance
This skill appears to do what it says (call SiliconFlow DeepSeek-OCR with local preprocessing). Before installing, consider:
- Keep your SiliconFlow API key private: config.json stores it in plaintext in the skill directory — avoid committing that file to source control and protect the filesystem location.
- Verify the baseUrl in config.json is the legitimate SiliconFlow endpoint (https://api.siliconflow.cn/v1). An attacker-controlled baseUrl could capture your images and key.
- Be careful when passing image URLs: if you provide internal or sensitive URLs (e.g., 169.254.169.254 or internal hostnames), those will be fetched by the external API and could leak sensitive data (SSRF/data-exfiltration risk). Prefer local files when possible.
- Review the SiliconFlow service terms and privacy policy before sending sensitive images to a third party.
- The pip dependencies are normal; run dependency installation in a controlled environment (virtualenv) if you have security concerns.
If you want stronger secrecy, consider modifying the code to read the API key from a protected environment variable or a secure secrets store rather than a plaintext file.
Capability Analysis
Type: OpenClaw Skill
Name: online-deepseek-ocr
Version: 1.0.2
The skill provides online OCR capabilities using the SiliconFlow DeepSeek-OCR API. The code in scripts/ocr.py implements standard image preprocessing (using Pillow and numpy) and API communication logic without any signs of data exfiltration, malicious execution, or prompt injection. It correctly handles local files and URLs as described in SKILL.md and requires a user-provided API key in config.json.
Capability Assessment
Purpose & Capability
The skill advertises cloud OCR using SiliconFlow and the code reads a local config.json apiKey and posts images (or image data URIs) to SiliconFlow's API endpoint. Required dependencies (requests, Pillow, numpy) match the described preprocessing and HTTP calls. No unrelated credentials, binaries, or system paths are requested.
Instruction Scope
SKILL.md and the code instruct the user to place an API key in a local config.json and to pass either local image paths or arbitrary image URLs. Passing arbitrary URLs will cause the third-party API to fetch those URLs (the code forwards the URL to SiliconFlow). This can leak access to internal-only resources (SSRF risk) or disclose sensitive internal URLs/contents to the external service. The instructions also direct storing the API key on disk in plaintext (config.json).
Install Mechanism
There is no automatic install script; dependencies are a small pip requirements.txt (requests, Pillow, numpy) which is proportionate to image preprocessing and HTTP calls. No downloads from untrusted URLs or extract/install steps are present.
Credentials
No environment variables or unrelated credentials are requested. The skill requires a local config.json containing apiKey and baseUrl. Storing the API key in a plaintext file in the skill directory is functional but has privacy/operational trade-offs (risk of accidental commit or other local access). The baseUrl is configurable in config.json; if modified to a malicious host it could capture images and the apiKey.
Persistence & Privilege
The skill is user-invocable, not always-enabled, and does not request elevated platform privileges or modify other skills. It only persists the user-provided config.json in its own directory.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install online-deepseek-ocr - After installation, invoke the skill by name or use
/online-deepseek-ocr - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.2
- Removed the debug_config.py file for a cleaner production release.
- No user-facing feature changes or documentation updates in this version.
v1.0.1
- Added config.json for streamlined API key configuration.
- Added precompiled script file scripts/__pycache__/ocr.cpython-312.pyc.
- Updated documentation with simplified usage instructions, new output format, and troubleshooting section.
- Clarified steps for initial setup and API key configuration.
- Minor documentation cleanup; aliases and detailed preprocessing configuration moved or condensed.
v1.0.0
- First release of the Online DeepSeek OCR skill.
- Supports cloud-based OCR using SiliconFlow's free DeepSeek-OCR model.
- Accepts PNG, JPG, JPEG, and WEBP formats; multi-language recognition.
- Automatic image pre-processing for improved OCR accuracy: resizing, denoising, grayscale, contrast enhancement, sharpening, and optional binarization.
- Customizable API and pre-processing settings via openclaw.json.
- Output includes recognized text, confidence scores, and per-line details.
Metadata
Frequently Asked Questions
What is Online Deepseek-OCR?
Perform cloud-based OCR on PNG, JPG, JPEG, and WEBP images using SiliconFlow's DeepSeek-OCR with automatic image preprocessing for enhanced recognition. It is an AI Agent Skill for Claude Code / OpenClaw, with 149 downloads so far.
How do I install Online Deepseek-OCR?
Run "/install online-deepseek-ocr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Online Deepseek-OCR free?
Yes, Online Deepseek-OCR is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Online Deepseek-OCR support?
Online Deepseek-OCR is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Online Deepseek-OCR?
It is built and maintained by qwq2023 (@qwq2023qwq); the current version is v1.0.2.
More Skills