← 返回 Skills 市场
qwq2023qwq

Online Deepseek-OCR

作者 qwq2023 · GitHub ↗ · v1.0.2 · MIT-0
cross-platform ✓ 安全检测通过
149
总下载
0
收藏
0
当前安装
3
版本数
在 OpenClaw 中安装
/install online-deepseek-ocr
功能描述
Perform cloud-based OCR on PNG, JPG, JPEG, and WEBP images using SiliconFlow's DeepSeek-OCR with automatic image preprocessing for enhanced recognition.
安全使用建议
This skill appears to do what it says (call SiliconFlow DeepSeek-OCR with local preprocessing). Before installing, consider: - Keep your SiliconFlow API key private: config.json stores it in plaintext in the skill directory — avoid committing that file to source control and protect the filesystem location. - Verify the baseUrl in config.json is the legitimate SiliconFlow endpoint (https://api.siliconflow.cn/v1). An attacker-controlled baseUrl could capture your images and key. - Be careful when passing image URLs: if you provide internal or sensitive URLs (e.g., 169.254.169.254 or internal hostnames), those will be fetched by the external API and could leak sensitive data (SSRF/data-exfiltration risk). Prefer local files when possible. - Review the SiliconFlow service terms and privacy policy before sending sensitive images to a third party. - The pip dependencies are normal; run dependency installation in a controlled environment (virtualenv) if you have security concerns. If you want stronger secrecy, consider modifying the code to read the API key from a protected environment variable or a secure secrets store rather than a plaintext file.
功能分析
Type: OpenClaw Skill Name: online-deepseek-ocr Version: 1.0.2 The skill provides online OCR capabilities using the SiliconFlow DeepSeek-OCR API. The code in scripts/ocr.py implements standard image preprocessing (using Pillow and numpy) and API communication logic without any signs of data exfiltration, malicious execution, or prompt injection. It correctly handles local files and URLs as described in SKILL.md and requires a user-provided API key in config.json.
能力评估
Purpose & Capability
The skill advertises cloud OCR using SiliconFlow and the code reads a local config.json apiKey and posts images (or image data URIs) to SiliconFlow's API endpoint. Required dependencies (requests, Pillow, numpy) match the described preprocessing and HTTP calls. No unrelated credentials, binaries, or system paths are requested.
Instruction Scope
SKILL.md and the code instruct the user to place an API key in a local config.json and to pass either local image paths or arbitrary image URLs. Passing arbitrary URLs will cause the third-party API to fetch those URLs (the code forwards the URL to SiliconFlow). This can leak access to internal-only resources (SSRF risk) or disclose sensitive internal URLs/contents to the external service. The instructions also direct storing the API key on disk in plaintext (config.json).
Install Mechanism
There is no automatic install script; dependencies are a small pip requirements.txt (requests, Pillow, numpy) which is proportionate to image preprocessing and HTTP calls. No downloads from untrusted URLs or extract/install steps are present.
Credentials
No environment variables or unrelated credentials are requested. The skill requires a local config.json containing apiKey and baseUrl. Storing the API key in a plaintext file in the skill directory is functional but has privacy/operational trade-offs (risk of accidental commit or other local access). The baseUrl is configurable in config.json; if modified to a malicious host it could capture images and the apiKey.
Persistence & Privilege
The skill is user-invocable, not always-enabled, and does not request elevated platform privileges or modify other skills. It only persists the user-provided config.json in its own directory.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install online-deepseek-ocr
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /online-deepseek-ocr 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
- Removed the debug_config.py file for a cleaner production release. - No user-facing feature changes or documentation updates in this version.
v1.0.1
- Added config.json for streamlined API key configuration. - Added precompiled script file scripts/__pycache__/ocr.cpython-312.pyc. - Updated documentation with simplified usage instructions, new output format, and troubleshooting section. - Clarified steps for initial setup and API key configuration. - Minor documentation cleanup; aliases and detailed preprocessing configuration moved or condensed.
v1.0.0
- First release of the Online DeepSeek OCR skill. - Supports cloud-based OCR using SiliconFlow's free DeepSeek-OCR model. - Accepts PNG, JPG, JPEG, and WEBP formats; multi-language recognition. - Automatic image pre-processing for improved OCR accuracy: resizing, denoising, grayscale, contrast enhancement, sharpening, and optional binarization. - Customizable API and pre-processing settings via openclaw.json. - Output includes recognized text, confidence scores, and per-line details.
元数据
Slug online-deepseek-ocr
版本 1.0.2
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 3
常见问题

Online Deepseek-OCR 是什么?

Perform cloud-based OCR on PNG, JPG, JPEG, and WEBP images using SiliconFlow's DeepSeek-OCR with automatic image preprocessing for enhanced recognition. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 149 次。

如何安装 Online Deepseek-OCR?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install online-deepseek-ocr」即可一键安装,无需额外配置。

Online Deepseek-OCR 是免费的吗?

是的,Online Deepseek-OCR 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Online Deepseek-OCR 支持哪些平台?

Online Deepseek-OCR 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Online Deepseek-OCR?

由 qwq2023(@qwq2023qwq)开发并维护,当前版本 v1.0.2。

💬 留言讨论