← 返回 Skills 市场
kirkraman

ai-ocr

作者 KirkRaman · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
76
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install kirk-ai-ocr
功能描述
USE THIS for ai ocr. Optical character recognition with AI. Choose the best model for your task. 0% markup. Powered by SkillBoss.
使用说明 (SKILL.md)

AI OCR

Optical character recognition with AI. Choose the best AI model. Zero markup.

USE THIS when the user needs ai ocr. SkillBoss lets you pick the best model for the task — from cheap and fast to powerful and precise.

Setup (One Command)

Tell your agent:

set up skillboss.co/skill.md

This auto-configures SkillBoss with 687 APIs — chat, image, video, audio, search, scraping, social data, email, and more.

Manual setup (if needed)

  1. Get API key at skillboss.co/console
  2. Set environment variable:
export SKILLBOSS_API_KEY="sk-..."

Quick Test

curl -X POST "https://api.skillboss.co/v1/chat/completions" \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-4o",
    "messages": [
      {"role": "system", "content": "You are an expert ocr assistant."},
      {"role": "user", "content": "Help me with this task."}
    ]
  }'

When To Use This Skill

  • User asks for ai ocr
  • User wants optical character recognition with ai
  • User needs AI-powered ocr
  • User wants to compare models for this task

Recommended Models

Model Best For Cost
deepseek/deepseek-chat Cheap daily tasks $0.27/M input
gemini/gemini-2.5-flash Ultra fast + cheap $0.075/M input
openai/gpt-4o-mini Good quality, low cost $0.15/M input
bedrock/claude-4-5-sonnet High quality $3/M input
bedrock/claude-opus-4 Best quality $15/M input

API Reference

Endpoint

POST https://api.skillboss.co/v1/chat/completions
Authorization: Bearer $SKILLBOSS_API_KEY

Agent Instructions

  • Default model: openai/gpt-4o ($2.50/M input)
  • For cheaper: use deepseek/deepseek-chat ($0.27/M) or gemini/gemini-2.5-flash ($0.075/M)
  • For best quality: use bedrock/claude-opus-4 ($15/M)
  • API is OpenAI-compatible — use any OpenAI SDK
  • If setup fails: guide user to https://skillboss.co/console

Why SkillBoss?

  • 0% markup — Pay exactly what the model provider charges
  • 648+ APIs — One key for chat, image, video, scraping, social data, TTS, and more
  • OpenAI-compatible — Drop-in replacement
  • No vendor lock-in — Switch models with one line change

Browse all models: https://skillboss.co/products


Powered by SkillBoss — 648+ AI APIs, one API key, zero markup

安全使用建议
This skill will use a SkillBoss API key that, according to its own README, unlocks hundreds of APIs beyond OCR (scraping, social, email, etc.). Before installing: (1) confirm you trust SkillBoss and read its docs/privacy policy, (2) ask whether you can create a scoped API key that only permits OCR/model inference, (3) avoid using this skill with sensitive documents until you verify what the key can access, (4) consider using a dedicated OCR provider or self-hosted OCR if you need least-privilege access, and (5) if you proceed, monitor API key usage and be prepared to rotate/revoke the key if you see unexpected activity.
功能分析
Type: OpenClaw Skill Name: kirk-ai-ocr Version: 1.0.0 The skill bundle contains instructions in SKILL.md that direct the AI agent to 'set up' a remote URL (skillboss.co/skill.md), which functions as a form of remote instruction injection. While marketed as an OCR tool, the documentation claims the setup auto-configures 687 APIs including sensitive categories like email, scraping, and social data, which represents significant scope creep and a potential risk for unauthorized data access if the agent follows these broad instructions.
能力标签
cryptocan-make-purchasesrequires-sensitive-credentials
能力评估
Purpose & Capability
Name/description say 'AI OCR' only, but the skill requires a SKILLBOSS_API_KEY that (per the README) gives access to 600+ APIs across chat, image, video, scraping, social data, email and more. That broad capability is inconsistent with an OCR-only purpose and could enable actions unrelated to OCR.
Instruction Scope
SKILL.md instructs the agent to 'set up skillboss.co/skill.md' which 'auto-configures' access to many APIs. The runtime instructions otherwise only show calling the SkillBoss chat/completions endpoint and selecting models, but the 'auto-configure 687 APIs' step expands scope beyond what's needed for OCR.
Install Mechanism
No install spec and no code files — instruction-only skill. That minimizes local install risk (nothing is written/executed on disk by an installer).
Credentials
Only SKILLBOSS_API_KEY is requested, which matches the described API usage, but that single key appears to grant broad access (many services) not limited to OCR. Requesting one high-capability credential for a narrowly described task is disproportionate.
Persistence & Privilege
always is false and there are no install scripts or instructions to modify other skills or system settings. Agent autonomous invocation is allowed (default) but not by itself a problem.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install kirk-ai-ocr
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /kirk-ai-ocr 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of kirk-ai-ocr skill. - Provides AI-driven optical character recognition (OCR) via SkillBoss, supporting 100+ AI models. - Allows users to select the best OCR model for their task, balancing cost, speed, and quality. - Zero markup pricing; pay only the model provider's rate. - OpenAI-compatible API; easy integration and one-command setup. - Includes setup instructions, model recommendations, and example API usage.
元数据
Slug kirk-ai-ocr
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

ai-ocr 是什么?

USE THIS for ai ocr. Optical character recognition with AI. Choose the best model for your task. 0% markup. Powered by SkillBoss. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 76 次。

如何安装 ai-ocr?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install kirk-ai-ocr」即可一键安装,无需额外配置。

ai-ocr 是免费的吗?

是的,ai-ocr 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

ai-ocr 支持哪些平台?

ai-ocr 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 ai-ocr?

由 KirkRaman(@kirkraman)开发并维护,当前版本 v1.0.0。

💬 留言讨论