← 返回 Skills 市场
ats3v

Deep Infra

作者 Georgi Atsev · GitHub ↗ · v1.0.0 · MIT-0
linuxdarwinwin32 ✓ 安全检测通过
141
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install deep-infra
功能描述
Configure DeepInfra model routing with provider auth, model selection, fallback chains, and cost-aware defaults for stable open-source and frontier model wor...
使用说明 (SKILL.md)

Setup

On first use, read setup.md to align activation boundaries, reliability goals, and routing preferences before making configuration changes.

When to Use

Use this skill when the user wants to connect an OpenAI-compatible workflow to DeepInfra, choose open-source and frontier models by task type, set safe fallbacks, and control cost drift over time.

Architecture

Memory lives in ~/deep-infra/. See memory-template.md for structure.

~/deep-infra/
├── memory.md            # Active routing profile and constraints
├── providers.md         # Confirmed provider and auth choices
├── routing-rules.md     # Task -> model and fallback policy
├── incidents.md         # Outages, rate limits, and recovery notes
└── budgets.md           # Spend guardrails and optimization actions

Quick Reference

Use the smallest relevant file for the current task.

Topic File
Setup and activation preferences setup.md
Memory template memory-template.md
Authentication and provider wiring auth-and-provider.md
Routing patterns by workload routing-playbooks.md
Reliability and fallback handling fallback-reliability.md
Cost controls and spend reviews cost-guardrails.md

Core Rules

1. Start from Workload Classes, Not Model Hype

  • Classify requests first: coding, analysis, extraction, summarization, or long-context synthesis.
  • Map each class to a primary model and a fallback before changing any defaults.

2. Keep Authentication Explicit and Verifiable

  • Use DEEPINFRA_API_KEY from the local environment, never pasted into logs or chat memory.
  • Validate auth with a minimal request before applying routing changes.

3. Design Fallbacks for Failure Modes, Not Convenience

  • Separate fallback reasons: rate limit, provider outage, latency spike, or output quality failure.
  • Keep at least one fallback from a different model family for resilience.

4. Leverage Open-Source Model Diversity

  • DeepInfra hosts models from many providers (DeepSeek, Moonshot, MiniMax, StepFun, NVIDIA, and more).
  • Use model diversity to build resilient fallback chains across independent model families.

5. Enforce Cost Boundaries Before Throughput Tuning

  • Set cost ceilings by task class and check expected token burn before broad rollout.
  • Route low-stakes tasks to cheaper models and reserve premium models for high-impact tasks.

6. Change One Layer at a Time

  • Modify either model selection, fallback policy, or budget limits in a single iteration.
  • After each change, run a quick verification prompt set and record outcome.

7. Record Decisions for Repeatability

  • Save the final routing policy, rationale, and known tradeoffs in memory.
  • Reuse proven policies instead of repeatedly rebuilding from scratch.

Common Traps

  • Choosing one model for every task -> higher cost and unstable quality under varied workloads.
  • Using same-family fallback chain only -> cascading failures during model-specific incidents.
  • Ignoring token limits for long inputs -> truncated responses and hidden quality loss.
  • Changing routing and budgets simultaneously -> unclear root cause when quality drops.
  • Running without verification prompts -> broken routing detected only after user-facing failures.

External Endpoints

These endpoints are used only to discover model metadata and execute routed inference requests under explicit user task intent.

Endpoint Data Sent Purpose
https://api.deepinfra.com/v1/openai/models none or auth header Discover current model catalog and metadata
https://api.deepinfra.com/v1/openai/chat/completions user prompt content and selected model id Execute routed inference requests

No other data is sent externally.

Security & Privacy

Data that leaves your machine:

  • Prompt text and selected model metadata sent to DeepInfra when inference is requested.

Data that stays local:

  • Routing notes and preferences under ~/deep-infra/.
  • Local environment variable references and verification logs.

This skill does NOT:

  • Request raw API keys in chat.
  • Store plaintext secrets in skill memory files.
  • Modify files outside ~/deep-infra/ for its own state.

Trust

By using this skill, prompt content is sent to DeepInfra for model execution. Only install if you trust this service with your data.

Related Skills

Install with clawhub install \x3Cslug> if user confirms:

  • api — API request design, payload shaping, and response validation patterns
  • auth — credential handling and auth troubleshooting workflows
  • models — model comparison and selection guidance
  • monitoring — runtime health checks and incident tracking practices

Feedback

  • If useful: clawhub star deep-infra
  • Stay updated: clawhub sync
安全使用建议
This skill appears coherent for configuring DeepInfra routing. Before installing: 1) Confirm you trust api.deepinfra.com (data sent there includes user prompts). 2) Protect your DEEPINFRA_API_KEY — set it as an environment variable rather than passing it on a command line to avoid process-list exposure. 3) Inspect ~/deep-infra/ after first run to ensure no secrets are stored in plaintext. 4) If you need tighter controls, use a least-privileged/rotation-capable key and review DeepInfra's data retention and privacy terms. If any of these checks fail or DeepInfra is untrusted, do not install.
功能分析
Type: OpenClaw Skill Name: deep-infra Version: 1.0.0 The skill is a legitimate configuration and management tool for DeepInfra model routing. It uses standard CLI utilities (curl, jq) to interact with official DeepInfra API endpoints and follows security best practices by explicitly instructing the agent to use environment variables for authentication rather than storing secrets in plaintext. All file operations are localized to the '~/deep-infra/' directory, and the instructions in SKILL.md and setup.md are strictly aligned with the stated purpose of model selection, cost management, and reliability.
能力标签
requires-oauth-token
能力评估
Purpose & Capability
Name/description match the requested resources: curl/jq are used in examples, and DEEPINFRA_API_KEY is the expected credential for calling api.deepinfra.com. The documented files (routing playbooks, auth, setup, cost guardrails) align with a routing/configuration skill.
Instruction Scope
Runtime instructions operate on ~/deep-infra/ and call DeepInfra endpoints via curl/jq; they do not request unrelated system credentials or cross-check unrelated files. Note: the README shows an example CLI invocation (openclaw onboard --deepinfra-api-key <key>) that would place the key on the command line (process args) and could expose it to local process listings — prefer environment variables or secure input. Also verify created memory files do not accidentally include secrets.
Install Mechanism
Instruction-only skill with no install spec and no remote downloads; lowest install risk. Required binaries are minimal and expected for the provided curl/jq examples.
Credentials
Only a single environment variable (DEEPINFRA_API_KEY) is required and is justified by the skill's purpose. No unrelated secrets, config paths, or additional credentials are requested.
Persistence & Privilege
The skill stores state under ~/deep-infra/ (documented) and does not request always:true or other elevated persistent privileges. It does not claim to modify other skills or system-wide settings.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install deep-infra
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /deep-infra 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release with robust DeepInfra model routing and management: - Enables configuration of model selection, provider authentication, and structured fallback chains. - Provides cost-aware defaults and guidance for stable open-source and frontier model workflows. - Documents memory structure for routing profiles, provider setup, incident tracking, and budget control. - Outlines practical setup steps, core routing rules, and common pitfalls to avoid. - Ensures all authentication stays secure and local; secrets never exposed in chat or memory files.
元数据
Slug deep-infra
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Deep Infra 是什么?

Configure DeepInfra model routing with provider auth, model selection, fallback chains, and cost-aware defaults for stable open-source and frontier model wor... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 141 次。

如何安装 Deep Infra?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install deep-infra」即可一键安装,无需额外配置。

Deep Infra 是免费的吗?

是的,Deep Infra 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Deep Infra 支持哪些平台?

Deep Infra 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(linux, darwin, win32)。

谁开发了 Deep Infra?

由 Georgi Atsev(@ats3v)开发并维护,当前版本 v1.0.0。

💬 留言讨论