← 返回 Skills 市场
upstage-deployment

Upstage Builder

作者 Upstage Deployment · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
29
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install upstage-builder
功能描述
Build pipelines, agents, RAG flows, and full web services by combining Upstage Solar models, embeddings, and document APIs. Use when building, scaffolding, o...
使用说明 (SKILL.md)

upstage-builder — Upstage API and Webapp Delivery Skill

You are an expert at generating code that uses the Upstage API (api.upstage.ai). When the user asks you to build features or services using Upstage/Solar models, follow this guide.

For full webapp requests, do not stop at code generation. Treat project location, environment variables, deployment method, and shareable URL delivery as part of the task.

Webapp Setup Rules

When the user asks for a full web service/app built with Upstage, follow this startup flow:

  1. If deployment system and project root are not already known, ask once:
    • Which deployment system should be used?
    • Which project root should be used?
  2. If defaults are already configured, use them.
  3. For this installation, default to:
    • project root: /data/.openclaw/workspace/projects
    • deployment provider: vercel
    • visibility mode: password-protected
  4. Prefer password-protected or private delivery over public delivery unless the user explicitly asks for public access.
  5. Create one folder per app under the configured project root.
  6. Return these at the end whenever possible:
    • project path
    • stack used
    • required environment variables
    • deployment method
    • visibility mode
    • external deployment URL, or the exact next step if deployment could not be completed
    • site password if password-protected mode was used

Read references/webapp-workflow.md for the full project/deployment workflow.

Quick Start

Upstage APIs are OpenAI SDK compatible. Just change base_url:

from openai import OpenAI
import os

client = OpenAI(
    api_key=os.environ["UPSTAGE_API_KEY"],
    base_url="https://api.upstage.ai/v1"
)

response = client.chat.completions.create(
    model="solar-pro3",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

API Key: Always use os.environ["UPSTAGE_API_KEY"]. Never hardcode keys. Users get their key from console.upstage.ai.


Output Files

When generated code writes intermediate result files (extracted JSON, parsed markdown, embeddings cache, etc.):

  • Default: \x3Csystem-temp>/\x3Cinput-stem>.\x3Csuffix>.\x3Cext> (e.g., /tmp/receipt.ocr.json, /tmp/report.parsed.md). Use tempfile.gettempdir() for cross-platform code.
  • Override: if the user specifies an output path, use it.
  • Always print the resolved absolute path so the user can locate the file.

This rule does NOT apply to webapp scaffolding (project root, .env, DEPLOY.md) — those follow the configured project root in Webapp Setup Rules above.

Per-API suffix convention (matches the dedicated specialty skills):

API Suffix Common ext
OCR .ocr .json
Document Parse .parsed .md, .html
Document Classification .classified .json
Information Extraction .extracted .json
Schema Generation .schema .json
Agent (Studio) .agent (or .\x3Cstep-name> per step) .json
Solar (delegated) .solar (with timestamp prefix) .md, .txt

Model Catalog

Chat Models

Model Description Context Best For
solar-pro3 Flagship (102B MoE, 12B active) 128K Complex reasoning, function calling, structured output
solar-pro2 Previous gen flagship (31B) 65K General tasks, good balance
solar-mini Lightweight, fast (10.7B) 32K Cost-sensitive, simple tasks
syn-pro Synthetic data optimized - Data generation (no function calling)

Embedding Models

Model Description Dimensions
embedding-query For search queries/questions 4096
embedding-passage For documents/passages to search 4096

Document Models

Model Description
ocr Text extraction with word-level coordinates
document-parse Convert docs to HTML/Markdown with layout detection
document-classify Classify documents into user-defined categories
information-extract Extract structured data with custom JSON schema
schema-generate Auto-generate extraction schemas from sample docs
receipt-extraction Prebuilt: extract from receipts

Model Selection Guide

Your Need Use This Model
Complex reasoning, coding solar-pro3 with reasoning_effort: "high"
Fast simple responses solar-mini
Cost-sensitive production solar-mini
Synthetic data generation syn-pro
Function calling / tool use solar-pro3 (parallel tool calls supported)
Structured JSON output solar-pro3, solar-pro2, or solar-mini
Semantic search (queries) embedding-query
Semantic search (documents) embedding-passage
PDF/image → text ocr
PDF/image → markdown/HTML document-parse
Extract fields from docs information-extract
Classify document types document-classify

API Categories

1. Chat Completions

Endpoint: POST /v1/chat/completions

  • Standard chat, streaming, function calling, structured output, reasoning, prompt caching
  • OpenAI SDK compatible (just change base_url)
  • Details: Read references/chat-completions.md

2. Embeddings

Endpoint: POST /v1/embeddings

  • Dual-model: embedding-query for queries, embedding-passage for documents
  • 4096-dimensional normalized vectors (dot product = cosine similarity)
  • Max 100 texts per batch, 4000 tokens per text
  • Details: Read references/embeddings.md

3. Document Processing (OCR + Parse + Split)

Endpoints: POST /v1/document-digitization, POST /v1/document-digitization/async

  • OCR: word-level text extraction with bounding boxes
  • Parse: converts PDF/images to structured HTML/Markdown, chart recognition, equation LaTeX
  • Sync (≤100 pages) and Async (≤1000 pages) modes
  • Split: via Classification API with split=true
  • Details: Read references/document-processing.md

4. Information Extraction

Endpoints: POST /v1/information-extraction, POST /v1/information-extraction/async

  • Custom schema-based extraction from documents
  • Schema Generation: auto-generate schemas from sample docs
  • Prebuilt models: receipt, air waybill, bill of lading, commercial invoice, KR export declaration
  • OpenAI SDK compatible (base_url changes to /v1/information-extraction)
  • Details: Read references/information-extraction.md

5. Document Classification

Endpoint: POST /v1/document-classification

  • Classify into user-defined categories with confidence scores
  • Document split feature (split=true) for multi-doc PDFs
  • OpenAI SDK compatible (base_url changes to /v1/document-classification)
  • Details: Read references/document-classification.md

6. Agent API (Studio Workflows)

Base URL: https://api.upstage.ai/v2 (v2, not v1)

  • Multi-step workflows configured in Upstage Studio
  • File upload → Agent job → Poll for results
  • OpenAI Responses API compatible
  • Details: Read references/agent-api.md

7. Common Patterns & Error Handling

  • Error codes, rate limits, retry strategies, SDK setup
  • RAG pipeline, document routing, batch processing patterns
  • Details: Read references/common-patterns.md

Code Generation Guidelines

When generating Upstage API code, follow these rules:

  1. Always use OpenAI SDK unless the API requires multipart/form-data (OCR, Document Parse, Prebuilt IE)
  2. API key from environment: os.environ["UPSTAGE_API_KEY"] — never hardcode
  3. base_url varies by API:
    • Chat & Embeddings: https://api.upstage.ai/v1
    • Document Classification: https://api.upstage.ai/v1/document-classification
    • Information Extraction: https://api.upstage.ai/v1/information-extraction
    • Agent API: https://api.upstage.ai/v2
  4. Use model aliases (e.g., solar-pro3), not version-specific names
  5. Default to solar-pro3 for complex tasks, solar-mini for simple/cost-sensitive
  6. For document APIs using multipart/form-data, use requests library directly
  7. For RAG pipelines: use embedding-passage for indexing, embedding-query for search
  8. Include error handling: catch openai.RateLimitError with exponential backoff

Key Differences from OpenAI

  • Reasoning uses reasoning_effort param (not separate reasoning model)
  • Embeddings use dual-model approach (query vs passage) — not a single model
  • Document APIs are unique to Upstage (OCR, Parse, IE, Classification)
  • Structured outputs require strict: true and additionalProperties: false
  • IE schemas: first-level properties must be string/integer/number/array (no objects at top level)

Examples

  • Basic chat: See examples/chat-example.py
  • RAG pipeline: See examples/rag-example.py
  • Document processing: See examples/document-example.py \x3Cpath/to/document.pdf>
  • Smoke test: Install requirements.txt, then run python scripts/smoke_test.py to verify chat, embeddings, and optional design registry access
  • Reference refresh: Run python scripts/refresh_references.py to pull the latest API reference snapshots into references/
  • Webapp project init: Run python scripts/init_webapp_project.py \x3Cproject-slug> to create a standard app folder with README.md, .env.example, and DEPLOY.md

Reference Files

When you need detailed API parameters, response formats, or advanced features, read the appropriate reference file. These files are generated snapshots; refresh them with python scripts/refresh_references.py when you want the latest upstream docs.

File Content
references/chat-completions.md Full Chat API: params, function calling, structured output, streaming, reasoning, prompt caching
references/embeddings.md Embeddings API: query/passage models, batch processing, similarity
references/document-processing.md OCR, Document Parse (sync/async), Document Split
references/information-extraction.md IE (sync/async), Schema Generation, Prebuilt IE
references/document-classification.md Classification API with confidence scores
references/agent-api.md Agent API v2: Studio workflows, file upload, jobs
references/common-patterns.md Error handling, rate limits, auth, RAG/routing/batch patterns
references/webapp-workflow.md Project-root, deployment-provider, and delivery workflow for full webapp tasks

Source URLs

Original sources for reference files. Running python scripts/refresh_references.py fetches the latest content from these URLs and updates the references/ files.

Source URL Auth Updates
Upstage API Docs https://console.upstage.ai/api/docs/for-agents/raw None references/chat-completions.md ~ common-patterns.md (7 files)
安全使用建议
Install only if you intend to build against Upstage APIs. Keep UPSTAGE_API_KEY in environment variables, review any generated deployment steps before running them, and avoid sending confidential documents unless Upstage’s data handling terms meet your needs.
功能分析
Type: OpenClaw Skill Name: upstage-builder Version: 1.0.0 The upstage-builder skill bundle is a legitimate developer tool designed to help AI agents scaffold, build, and deploy applications using Upstage's Solar LLM and document APIs. The bundle includes comprehensive documentation, functional examples for RAG and document processing, and utility scripts for project initialization (init_webapp_project.py) and documentation synchronization (refresh_references.py). The instructions in SKILL.md are well-aligned with the stated purpose, emphasizing security best practices such as using environment variables for API keys and preferring password-protected deployments. The scripts perform necessary file and network operations with appropriate safeguards, such as input slugification to prevent path traversal.
能力标签
cryptocan-make-purchasesrequires-sensitive-credentials
能力评估
Purpose & Capability
The stated purpose—building Upstage chat, document, RAG, agent, and webapp workflows—matches the provided examples and references. Sensitive actions such as API calls, document processing, and deployment are expected for this purpose.
Instruction Scope
For full webapp requests, the skill instructs the agent to include deployment and URL delivery, using defaults if configured. This is disclosed and purpose-aligned, but users should confirm deployment target and visibility.
Install Mechanism
There is no install spec and no evidence of automatic package installation or script execution during install. Included examples/scripts are not shown as auto-executed in the provided artifacts.
Credentials
Use of UPSTAGE_API_KEY and network calls to api.upstage.ai are proportionate to the Upstage integration, but the key can authorize paid API usage and should be handled carefully.
Persistence & Privilege
The skill may create project folders and deployed webapps for full-service requests, with password-protected visibility preferred by default. No hidden background persistence is evidenced.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install upstage-builder
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /upstage-builder 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
upstage-builder 1.0.0 – Initial Release - Provides a unified skill to build pipelines, agents, RAG flows, and full web services using Upstage Solar models, embeddings, and document APIs. - Supports project scaffolding, deployment provider selection (default: Vercel), and guided full webapp delivery, including stack, environment, and visibility mode. - Covers Solar models (Pro3/Pro2/Mini), embedding models, OCR, document parse, classification, schema generation, and Agent API. - Includes clear file output conventions and security defaults (password-protected webapps). - Offers model selection guide, API usage patterns, and compatibility notes for OpenAI SDK.
元数据
Slug upstage-builder
版本 1.0.0
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Upstage Builder 是什么?

Build pipelines, agents, RAG flows, and full web services by combining Upstage Solar models, embeddings, and document APIs. Use when building, scaffolding, o... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 29 次。

如何安装 Upstage Builder?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install upstage-builder」即可一键安装,无需额外配置。

Upstage Builder 是免费的吗?

是的,Upstage Builder 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Upstage Builder 支持哪些平台?

Upstage Builder 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Upstage Builder?

由 Upstage Deployment(@upstage-deployment)开发并维护,当前版本 v1.0.0。

💬 留言讨论