← 返回 Skills 市场

Volcengine STT

Name: Volcengine STT
Author: reed1898

作者 Reed · GitHub ↗ · v0.2.1

cross-platform ⚠ suspicious

506

总下载

当前安装

版本数

在 OpenClaw 中安装

/install volcengine-stt

功能描述

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...

安全使用建议

Do not assume this skill uses Volcengine/ARK based on its name or SKILL.md. The bundled script actually uploads audio to openspeech.bytedance.com and expects VOLC_APP_ID / VOLC_ACCESS_TOKEN (or reads ~/.openclaw/openclaw.json) — a mismatch that may be accidental or intentional. Before installing: 1) Ask the publisher which provider the skill is intended for and request corrected docs or code. 2) If you must test it, run the script in a sandbox or isolated account and with non-sensitive test audio. 3) Don't provide production credentials until the provider/credential mapping is clarified; if you already supplied keys, consider rotating them. 4) If you expect Volcengine/ARK, either obtain a version that actually calls the ARK endpoints or modify the script accordingly. 5) Be aware the script transmits local audio and may read OpenClaw config for secrets — only run it where you trust that destination and have reviewed the code.

功能分析

Type: OpenClaw Skill Name: volcengine-stt Version: 0.2.1 The skill is designed for transcribing audio to text using Volcengine/ByteDance APIs, a legitimate function. The `SKILL.md` provides clear instructions without any prompt injection attempts. The `transcribe.sh` script uses standard tools (`curl`, `jq`, `base64`) safely, handles API keys from environment variables or `~/.openclaw/openclaw.json` securely via HTTP headers, and connects to legitimate Volcengine/ByteDance API endpoints (e.g., `https://openspeech.bytedance.com/api/v3/auc/bigmodel/`). There is no evidence of malicious intent such as data exfiltration, unauthorized execution, persistence mechanisms, or obfuscation. The script's use of `jq -n --arg` for constructing JSON payloads mitigates injection risks.

能力评估

⚠ Purpose & Capability

SKILL.md and the skill name promise Volcengine (ARK) STT and list ARK_API_KEY / ARK_BASE_URL, but the runnable script posts base64 audio to openspeech.bytedance.com endpoints and uses VOLC_APP_ID / VOLC_ACCESS_TOKEN / VOLC_RESOURCE_ID headers. This is a clear mismatch: either the README is wrong or the script implements a different provider.

⚠ Instruction Scope

The runtime script will read credentials from environment variables or from ~/.openclaw/openclaw.json (via jq), base64-encode local audio, and upload it to external endpoints (openspeech.bytedance.com). SKILL.md does not document the config-file fallback or the actual network endpoints used, so users may be unaware their audio and local config will be transmitted to Bytedance servers.

✓ Install Mechanism

There is no install spec (instruction-only with an included script). No additional packages are automatically downloaded or extracted. The script requires common system tools (curl, jq, base64, uuidgen or /proc UUID) but does not perform external installs.

⚠ Credentials

SKILL.md declares ARK_API_KEY (and ARK_* env vars) as required, but the script actually requires VOLC_APP_ID and VOLC_ACCESS_TOKEN (and optionally VOLC_RESOURCE_ID or values from ~/.openclaw/openclaw.json). The skill therefore asks for credentials that don't match the code, and it also accesses a user config file path not mentioned in the docs.

✓ Persistence & Privilege

The skill does not request permanent 'always' inclusion and does not modify other skills or system-wide settings. Its only elevated access is reading a local OpenClaw config fallback file (~/.openclaw/openclaw.json) to obtain credentials.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install volcengine-stt
安装完成后，直接呼叫该 Skill 的名称或使用 /volcengine-stt 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.2.1

Fix config fallback to prioritize skills.entries.volcengine-stt.env.VOLC_*; keep standard API flow stable

v0.2.0

Switch default to Volcengine standard AUC submit/query mode; add flash mode option; config fallback for appId/accessToken/resourceId

v0.1.0

Initial release: reusable Volcengine/ARK speech-to-text skill for OpenClaw agents

元数据

Slug volcengine-stt

版本 0.2.1

许可证 —

累计安装 5

当前安装数 5

历史版本数 3

常见问题

Volcengine STT 是什么？

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 506 次。

如何安装 Volcengine STT？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install volcengine-stt」即可一键安装，无需额外配置。

Volcengine STT 是免费的吗？

是的，Volcengine STT 完全免费（开源免费），可自由下载、安装和使用。

Volcengine STT 支持哪些平台？

Volcengine STT 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Volcengine STT？

由 Reed（@reed1898）开发并维护，当前版本 v0.2.1。