← 返回 Skills 市场
reed1898

Volcengine STT

作者 Reed · GitHub ↗ · v0.2.1
cross-platform ⚠ suspicious
506
总下载
2
收藏
5
当前安装
3
版本数
在 OpenClaw 中安装
/install volcengine-stt
功能描述
Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...
安全使用建议
Do not assume this skill uses Volcengine/ARK based on its name or SKILL.md. The bundled script actually uploads audio to openspeech.bytedance.com and expects VOLC_APP_ID / VOLC_ACCESS_TOKEN (or reads ~/.openclaw/openclaw.json) — a mismatch that may be accidental or intentional. Before installing: 1) Ask the publisher which provider the skill is intended for and request corrected docs or code. 2) If you must test it, run the script in a sandbox or isolated account and with non-sensitive test audio. 3) Don't provide production credentials until the provider/credential mapping is clarified; if you already supplied keys, consider rotating them. 4) If you expect Volcengine/ARK, either obtain a version that actually calls the ARK endpoints or modify the script accordingly. 5) Be aware the script transmits local audio and may read OpenClaw config for secrets — only run it where you trust that destination and have reviewed the code.
功能分析
Type: OpenClaw Skill Name: volcengine-stt Version: 0.2.1 The skill is designed for transcribing audio to text using Volcengine/ByteDance APIs, a legitimate function. The `SKILL.md` provides clear instructions without any prompt injection attempts. The `transcribe.sh` script uses standard tools (`curl`, `jq`, `base64`) safely, handles API keys from environment variables or `~/.openclaw/openclaw.json` securely via HTTP headers, and connects to legitimate Volcengine/ByteDance API endpoints (e.g., `https://openspeech.bytedance.com/api/v3/auc/bigmodel/`). There is no evidence of malicious intent such as data exfiltration, unauthorized execution, persistence mechanisms, or obfuscation. The script's use of `jq -n --arg` for constructing JSON payloads mitigates injection risks.
能力评估
Purpose & Capability
SKILL.md and the skill name promise Volcengine (ARK) STT and list ARK_API_KEY / ARK_BASE_URL, but the runnable script posts base64 audio to openspeech.bytedance.com endpoints and uses VOLC_APP_ID / VOLC_ACCESS_TOKEN / VOLC_RESOURCE_ID headers. This is a clear mismatch: either the README is wrong or the script implements a different provider.
Instruction Scope
The runtime script will read credentials from environment variables or from ~/.openclaw/openclaw.json (via jq), base64-encode local audio, and upload it to external endpoints (openspeech.bytedance.com). SKILL.md does not document the config-file fallback or the actual network endpoints used, so users may be unaware their audio and local config will be transmitted to Bytedance servers.
Install Mechanism
There is no install spec (instruction-only with an included script). No additional packages are automatically downloaded or extracted. The script requires common system tools (curl, jq, base64, uuidgen or /proc UUID) but does not perform external installs.
Credentials
SKILL.md declares ARK_API_KEY (and ARK_* env vars) as required, but the script actually requires VOLC_APP_ID and VOLC_ACCESS_TOKEN (and optionally VOLC_RESOURCE_ID or values from ~/.openclaw/openclaw.json). The skill therefore asks for credentials that don't match the code, and it also accesses a user config file path not mentioned in the docs.
Persistence & Privilege
The skill does not request permanent 'always' inclusion and does not modify other skills or system-wide settings. Its only elevated access is reading a local OpenClaw config fallback file (~/.openclaw/openclaw.json) to obtain credentials.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install volcengine-stt
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /volcengine-stt 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.2.1
Fix config fallback to prioritize skills.entries.volcengine-stt.env.VOLC_*; keep standard API flow stable
v0.2.0
Switch default to Volcengine standard AUC submit/query mode; add flash mode option; config fallback for appId/accessToken/resourceId
v0.1.0
Initial release: reusable Volcengine/ARK speech-to-text skill for OpenClaw agents
元数据
Slug volcengine-stt
版本 0.2.1
许可证
累计安装 5
当前安装数 5
历史版本数 3
常见问题

Volcengine STT 是什么?

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 506 次。

如何安装 Volcengine STT?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install volcengine-stt」即可一键安装,无需额外配置。

Volcengine STT 是免费的吗?

是的,Volcengine STT 完全免费(开源免费),可自由下载、安装和使用。

Volcengine STT 支持哪些平台?

Volcengine STT 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Volcengine STT?

由 Reed(@reed1898)开发并维护,当前版本 v0.2.1。

💬 留言讨论