kirkraman

transcribe

Author: KirkRaman · GitHub · v1.0.0 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 70
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install kirk-transcribe
Description
Speech-to-text via SkillBoss API Hub (STT, powered by Whisper and more).
Usage (SKILL.md)

Whisper STT via SkillBoss API Hub

Use the SkillBoss API Hub /v1/pilot endpoint to transcribe audio (speech-to-text, STT), powered by OpenAI Whisper and other speech recognition models.

Quick start (Python)

import requests, base64, os

SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"]
API_BASE = "https://api.skillbossai.com/v1"

def pilot(body: dict) -> dict:
    r = requests.post(
        f"{API_BASE}/pilot",
        headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"},
        json=body,
        timeout=60,
    )
    r.raise_for_status()  # surface HTTP errors instead of parsing an error body
    return r.json()

# Transcribe audio file
with open("/path/audio.mp3", "rb") as f:  # close the file handle after reading
    audio_b64 = base64.b64encode(f.read()).decode()
result = pilot({"type": "stt", "inputs": {"audio_data": audio_b64, "filename": "audio.mp3"}})
text = result["result"]["text"]
print(text)

# Translate audio to English (reuses audio_b64, so the filename matches the encoded file)
result = pilot({"type": "stt", "inputs": {"audio_data": audio_b64, "filename": "audio.mp3", "task": "translate"}})
text = result["result"]["text"]
print(text)

Notes

  • No local model download required; SkillBoss API Hub automatically routes to the best STT model.
  • SKILLBOSS_API_KEY environment variable required.
  • Response text is at result["result"]["text"].
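
The request and response shapes described above can be wrapped in two small helpers. This is a minimal sketch, not part of the published skill: build_stt_body and extract_text are hypothetical names, and the only fields assumed are the ones the quick start already uses (type, inputs.audio_data, inputs.filename, inputs.task, and result.text).

```python
import base64
from pathlib import Path


def build_stt_body(audio_path: str, translate: bool = False) -> dict:
    """Build a /v1/pilot request body for one audio file (hypothetical helper)."""
    path = Path(audio_path)
    audio_b64 = base64.b64encode(path.read_bytes()).decode("ascii")
    # Derive the filename from the path so it always matches the encoded data.
    inputs = {"audio_data": audio_b64, "filename": path.name}
    if translate:
        inputs["task"] = "translate"  # same field the quick start uses
    return {"type": "stt", "inputs": inputs}


def extract_text(response: dict) -> str:
    """Return response["result"]["text"], or raise with the raw payload attached."""
    try:
        return response["result"]["text"]
    except (KeyError, TypeError) as exc:
        raise ValueError(f"Unexpected response shape: {response!r}") from exc
```

With the pilot function from the quick start, a call would then read extract_text(pilot(build_stt_body("/path/audio.mp3"))). Raising on an unexpected shape keeps an error payload from surfacing as a bare KeyError.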
Security recommendations
This skill will transmit any audio you provide to api.skillbossai.com using the SKILLBOSS_API_KEY. Before installing: confirm you trust SkillBoss (review their privacy/security and pricing), use an API key scoped with minimal privileges, avoid sending sensitive audio to the service, and be prepared to rotate the key if needed. Also verify the publisher (registry owner and homepage) since the SKILL.md label ('openai-whisper') differs from the registry name — likely harmless but worth checking. Finally, ensure the agent is only given the specific audio files you intend to transcribe.
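
One way to limit which files can be uploaded is to check each path against an allowed directory and a set of audio extensions before reading it. The sketch below is illustrative only: is_allowed_audio and ALLOWED_SUFFIXES are hypothetical names, and the extension list is an assumption rather than a statement of which formats the service accepts.

```python
from pathlib import Path

# Assumption: adjust to the audio formats you actually intend to send.
ALLOWED_SUFFIXES = {".mp3", ".m4a", ".wav"}


def is_allowed_audio(path: str, allowed_dir: str) -> bool:
    """True only if path resolves inside allowed_dir and has an audio suffix."""
    resolved = Path(path).resolve()  # collapses ".." and follows symlinks
    root = Path(allowed_dir).resolve()
    return resolved.suffix.lower() in ALLOWED_SUFFIXES and root in resolved.parents
```

Resolving both paths first means a path such as allowed_dir/../secret.mp3 is rejected even though it begins with the allowed directory.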
Functional analysis
Type: OpenClaw Skill
Name: kirk-transcribe
Version: 1.0.0
The skill bundle provides a standard interface for speech-to-text transcription using the SkillBoss API Hub (api.skillbossai.com). The code and instructions in SKILL.md are consistent with the stated purpose, requiring only the necessary API key and audio data to function, with no evidence of malicious intent, obfuscation, or unauthorized data access.
Capability tags
requires-sensitive-credentials
Capability assessment
Purpose & Capability
The skill's description (transcribe via SkillBoss) matches the runtime instructions which call https://api.skillbossai.com/v1/pilot. Minor inconsistency: SKILL.md uses name 'openai-whisper' while registry name is 'transcribe' and source is listed as unknown — this is likely cosmetic but worth verifying the publisher.
Instruction Scope
Instructions show reading a local audio file, base64-encoding it, and POSTing it to api.skillbossai.com with the SKILLBOSS_API_KEY. This is expected for an STT skill, but it does mean any audio you feed will be transmitted to an external service — confirm you are comfortable sending that data and that the agent will only read intended files.
Install Mechanism
No install spec and no code files (instruction-only). This minimizes local persistence and disk writes.
Credentials
Only SKILLBOSS_API_KEY is required, which is proportionate to calling a third-party API. No other unrelated secrets or config paths are requested.
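
Because the key is the only credential involved, it can help to fail with a clear message when it is missing rather than with a bare KeyError. A minimal sketch, assuming a hypothetical load_api_key helper; the optional env argument exists only so the function can be exercised without touching the real environment.

```python
import os


def load_api_key(env=None) -> str:
    """Read SKILLBOSS_API_KEY from the environment, or raise a readable error."""
    env = os.environ if env is None else env
    key = env.get("SKILLBOSS_API_KEY", "").strip()
    if not key:
        raise RuntimeError("SKILLBOSS_API_KEY is not set; export it before calling the API.")
    return key
```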
Persistence & Privilege
always is false and the skill does not request elevated or persistent platform privileges. It does allow normal autonomous invocation (platform default).
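How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install kirk-transcribe
  3. Once installed, invoke the Skill by name or trigger it with /kirk-transcribe
  4. Provide the required inputs described in the Skill's parameters to get structured output
Version history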
v1.0.0
  • Initial release of speech-to-text skill powered by Whisper and other models via SkillBoss API Hub.
  • Supports audio transcription and translation to English using a simple API call.
  • No local model setup required; just set the SKILLBOSS_API_KEY environment variable.
  • Includes Python quick start guide for immediate use.
  • Clear instructions for retrieving transcribed or translated text from API responses.
Metadata
Slug: kirk-transcribe
Version: 1.0.0
License: MIT-0
Total installs: 0
Current installs: 0
Versions: 1
FAQ

What is transcribe?

Speech-to-text via SkillBoss API Hub (STT, powered by Whisper and more). It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 70 times so far.

How do I install transcribe?

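Run the command /install kirk-transcribe in the OpenClaw or Claude Code chat box to install it in one step. To use it, the SKILLBOSS_API_KEY environment variable must be set.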

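Is transcribe free?

Yes, the skill itself is free: it is released under the MIT-0 license and can be freely downloaded, installed, and used. Calls go through SkillBoss API Hub, so review SkillBoss pricing separately.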

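Which platforms does transcribe support?

transcribe is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed transcribe?

It is developed and maintained by KirkRaman (@kirkraman). The current version is v1.0.0.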
