← 返回 Skills 市场
jessewunderlich

Deepgram Transcribe

作者 Jesse Wunderlich · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
423
总下载
0
收藏
2
当前安装
1
版本数
在 OpenClaw 中安装
/install deepgram-transcribe
功能描述
Transcribe audio via Deepgram Nova-3 API (5.26% WER, 40x faster than Whisper, built-in speaker diarization). Use when user asks to transcribe audio, podcasts...
使用说明 (SKILL.md)

Deepgram Nova-3 Transcription

Transcribe audio files using Deepgram's Nova-3 model — more accurate and faster than OpenAI Whisper.

Why Deepgram over Whisper

  • 5.26% word error rate (vs ~8-10% for Whisper)
  • 40x faster for batch processing
  • Built-in speaker diarization (who said what)
  • Smart formatting (numbers, dates, punctuation)
  • $200 free credits on signup at deepgram.com

Setup

Store your API key:

echo "YOUR_DEEPGRAM_API_KEY" > ~/.openclaw/secrets/deepgram-api-key.txt

Or set the environment variable:

export DEEPGRAM_API_KEY="your-key-here"

Usage

bash scripts/transcribe.sh /path/to/audio.mp3
bash scripts/transcribe.sh recording.mp3 --out transcript.txt
bash scripts/transcribe.sh recording.mp3 --json --out full.json
bash scripts/transcribe.sh recording.mp3 --language es
bash scripts/transcribe.sh recording.mp3 --model nova-2

Models

Model WER Cost/min Best for
nova-3 (default) 5.26% $0.0077 Best accuracy
nova-2 ~8% $0.0043 Budget-friendly
whisper-large ~8-10% $0.0048 Whisper parity

Supported Formats

mp3, wav, m4a, ogg, flac, webm, aiff

Fallback

If DEEPGRAM_API_KEY is not set, use the OpenAI Whisper skill instead (if installed).

安全使用建议
This skill is coherent with its stated purpose, but check these before installing: - It will send audio files to Deepgram (api.deepgram.com). Only use it for audio you are comfortable transmitting to that provider (PII/sensitive content consideration). - Provide a Deepgram API key either as environment variable DEEPGRAM_API_KEY or by storing it at ~/.openclaw/secrets/deepgram-api-key.txt. Prefer secure file permissions (chmod 600) if you use the secrets file. - The package metadata did not list required binaries; the script requires bash (already used), curl, and python3 to run. Ensure those are available and up-to-date. - The script parses remote responses with embedded Python; while it appears to only parse JSON, avoid running it on untrusted systems or modifying it without understanding the parsing behavior. - If you need offline/local transcription only, consider alternatives; otherwise the behavior and credentials requested are proportionate for a Deepgram integration.
功能分析
Type: OpenClaw Skill Name: deepgram-transcribe Version: 1.0.0 The skill bundle is designed for Deepgram audio transcription, and its `SKILL.md` instructions are benign. However, the `scripts/transcribe.sh` script contains a potential shell injection vulnerability. The `$RESPONSE` variable, which holds the Deepgram API's JSON output, is directly embedded into a Python command's string literal (`python3 -c "... d = json.loads('''$RESPONSE''') ..."`). If an attacker could manipulate the Deepgram API response (e.g., via a compromised API or MITM), they could inject arbitrary Python code, leading to remote code execution. This is a vulnerability, not clear malicious intent, hence 'suspicious'.
能力评估
Purpose & Capability
Name/description match the implementation: the script posts audio to Deepgram's API and parses the response. However the registry metadata does not declare the DEEPGRAM_API_KEY environment variable nor the runtime binaries the script uses (bash, curl, python3), which is an omission/inconsistency.
Instruction Scope
SKILL.md and the script limit actions to reading an audio file, reading an API key from DEEPGRAM_API_KEY or ~/.openclaw/secrets/deepgram-api-key.txt, POSTing the audio to api.deepgram.com, and writing transcript output. There are no instructions to read unrelated files or exfiltrate other system data.
Install Mechanism
No install spec (instruction-only plus a local shell script) — nothing is downloaded or extracted. This is the lower-risk model for skills.
Credentials
The skill legitimately requires a Deepgram API key (DEEPGRAM_API_KEY or the file under ~/.openclaw/secrets). That is proportionate for its purpose, but the registry's declared requirements did not list this credential; the omission should be corrected/acknowledged.
Persistence & Privilege
The skill does not request persistent 'always' inclusion and does not modify other skills or system-wide agent settings. It reads/writes only a per-user secret file location and user-specified output files.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install deepgram-transcribe
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /deepgram-transcribe 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release
元数据
Slug deepgram-transcribe
版本 1.0.0
许可证
累计安装 2
当前安装数 2
历史版本数 1
常见问题

Deepgram Transcribe 是什么?

Transcribe audio via Deepgram Nova-3 API (5.26% WER, 40x faster than Whisper, built-in speaker diarization). Use when user asks to transcribe audio, podcasts... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 423 次。

如何安装 Deepgram Transcribe?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install deepgram-transcribe」即可一键安装,无需额外配置。

Deepgram Transcribe 是免费的吗?

是的,Deepgram Transcribe 完全免费(开源免费),可自由下载、安装和使用。

Deepgram Transcribe 支持哪些平台?

Deepgram Transcribe 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Deepgram Transcribe?

由 Jesse Wunderlich(@jessewunderlich)开发并维护,当前版本 v1.0.0。

💬 留言讨论