Groq Voice Transcribe

Author: Timing-up · GitHub · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 421
Favorites: 0
Current installs: 0
Versions: 2
Install in OpenClaw
/install groq-voice-transcribe
Description
Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-...
Usage (SKILL.md)

Groq Voice Transcribe

Fast speech-to-text for voice notes and audio files through Groq's OpenAI-compatible transcription endpoint. Use it when you want cloud transcription via Groq instead of running Whisper locally.

Best for:

  • Telegram / Signal voice notes
  • short audio clips
  • Chinese, English, or mixed daily speech
  • fast transcript generation for follow-up summarization or chat replies

What you need

You need a Groq API key. Groq often offers a free developer tier or trial credits for new users. Get a key from Groq before continuing.

Easiest setup in OpenClaw

If OpenClaw is already running and configured, you can simply ask your assistant:

  • "Configure Groq Voice Transcribe for me"
  • "Here is my Groq API key, set up Groq Voice Transcribe"

The assistant can place the key into ~/.openclaw/openclaw.json for you.

Manual setup

Set GROQ_API_KEY, or configure it in ~/.openclaw/openclaw.json under:

{
  "skills": {
    "entries": {
      "groq-voice-transcribe": {
        "apiKey": "GROQ_KEY_HERE"
      }
    }
  }
}
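
The two key locations above can be combined into one lookup. The following is a minimal sketch, not the skill's actual logic: the function name resolve_groq_key is hypothetical, the precedence (environment variable first, config file second) is an assumption, and the config fallback assumes jq is installed.

```shell
# Sketch: resolve the Groq API key from the environment or the OpenClaw config.
# Assumptions: the environment variable wins, and jq is available for the
# config fallback. The skill's own script may resolve the key differently.
resolve_groq_key() {
  config=${1:-"$HOME/.openclaw/openclaw.json"}

  # 1. A GROQ_API_KEY that is already set is used as-is.
  if [ -n "${GROQ_API_KEY:-}" ]; then
    printf '%s\n' "$GROQ_API_KEY"
    return 0
  fi

  # 2. Otherwise read skills.entries["groq-voice-transcribe"].apiKey.
  if [ -f "$config" ] && command -v jq >/dev/null 2>&1; then
    key=$(jq -r '.skills.entries["groq-voice-transcribe"].apiKey // empty' "$config")
    if [ -n "$key" ]; then
      printf '%s\n' "$key"
      return 0
    fi
  fi

  echo "No Groq API key found" >&2
  return 1
}
```

Calling resolve_groq_key with no argument checks the default config path; passing a path as the first argument checks a different file, which is handy for testing a config before pointing OpenClaw at it.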

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

Defaults:

  • Model: whisper-large-v3-turbo
  • Output: <input>.txt
  • Format: plain text

Common examples

# Basic transcript
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

# Chinese voice message
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --language zh --prompt "中文普通话,日常聊天"

# Save to a custom file
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --out /tmp/transcript.txt

# Verbose JSON output
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --json --out /tmp/transcript.json
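
The same command can be looped over a folder of voice notes. This is a rough sketch under stated assumptions: transcribe_dir and the TRANSCRIBE variable are hypothetical names, and the default script path is a placeholder because {baseDir} depends on where the skill is installed. Only the documented --out flag is used.

```shell
# Hypothetical helper: transcribe every .ogg file in a directory.
# TRANSCRIBE must point at the skill's scripts/transcribe.sh; the default
# below is a placeholder, since {baseDir} depends on the installation.
TRANSCRIBE=${TRANSCRIBE:-./scripts/transcribe.sh}

transcribe_dir() {
  dir=$1
  for f in "$dir"/*.ogg; do
    [ -e "$f" ] || continue      # the glob matched nothing
    out="${f%.ogg}.txt"          # explicit --out, so the default name is not relied on
    "$TRANSCRIBE" "$f" --out "$out" || echo "failed: $f" >&2
  done
}
```

For example, transcribe_dir ~/voice-notes would write one .txt file next to each .ogg file and report any file that failed without stopping the loop.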

Flags

  • --model <name>: transcription model (default whisper-large-v3-turbo)
  • --out <path>: output file path
  • --language <code>: hint the spoken language, for example zh, en, ja
  • --prompt <text>: optional context or spelling hint
  • --json: write verbose JSON instead of plain text

Notes

  • Audio is sent to Groq for transcription.
  • This skill is meant for transcription, not text-to-speech.
  • If language is omitted, Groq can usually auto-detect it, but passing --language zh often helps for Chinese voice notes.
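
To make the first note concrete, here is a sketch of the kind of request such a script sends. It is not the skill's actual transcribe.sh: the function groq_transcribe is hypothetical, the endpoint is the one named in this listing's security review, and the form fields (file, model, language, prompt, response_format) follow the OpenAI-compatible transcription convention, which is assumed to apply here.

```shell
# Sketch only: a hypothetical helper, not the skill's actual transcribe.sh.
GROQ_ENDPOINT="https://api.groq.com/openai/v1/audio/transcriptions"

groq_transcribe() {
  audio=$1          # path to the audio file
  language=${2:-}   # optional language hint, for example zh
  prompt=${3:-}     # optional context or spelling hint

  [ -n "${GROQ_API_KEY:-}" ] || { echo "GROQ_API_KEY is not set" >&2; return 1; }
  [ -f "$audio" ] || { echo "no such file: $audio" >&2; return 1; }

  # Build the multipart form fields; optional ones are added only when given.
  set -- -F "file=@${audio}" -F "model=whisper-large-v3-turbo" -F "response_format=text"
  [ -n "$language" ] && set -- "$@" -F "language=${language}"
  [ -n "$prompt" ] && set -- "$@" -F "prompt=${prompt}"

  # The audio leaves the machine here: it is uploaded to Groq.
  curl -sS --fail "$GROQ_ENDPOINT" \
    -H "Authorization: Bearer ${GROQ_API_KEY}" \
    "$@"
}
```

A call such as groq_transcribe /path/to/audio.ogg zh would print the transcript to standard output. The key is read from the environment rather than passed as an argument, so it does not appear in the function's arguments or shell history.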
Security recommendations
This skill's code does exactly what it says: it uploads a local audio file to https://api.groq.com/openai/v1/audio/transcriptions using a GROQ_API_KEY and writes the transcription to a file. Before installing or using it:

  • Treat the Groq API key as a secret. Do not paste it into public chats. Prefer setting GROQ_API_KEY in your environment or adding it manually to ~/.openclaw/openclaw.json rather than letting the assistant edit config automatically.
  • The registry metadata did not list any required env var, but the script requires GROQ_API_KEY. That is an inconsistency. Confirm where you store the key and how it will be used.
  • The skill will send audio to Groq's cloud for transcription. Only use it for audio you are comfortable sending to an external service.
  • The package has no install step and the included script is small and readable; you can run the script manually to verify behavior before granting the assistant permission to configure anything.
  • The skill source and homepage are unknown. If you need stronger provenance, ask the publisher for a homepage or repository and prefer skills with clear authorship.

If you decide to proceed: set GROQ_API_KEY locally (export GROQ_API_KEY=...) and run the provided script manually to test, or explicitly instruct the assistant how and where to store the key (and confirm it will not be shared elsewhere).
Functional analysis
Type: OpenClaw Skill
Name: groq-voice-transcribe
Version: 1.0.1

The skill provides a straightforward implementation for transcribing audio files using the Groq API. The bash script (scripts/transcribe.sh) correctly handles arguments and uses curl to send audio data to the legitimate Groq endpoint (api.groq.com) using the user-provided API key. No indicators of data exfiltration to unauthorized parties, malicious execution, or prompt injection attacks were found.
Capability assessment
Purpose & Capability
The name and description align with the included script and its purpose (cloud transcription via Groq). However, the registry metadata lists no required environment variables or primary credential, while the script and SKILL.md clearly require a GROQ_API_KEY: a mismatch between the declared requirements and the actual need.
Instruction Scope
SKILL.md and scripts are narrowly scoped to sending an audio file to Groq's transcription endpoint and saving the response. The SKILL.md additionally tells the assistant it can place the API key into ~/.openclaw/openclaw.json — an action that writes user configuration and involves handling a secret. This is explainable for setup but should be done explicitly and securely.
Install Mechanism
No install spec; the skill is instruction + a small shell script. Nothing is downloaded from external/unknown URLs or installed automatically, which keeps installation risk low.
Credentials
Functionally the skill needs one secret (GROQ_API_KEY) which is proportionate. The concern is that registry metadata did not declare any required env vars or primary credential despite the script depending on GROQ_API_KEY; this mismatch is an integrity problem and could lead to misconfiguration or accidental key disclosure if users follow unclear instructions.
Persistence & Privilege
No elevated privileges are requested. The always setting is false, and the skill does not request persistent system-level changes beyond optionally writing the API key into the local OpenClaw config (which is limited in scope).
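How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install groq-voice-transcribe
  3. Once installed, invoke the skill by name or trigger it with /groq-voice-transcribe
  4. Provide the inputs described in the skill's parameter documentation to get structured output.
Version history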
v1.0.1
Improve getting-started guide, Groq API key setup instructions, and OpenClaw-assisted configuration flow.
v1.0.0
Initial release: Groq-based speech-to-text for voice notes and audio files via OpenAI-compatible transcription API.
Metadata
Slug groq-voice-transcribe
Version 1.0.1
License MIT-0
Total installs 0
Current installs 0
Versions 2
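FAQ

What is Groq Voice Transcribe?

Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-... It is an AI agent skill plugin for Claude Code / OpenClaw, with 421 total downloads so far.

How do I install Groq Voice Transcribe?

Run the command "/install groq-voice-transcribe" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration, but the skill still requires a Groq API key, set as described under "What you need".

Is Groq Voice Transcribe free?

Yes. The skill itself is free under the MIT-0 license and can be downloaded, installed, and used without restriction.

Which platforms does Groq Voice Transcribe support?

Groq Voice Transcribe is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Groq Voice Transcribe?

It is developed and maintained by Timing-up (@timing-up). The current version is v1.0.1.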