
Groq Voice Transcribe

Name: Groq Voice Transcribe
Author: timing-up

Author: Timing-up · GitHub · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 421

Install in OpenClaw

/install groq-voice-transcribe

Description

Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-...

Usage instructions (SKILL.md)

Groq Voice Transcribe

Fast speech-to-text for voice notes and audio files through Groq's OpenAI-compatible transcription endpoint. Use it when you want cloud transcription via Groq instead of running Whisper locally.

Best for:

Telegram / Signal voice notes
short audio clips
Chinese, English, or mixed daily speech
fast transcript generation for follow-up summarization or chat replies

What you need

You need a Groq API key. Groq often provides a free developer tier / trial credits for new users. Get one from:

https://console.groq.com/
Sign in, open API Keys, then create a key

Easiest setup in OpenClaw

If OpenClaw is already running and configured, you can simply ask your assistant:

"Configure Groq Voice Transcribe for me"
"Here is my Groq API key, set up Groq Voice Transcribe"

The assistant can place the key into ~/.openclaw/openclaw.json for you.

Manual setup

Set GROQ_API_KEY, or configure it in ~/.openclaw/openclaw.json under:

{
  "skills": {
    "entries": {
      "groq-voice-transcribe": {
        "apiKey": "GROQ_KEY_HERE"
      }
    }
  }
}
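
Before running the script, it can help to confirm that the key is actually visible to the shell. The check below is a minimal sketch and is not part of the skill: `require_groq_key` is a hypothetical helper name, and it only inspects the `GROQ_API_KEY` environment variable, not the entry in ~/.openclaw/openclaw.json.

```shell
# Hypothetical helper (not shipped with the skill): fail early when
# GROQ_API_KEY is missing or empty in the current environment.
require_groq_key() {
  if [ -z "${GROQ_API_KEY:-}" ]; then
    echo "GROQ_API_KEY is not set" >&2
    return 1
  fi
  return 0
}
```

Calling `require_groq_key` before the transcription script gives a clear error message instead of a failed upload when the variable is missing.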

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

Defaults:

Model: whisper-large-v3-turbo
Output: <input>.txt
Format: plain text

Common examples

# Basic transcript
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

# Chinese voice message
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --language zh --prompt "中文普通话，日常聊天"

# Save to a custom file
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --out /tmp/transcript.txt

# Verbose JSON output
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --json --out /tmp/transcript.json

Flags

--model <name>: transcription model (default whisper-large-v3-turbo)
--out <path>: output file path
--language <code>: hint the spoken language, for example zh, en, ja
--prompt <text>: optional context or spelling hint
--json: write verbose JSON instead of plain text
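
To make the flags above more concrete, here is a rough sketch of the kind of `curl` request they could translate into. The endpoint is the one named in the security review on this page, and the form fields (`file`, `model`, `language`, `response_format`) are those of the OpenAI-compatible transcription API. How the script actually maps its flags onto those fields is an assumption, since its source is not reproduced here, and `groq_transcribe` is a hypothetical function name.

```shell
GROQ_URL="https://api.groq.com/openai/v1/audio/transcriptions"

# Hypothetical helper, not the skill's own script.
# Usage: groq_transcribe <audio-file> [language] [response_format]
# The body runs in a subshell, so its variables stay local.
groq_transcribe() (
  audio="$1"
  language="${2:-}"
  format="${3:-text}"   # "text", "json", or "verbose_json"

  # Stop with an error message if the key is missing.
  key="${GROQ_API_KEY:?GROQ_API_KEY is not set}"

  # Build the multipart form fields as positional parameters.
  set -- -F "file=@${audio}" \
         -F "model=whisper-large-v3-turbo" \
         -F "response_format=${format}"

  # Only send a language hint when one was given (assumed --language mapping).
  if [ -n "$language" ]; then
    set -- "$@" -F "language=${language}"
  fi

  # Uploads the audio file; needs network access.
  curl -sS "$GROQ_URL" \
    -H "Authorization: Bearer ${key}" \
    "$@"
)
```

For example, `groq_transcribe /path/to/audio.ogg zh verbose_json > /tmp/transcript.json` would roughly correspond to combining `--language zh` with `--json` and `--out`, under the mapping assumed here.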

Notes

Audio is sent to Groq for transcription.
This skill is meant for transcription, not text-to-speech.
If language is omitted, Groq can usually auto-detect it, but passing --language zh often helps for Chinese voice notes.

Security recommendations

This skill's code does exactly what it says: it uploads a local audio file to https://api.groq.com/openai/v1/audio/transcriptions using a GROQ_API_KEY and writes the transcription to a file.

Before installing or using it:

- Treat the Groq API key as a secret. Do not paste it into public chats. Prefer setting GROQ_API_KEY in your environment or adding it manually to ~/.openclaw/openclaw.json rather than letting the assistant edit the config automatically.
- The registry metadata did not list any required env var, but the script requires GROQ_API_KEY. That is an inconsistency. Confirm where you store the key and how it will be used.
- The skill will send audio to Groq's cloud for transcription. Only use it for audio you are comfortable sending to an external service.
- The package has no install step and the included script is small and readable; you can run the script manually to verify its behavior before granting the assistant permission to configure anything.
- The skill source and homepage are unknown. If you need stronger provenance, ask the publisher for a homepage or repository and prefer skills with clear authorship.

If you decide to proceed: set GROQ_API_KEY locally (export GROQ_API_KEY=...) and run the provided script manually to test, or explicitly instruct the assistant how and where to store the key (and confirm it will not be shared elsewhere).
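
If the key is stored in the config file rather than in the environment, one way to follow the "treat the key as a secret" advice is to limit who can read that file. The snippet below is a sketch under that assumption; `restrict_config` is a hypothetical helper, not something the skill or OpenClaw provides, and the default path is the one mentioned on this page.

```shell
# Hypothetical helper: make a config file readable and writable
# by its owner only. Defaults to the OpenClaw config path.
# The body runs in a subshell, so its variables stay local.
restrict_config() (
  config="${1:-$HOME/.openclaw/openclaw.json}"
  if [ ! -f "$config" ]; then
    echo "no config file at $config" >&2
    return 1
  fi
  chmod 600 "$config"
)
```

Running `restrict_config` with no argument targets the default path; passing a path lets you try it on a scratch file first.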

Functional analysis

Type: OpenClaw Skill
Name: groq-voice-transcribe
Version: 1.0.1

The skill provides a straightforward implementation for transcribing audio files using the Groq API. The bash script (scripts/transcribe.sh) correctly handles arguments and uses curl to send audio data to the legitimate Groq endpoint (api.groq.com) using the user-provided API key. No indicators of data exfiltration to unauthorized parties, malicious execution, or prompt injection attacks were found.

Capability assessment

⚠ Purpose & Capability

Name/description align with the included script and purpose (cloud transcription via Groq). However, the registry metadata lists no required environment variables or primary credential while the script and SKILL.md clearly require a GROQ_API_KEY — an incoherence between declared requirements and actual need.

ℹ Instruction Scope

SKILL.md and scripts are narrowly scoped to sending an audio file to Groq's transcription endpoint and saving the response. The SKILL.md additionally tells the assistant it can place the API key into ~/.openclaw/openclaw.json — an action that writes user configuration and involves handling a secret. This is explainable for setup but should be done explicitly and securely.

✓ Install Mechanism

No install spec; the skill is instruction + a small shell script. Nothing is downloaded from external/unknown URLs or installed automatically, which keeps installation risk low.

⚠ Credentials

Functionally the skill needs one secret (GROQ_API_KEY) which is proportionate. The concern is that registry metadata did not declare any required env vars or primary credential despite the script depending on GROQ_API_KEY; this mismatch is an integrity problem and could lead to misconfiguration or accidental key disclosure if users follow unclear instructions.

✓ Persistence & Privilege

No elevated privileges requested. always is false and the skill does not request persistent system-level changes beyond optionally writing the API key into the local OpenClaw config (which is limited in scope).

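How to use

Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the chat box: /install groq-voice-transcribe
Once installed, invoke the skill by name or trigger it with /groq-voice-transcribe
Provide the inputs described in the skill's parameter documentation to get structured output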

Version history

v1.0.1

Improve getting-started guide, Groq API key setup instructions, and OpenClaw-assisted configuration flow.

v1.0.0

Initial release: Groq-based speech-to-text for voice notes and audio files via OpenAI-compatible transcription API.

Metadata

Slug groq-voice-transcribe

Version 1.0.1

License MIT-0

Total installs 0

Current installs 0

Version count 2

FAQ