← 返回 Skills 市场
dreadterror

Telegram Voice Transcribe

作者 Alfredo Bárcena · GitHub ↗ · v1.3.1 · MIT-0
cross-platform ✓ 安全检测通过
548
总下载
0
收藏
2
当前安装
2
版本数
在 OpenClaw 中安装
/install telegram-voice-transcribe
功能描述
Transcribe Telegram voice messages and audio notes into text using the OpenAI Whisper API. Use when (1) a user sends a voice message or audio note via Telegr...
安全使用建议
This skill is generally coherent for transcribing Telegram voice notes, but check these before installing or enabling it: - Metadata mismatch: the registry metadata claims no required env vars but SKILL.md and the script require OPENAI_API_KEY (for API mode) and TELEGRAM_BOT_TOKEN (for --file-id). Confirm which mode you will use and only provision the minimum credentials. - Privacy vs API: local (--local) mode keeps audio on your server; API mode sends audio to OpenAI. If privacy is required, use --local and ensure you have the required local models and resources (disk, RAM, possibly GPU). - Resource & dependencies: installing openai-whisper / torch and downloading models (small/medium) can use hundreds of MBs to GBs and CPU/GPU time. Plan for that on your host. - Claimed auto-download: CLAWHUB.md states an auto-download of static ffmpeg; the script only tries to import static_ffmpeg. Verify whether your environment will actually provide ffmpeg or static_ffmpeg — do not assume the skill will fetch executables from external servers. - Network flows: in file-id and API modes the skill calls Telegram API and (optionally) OpenAI API; if you have egress controls, permit only those hosts. The URL mode will fetch arbitrary URLs you pass — avoid untrusted URLs to reduce risk. - Inspect hooks: if you install the suggested pre-processing hook into OpenClaw, review the hook code to confirm it runs with appropriate timeouts and environment and does not expose extra secrets. If these points are acceptable and you only grant the minimal env vars needed for your chosen mode, the skill is consistent with its stated purpose.
功能分析
Type: OpenClaw Skill Name: telegram-voice-transcribe Version: 1.3.1 The skill 'telegram-voice-transcribe' is a legitimate tool for transcribing Telegram voice notes using either the OpenAI Whisper API or a local Whisper model. The Python script 'scripts/transcribe.py' handles file downloads from Telegram's official API and performs transcription as described, with proper cleanup of temporary files and no evidence of data exfiltration or unauthorized execution. The documentation in 'SKILL.md' and 'references/setup.md' provides clear, functional instructions for the AI agent and system administrators without any malicious prompt injection or suspicious requirements.
能力评估
Purpose & Capability
Name, description, SKILL.md and the script align: the script supports Telegram file_id, local files, and URL modes and implements both local Whisper and OpenAI API transcription. Requested secrets (OPENAI_API_KEY, TELEGRAM_BOT_TOKEN) are appropriate to the stated purpose. However the registry metadata claims no required env vars while SKILL.md and code require OPENAI_API_KEY (for API mode) and TELEGRAM_BOT_TOKEN (for --file-id); this mismatch is a documentation inconsistency to resolve.
Instruction Scope
Instructions limit actions to fetching Telegram files (via Telegram API), optionally downloading a URL or reading a local file, then transcribing locally or via OpenAI. The example hooks run the script as a preprocessor and prepend the transcript to the message text — all within the stated transcription workflow. No unrelated system paths or credentials are referenced by the runtime instructions.
Install Mechanism
There is no automated install spec (instruction-only plus a script file). Setup docs recommend pip installing openai-whisper / static-ffmpeg / torch which is typical. CLAWHUB.md claims an 'auto-fallback — downloads static ffmpeg', but the actual script only tries to import static_ffmpeg if ffmpeg is absent and does not itself download a binary from an arbitrary host; this is a documentation/claim mismatch to verify.
Credentials
The only sensitive env vars used by the code are OPENAI_API_KEY (for API mode) and TELEGRAM_BOT_TOKEN (for downloading files by file_id), which are proportionate to the functionality. Again, registry metadata incorrectly lists none required — ensure your environment configuration and permission scoping (bot token scope) are deliberate.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or system-wide settings. The recommended hook runs the script as a pre-processor (normal behavior) and the script cleans up temporary files; autonomous invocation is default but not combined with other high privileges here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install telegram-voice-transcribe
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /telegram-voice-transcribe 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.3.1
- Added CLAWHUB.md file for integration with the CLAW ecosystem. - No changes to functionality or documentation outside of this ecosystem addition.
v1.3.0
v1.3.0 — Initial release. Local Whisper transcription (model small), static-ffmpeg fallback, tested on production with real Telegram voice notes.
元数据
Slug telegram-voice-transcribe
版本 1.3.1
许可证 MIT-0
累计安装 3
当前安装数 2
历史版本数 2
常见问题

Telegram Voice Transcribe 是什么?

Transcribe Telegram voice messages and audio notes into text using the OpenAI Whisper API. Use when (1) a user sends a voice message or audio note via Telegr... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 548 次。

如何安装 Telegram Voice Transcribe?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install telegram-voice-transcribe」即可一键安装,无需额外配置。

Telegram Voice Transcribe 是免费的吗?

是的,Telegram Voice Transcribe 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Telegram Voice Transcribe 支持哪些平台?

Telegram Voice Transcribe 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Telegram Voice Transcribe?

由 Alfredo Bárcena(@dreadterror)开发并维护,当前版本 v1.3.1。

💬 留言讨论