← 返回 Skills 市场
itsfabioroma

Transcribee 🐝

作者 itsfabioroma · GitHub ↗ · v1.2.1
cross-platform ⚠ suspicious
3185
总下载
6
收藏
10
当前安装
4
版本数
在 OpenClaw 中安装
/install transcribee
功能描述
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
安全使用建议
Before installing or enabling this skill: 1) Expect it to require two API keys (ELEVEN_LABS_API_KEY and ANTHROPIC_API_KEY) and system binaries (yt-dlp, ffmpeg) even though the registry listing omitted them — verify and supply keys only if you trust those services. 2) Be aware audio and transcripts are uploaded to external services; if privacy-sensitive audio will be transcribed, consider running a local-only alternative. 3) The skill will read your ~/Documents/transcripts/ library and write new transcript folders there — review or sandbox it if you don't want that folder modified. 4) Verify the skill's source (there is no homepage listed) — prefer installing from a trusted repo and inspect the .env.example and index.ts for any extra endpoints or hardcoded secrets. 5) If you allow autonomous agent invocation, consider restricting its access or running the skill manually the first few times to confirm behavior. If you want to go ahead, run it in an isolated environment, provide least-privilege API keys, and review the code for any hidden network calls not documented in README/CLAUDE.md.
功能分析
Type: OpenClaw Skill Name: transcribee Version: 1.2.1 The skill is classified as benign. It transparently uses `yt-dlp` and `ffmpeg` for media processing and `ElevenLabs` and `Anthropic` APIs for transcription and categorization, which aligns with its stated purpose. External command execution is handled using `execFileAsync`, which is a safer method than `exec` as it prevents shell injection. Output files are saved to a user-owned directory (`~/Documents/transcripts`). There is no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the OpenClaw agent itself; the prompt engineering observed is for the internal Anthropic LLM used for categorization.
能力评估
Purpose & Capability
Name/description (transcribing YouTube/local files with diarization and auto-organization) matches the included code. However the registry metadata claims no required env vars/binaries while the code clearly requires ELEVEN_LABS_API_KEY and ANTHROPIC_API_KEY and expects yt-dlp/ffmpeg. README/CLAUDE.md mention Instagram and TikTok support, but the shipped wrapper (transcribe.sh) warns only about YouTube — inconsistent scope/claims.
Instruction Scope
Runtime instructions and scripts run yt-dlp/ffmpeg (downloads/extracts media), call ElevenLabs and Anthropic SDKs, and read/write the user's library at ~/Documents/transcripts/. The code reads existing transcripts to decide categories. It does not appear to access unrelated system credentials, but it will transmit user audio/transcripts to external services (ElevenLabs and Anthropic) — a privacy/telemetry consideration that is expected but worth noting.
Install Mechanism
There is no install spec; the package includes a package.json and pnpm lock (uses npm packages 'elevenlabs' and '@anthropic-ai/sdk'). This is a moderate-risk, standard npm dependency surface — no arbitrary download URLs or extract-from-remote artifacts were found. Running pnpm install will pull dependencies from public registries.
Credentials
The code requires ELEVEN_LABS_API_KEY and ANTHROPIC_API_KEY (and expects a local .env in the skill directory), and expects system binaries yt-dlp and ffmpeg. The registry metadata reported no required env vars or binaries — that's a clear mismatch. Requesting API keys for the transcription and classification services is reasonable for the stated purpose, but the omission in metadata is a red flag (it hides required credentials).
Persistence & Privilege
always:false and the skill does not request elevated system privileges. It reads/writes a folder under the user's home (~/Documents/transcripts) and creates temporary audio in OS tmpdir. Autonomous invocation is allowed (platform default) — combine that with external API access if you intend to allow agent-initiated runs.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install transcribee
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /transcribee 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.2.1
Add TikTok support to docs
v1.2.0
Rich metadata + cleaner output (2 files default, --raw for timestamps)
v1.1.0
Add Instagram Reels support
v1.0.0
- Initial release of Transcribee. - Transcribe YouTube videos and local audio/video files with speaker diarization. - Outputs clean, speaker-labeled transcripts ready for analysis. - Supports various audio (mp3, m4a, wav, ogg, flac) and video (mp4, mkv, webm, mov, avi) formats. - Organizes transcripts and metadata in a structured output directory. - Includes troubleshooting steps and clear usage instructions.
元数据
Slug transcribee
版本 1.2.1
许可证
累计安装 10
当前安装数 10
历史版本数 4
常见问题

Transcribee 🐝 是什么?

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 3185 次。

如何安装 Transcribee 🐝?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install transcribee」即可一键安装,无需额外配置。

Transcribee 🐝 是免费的吗?

是的,Transcribee 🐝 完全免费(开源免费),可自由下载、安装和使用。

Transcribee 🐝 支持哪些平台?

Transcribee 🐝 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Transcribee 🐝?

由 itsfabioroma(@itsfabioroma)开发并维护,当前版本 v1.2.1。

💬 留言讨论