← Back to Skills Marketplace
mehediahamed

Youtube Transcription Generator

by MehediAhamed · GitHub ↗ · v0.1.0
cross-platform ⚠ suspicious
757
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install youtube-transcription-generator
Description
Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options.
Usage Guidance
This skill's goal (download a YouTube video with yt-dlp and transcribe it via vlmrun) is plausible, but the package metadata and the included files are inconsistent with the instructions. Before installing or following the steps: 1) Ask the publisher for the source repository or homepage and for the missing files (requirements.txt, scripts/run_transcription.py, .env_template). 2) Confirm what the 'uv' command prefix means — it's not a standard system command and could be a wrapper you don't expect. 3) Be aware that running vlmrun will send your video/audio to an external VLM Run service (privacy/copyright risk). 4) Do not have the assistant read your .env or other configuration files unless you explicitly confirm which variables it may access; VLMRUN_API_KEY should be the only declared secret if needed. 5) If you decide to run any provided scripts, open and inspect them locally (or have a trusted reviewer do so) to confirm they only call yt-dlp and vlmrun and do not exfiltrate other data. Resolving the metadata omissions and providing the missing repository files would increase confidence; until then treat this skill as suspicious.
Capability Analysis
Type: OpenClaw Skill Name: youtube-transcription-generator Version: 0.1.0 The skill bundle's `_meta.json` and `SKILL.md` files describe a legitimate function: transcribing YouTube videos using `yt-dlp` for download and `vlmrun` for transcription. The `SKILL.md` provides clear, task-specific instructions for the AI agent, including checking for an API key and executing local commands. There is no evidence of prompt injection attempts, data exfiltration, unauthorized actions, or other malicious intent within the provided files. The commands shown are standard usage for the stated tools, and file operations are local and expected for the skill's purpose.
Capability Assessment
Purpose & Capability
The stated purpose (download YouTube with yt-dlp, transcribe with vlmrun) is internally consistent. However the registry metadata lists no required binaries or env vars while the SKILL.md explicitly requires yt-dlp, vlmrun, Python, and VLMRUN_API_KEY. That mismatch between metadata and runtime instructions is unexpected and should be clarified.
Instruction Scope
SKILL.md instructs checking .env/.env.local for VLMRUN_API_KEY and running local scripts (e.g., scripts/run_transcription.py), installing requirements.txt, and using yt-dlp and vlmrun. The skill bundle contains no code files, no requirements.txt, and no .env_template — the instructions assume files that are not present. The doc also uses an unexplained 'uv' prefix for venv and pip commands. These gaps mean the assistant could give commands or expect artifacts that don't exist, and the SKILL.md has authority to instruct the agent to read .env (sensitive) and to send video data to an external VLM Run service (expected for transcription, but a privacy/exfiltration consideration).
Install Mechanism
There is no install spec (instruction-only), so nothing is written to disk by a supplied installer. This minimizes direct installer risk. However, the instructions ask the user/agent to install vlmrun[cli] and yt-dlp via pip — that's normal for this workflow but not declared in the registry metadata.
Credentials
The SKILL.md requires VLMRUN_API_KEY in .env, but the registry lists no required environment variables or primary credential. Asking to read .env/.env.local is sensitive because these files can contain other secrets; the skill should explicitly declare the env vars it needs. Otherwise the assistant might be instructed to examine or rely on environment config that wasn't disclosed.
Persistence & Privilege
The skill is not forced-always, does not request persistent privileges, and does not claim to modify other skills or agent-wide settings. Autonomous invocation is allowed (platform default) but not combined with other elevated flags.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install youtube-transcription-generator
  3. After installation, invoke the skill by name or use /youtube-transcription-generator
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
YouTube Transcription Generator skill, initial release: - Generate transcriptions from YouTube videos using the vlmrun CLI and yt-dlp. - Supports transcripts with optional timestamps and various output formats. - Requires VLMRUN_API_KEY in .env and setup of both yt-dlp and vlmrun CLI. - Guides provided for installation, quickstart, workflow, and troubleshooting. - Offers example prompts and manual usage instructions for flexibility.
Metadata
Slug youtube-transcription-generator
Version 0.1.0
License
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Youtube Transcription Generator?

Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to transcribe with optional timestamps. VLMRUN_API_KEY must be in .env; follow vlmrun-cli-skill for CLI setup and options. It is an AI Agent Skill for Claude Code / OpenClaw, with 757 downloads so far.

How do I install Youtube Transcription Generator?

Run "/install youtube-transcription-generator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Youtube Transcription Generator free?

Yes, Youtube Transcription Generator is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Youtube Transcription Generator support?

Youtube Transcription Generator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Youtube Transcription Generator?

It is built and maintained by MehediAhamed (@mehediahamed); the current version is v0.1.0.

💬 Comments