Gladia YouTube Transcription (Free)

by kanfred · GitHub ↗ · v1.0.4
cross-platform ✓ Security Clean
352 Downloads · 2 Stars · 2 Active Installs · 5 Versions
Install in OpenClaw
/install gladia-youtube-transcribe
Description
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...
README (SKILL.md)

Video/Audio Transcription Skill

Overview

This skill provides automated transcription for video and audio content using the Gladia API. It converts spoken content from YouTube videos, MP3 files, or any accessible audio/video URL into text, which can then be summarized by an LLM.

Use Cases

  • YouTube Video Summary - Transcribe YouTube videos for LLM summarization (especially useful for Cantonese/Chinese content without captions)
  • MP3/WAV to Text - Convert audio files to transcript
  • Video Content Extraction - Extract speech from any publicly accessible video URL
  • Podcast Transcription - Convert podcast episodes to text

Service: Gladia API

What is Gladia?

Gladia is an audio transcription API that supports multiple languages including Cantonese. It provides both async (pre-recorded) and real-time transcription.

Free Tier (as of March 2026)

Feature | Free Tier
Monthly transcription | 10 hours
Renewal | Monthly (resets automatically)
New streams limit | 5 per minute
Languages | All included
Cost after quota | $0.61/hour (async) / $0.75/hour (real-time)
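
As a rough illustration of the pricing above, the async overage for a month can be estimated from the 10 free hours and the $0.61/hour rate. The helper name estimate_async_cost is hypothetical and not part of the skill, and the rates are the March 2026 figures quoted here, which may change.

```shell
# Hypothetical helper (not part of the skill): estimate the async overage
# charge in USD, assuming 10 free hours and $0.61 per additional hour.
estimate_async_cost() {
  # $1 = total hours transcribed this month (may be fractional)
  awk -v used="$1" 'BEGIN {
    free = 10; rate = 0.61
    extra = used - free
    if (extra < 0) extra = 0      # still inside the free quota
    printf "%.2f\n", extra * rate
  }'
}

estimate_async_cost 8    # prints 0.00 (within the free quota)
estimate_async_cost 15   # prints 3.05 (5 billable hours)
```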

How to Sign Up

  1. Visit gladia.io
  2. Click "Try for free" → Sign up with email
  3. Go to Dashboard → API Keys
  4. Create a new API key
  5. Copy the key (format: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx)
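
A quick local sanity check can catch copy-and-paste mistakes before any request is sent. The sketch below assumes the x placeholders in the format above stand for hexadecimal digits (a UUID-style string); that is an assumption, and the function name looks_like_gladia_key is hypothetical. A key that passes this check is not necessarily valid; only the API can confirm that.

```shell
# Hypothetical helper: succeed if $1 has the 8-4-4-4-12 shape shown above.
# Assumes the placeholder characters are hexadecimal digits.
looks_like_gladia_key() {
  printf '%s\n' "$1" |
    grep -Eq '^[0-9A-Fa-f]{8}(-[0-9A-Fa-f]{4}){3}-[0-9A-Fa-f]{12}$'
}

# Usage:
#   looks_like_gladia_key "$GLADIA_API_KEY" || echo "key looks malformed" >&2
```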

Checking Usage

  1. Log in to Gladia Dashboard
  2. Navigate to Usage or Billing section
  3. View current month consumption (hours/minutes used)

Alternatively, you can check via API:

curl -X GET "https://api.gladia.io/v2/usage" -H "x-gladia-key: YOUR_API_KEY"

Setup

Step 1: Save Your API Key

Recommended: Set in current session

export GLADIA_API_KEY="your-api-key-here"

Or, if you accept the risk described in the note below, add it to ~/.bashrc (and make sure that file is never committed to version control):

echo 'export GLADIA_API_KEY="your-api-key-here"' >> ~/.bashrc
source ~/.bashrc

Note: Storing secrets in shell rc files is discouraged due to risk of accidental commits. Prefer setting the environment variable directly in your session or use a secrets manager.

Step 2: Verify Key Works

Test that your API key is valid by checking usage:

curl -X GET "https://api.gladia.io/v2/usage" -H "x-gladia-key: $GLADIA_API_KEY"

If successful, you'll see your usage information. If you get an auth error, check your API key.
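
The same check can be scripted by looking at the HTTP status code instead of reading the response body. This is a minimal sketch: only the /v2/usage URL and the x-gladia-key header come from this README; the function names are hypothetical, and the mapping uses generic HTTP semantics (2xx success, 401/403 authentication problems) rather than anything Gladia-specific.

```shell
# Hypothetical helper: print only the HTTP status code of the usage request.
gladia_usage_status() {
  curl -s -o /dev/null -w '%{http_code}' \
    -H "x-gladia-key: $GLADIA_API_KEY" \
    "https://api.gladia.io/v2/usage"
}

# Hypothetical helper: turn a status code into a short hint.
explain_status() {
  case "$1" in
    2??)     echo "key accepted" ;;
    401|403) echo "authentication failed: check GLADIA_API_KEY" ;;
    *)       echo "unexpected response: HTTP $1" ;;
  esac
}

# Usage (performs a network request):
#   explain_status "$(gladia_usage_status)"
```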

How to Use

Command Line

# Navigate to skill directory (where you installed the skill)
cd /path/to/video-transcription

# Basic usage
./scripts/youtube_transcribe.sh "YOUTUBE_URL"

# Save to specific file
./scripts/youtube_transcribe.sh "YOUTUBE_URL" /path/to/output.txt

Via OpenClaw

  1. Provide a YouTube URL or video link
  2. The skill will:
    • Submit transcription job to Gladia
    • Poll for completion (~1-2 min for 10-15 min videos)
    • Return the full transcript
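
The submit-and-poll flow described above can be sketched as follows. This is not the skill's actual script. The endpoint path /v2/pre-recorded, the status values "done" and "error", and the response fields id and result.transcription.full_transcript are assumptions based on Gladia's pre-recorded API and should be checked against its current documentation. All function names, the 5-second interval, and the 120-attempt limit are hypothetical.

```shell
# Read a (possibly dotted) field from a JSON document on stdin using python3.
json_field() {
  python3 -c '
import json, sys
value = json.load(sys.stdin)
for key in sys.argv[1].split("."):
    value = value[key]
print(value)
' "$1"
}

# Submit a public audio/video URL ($1) and print the job id.
# Assumed endpoint and response field; not confirmed by this README.
gladia_submit() {
  body=$(python3 -c 'import json, sys; print(json.dumps({"audio_url": sys.argv[1]}))' "$1")
  curl -sS -X POST "https://api.gladia.io/v2/pre-recorded" \
    -H "x-gladia-key: $GLADIA_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$body" | json_field id
}

# Poll job $1 every 5 seconds (up to 120 attempts) and print the transcript.
gladia_wait() {
  attempts=0
  while [ "$attempts" -lt 120 ]; do
    response=$(curl -sS "https://api.gladia.io/v2/pre-recorded/$1" \
      -H "x-gladia-key: $GLADIA_API_KEY")
    status=$(printf '%s' "$response" | json_field status)
    case "$status" in
      "done")
        printf '%s' "$response" | json_field result.transcription.full_transcript
        return 0 ;;
      "error")
        echo "transcription failed" >&2
        return 1 ;;
    esac
    attempts=$((attempts + 1))
    sleep 5
  done
  echo "timed out waiting for transcript" >&2
  return 1
}

# Usage (performs network requests and consumes quota):
#   job_id=$(gladia_submit "YOUTUBE_URL") && gladia_wait "$job_id"
```

Building the request body with json.dumps avoids quoting problems when the URL contains characters that are special in JSON.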

Script Location

/path/to/video-transcription/scripts/youtube_transcribe.sh

Configuration

Environment Variables

Variable | Required | Description
GLADIA_API_KEY | Yes | Your Gladia API key

Output

The script saves transcripts to a transcripts/ subdirectory in the skill folder:

/path/to/video-transcription/transcripts/

Privacy & Security Notes

  • NEVER share your API key publicly
  • NEVER include your API key in any skill documentation or code commits
  • IMPORTANT: Do NOT store API keys in shell rc files (~/.bashrc, ~/.zshrc) or config files that might be committed to version control
  • Use session-only environment variables: export GLADIA_API_KEY='your-key'
  • Or use a secrets manager (e.g., 1Password, AWS Secrets Manager)
  • Output transcripts may contain sensitive content - handle accordingly

Limitations

  • Video must be publicly accessible (no private/unlisted content)
  • Audio quality affects transcription accuracy
  • Some copyrighted content may have restrictions
  • Processing time depends on video length (~10 seconds per minute of video)
  • Free quota resets monthly; excess usage incurs charges
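
For planning purposes, the rule of thumb above (~10 seconds of processing per minute of video) can be turned into a small estimate. The helper name estimate_processing_seconds is hypothetical, and actual times will vary with load and audio content.

```shell
# Hypothetical helper: rough processing time in seconds for a video of
# $1 whole minutes, using the ~10 seconds per minute rule of thumb.
estimate_processing_seconds() {
  echo $(( $1 * 10 ))
}

estimate_processing_seconds 12   # prints 120 (about 2 minutes)
```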

Troubleshooting

"Failed to call the url"

  • The video URL may be inaccessible or private
  • Try a different video URL

"Quota exceeded"

  • You've reached the 10-hour monthly limit
  • Wait for quota reset next month, or upgrade to paid plan

"Authentication failed"

  • Check your API key is correct
  • Ensure GLADIA_API_KEY environment variable is set
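
To rule out a missing variable before suspecting the key itself, a check along these lines can help. The function name check_gladia_key is hypothetical; it reports only the key length so the key is never printed.

```shell
# Hypothetical helper: confirm GLADIA_API_KEY is set without printing it.
check_gladia_key() {
  if [ -z "${GLADIA_API_KEY:-}" ]; then
    echo "GLADIA_API_KEY is not set" >&2
    return 1
  fi
  echo "GLADIA_API_KEY is set (${#GLADIA_API_KEY} characters)"
}

# Usage:
#   check_gladia_key || export GLADIA_API_KEY="your-api-key-here"
```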

Alternative Services

If Gladia quota is exhausted:

Service | Free Tier | Notes
AssemblyAI | Limited | Requires credit card
Deepgram | $0 credit | Pay-per-use
YouTube Transcript | Free (if available) | Only works if video has captions

Future Enhancements

Potential improvements:

  • Add speaker diarization (identify different speakers)
  • Support real-time transcription
  • Automatic LLM summarization after transcription
  • Multi-language translation
  • Save transcripts to cloud storage

Last updated: March 2026

Usage Guidance
This skill appears to do exactly what it claims: call the Gladia pre-recorded transcription API for a provided public audio/video URL and save the transcript locally. Before installing:
  1. Only provide public URLs (the script and docs already advise against private/unlisted content).
  2. Keep your GLADIA_API_KEY secret: prefer session-only env vars or a secrets manager rather than committing it to dotfiles; the SKILL.md warns about this.
  3. Be aware of Gladia quota/charges (10 free hours/month) and that transcripts may contain sensitive data.
  4. The script writes transcripts into a transcripts/ subdirectory under the skill folder; verify filesystem permissions and cleanup policies if you expect sensitive output.
  5. If you do not want the agent to call this skill autonomously, disable autonomous invocation at the agent/platform level.
Capability Analysis
Type: OpenClaw Skill
Name: gladia-youtube-transcribe
Version: 1.0.4

The skill is a legitimate tool for transcribing audio and video content using the Gladia API. The primary script, `scripts/youtube_transcribe.sh`, performs standard API interactions (POST/GET) to submit jobs and retrieve results, using Python for safe JSON parsing. The documentation in `SKILL.md` demonstrates security awareness by explicitly advising users against insecure practices like hardcoding API keys or storing them in shell configuration files.
Capability Assessment
Purpose & Capability
The name/description say 'transcribe YouTube/audio via Gladia'. The only required credential is GLADIA_API_KEY and required binaries are curl and python3 — all directly used by the included script to call Gladia endpoints. There are no unrelated credentials, binaries, or config paths.
Instruction Scope
SKILL.md and the shell script stay within the transcription task: they instruct how to set GLADIA_API_KEY, submit an audio_url to Gladia, poll for results, and save transcripts locally. The instructions do not ask the agent to read unrelated files, system secrets, or transmit data to endpoints other than api.gladia.io. The doc does mention storing keys in ~/.bashrc as an option but also warns against it.
Install Mechanism
There is no install spec and the skill is instruction-only plus a small shell script. Nothing is downloaded or written by an installer; risk from installation is minimal.
Credentials
Only GLADIA_API_KEY is required, which is proportionate for an API-based transcription service. The script only reads this env var. No additional secrets or unrelated API keys are requested.
Persistence & Privilege
The skill is not always-enabled (always:false) and does not request elevated or persistent system privileges. It does not modify other skills or system-wide settings. The normal platform default allowing autonomous invocation remains (disable-model-invocation:false).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install gladia-youtube-transcribe
  3. After installation, invoke the skill by name or use /gladia-youtube-transcribe
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.4
  • Moved the main script from the top level to scripts/youtube_transcribe.sh for better organization.
  • Updated documentation to reflect the new script location in usage instructions and file paths.
  • Removed the old top-level youtube_transcribe.sh script.
v1.0.3
  • Improved instructions for verifying Gladia API key, now recommending direct API usage check.
  • Updated privacy guidance: strongly advise against storing API keys in shell rc/config files and encourage using session-only environment variables or secrets managers.
  • No changes to core functionality; documentation and security advice improved for safer usage.
v1.0.2
  • Added metadata field "credentials" indicating GLADIA_API_KEY as a required credential.
  • Improved setup instructions to highlight best practice: recommend setting the API key in the current session and warn against storing secrets in shell rc files.
  • Minor clarifications throughout documentation for better security and clarity.
  • No changes to code or main functionality; documentation update only.
v1.0.1
  • Added OpenClaw metadata block with usage description, emoji, and environment/bin requirements to SKILL.md.
  • Updated privacy notes to emphasize never committing ~/.bashrc with API keys to version control.
  • No changes to source code or usage, only to documentation/metadata.
v1.0.0
  • Initial release of the skill for automated video/audio transcription using the Gladia API.
  • Supports transcription from YouTube videos, MP3/WAV files, and accessible video URLs.
  • CLI tool for submitting jobs and saving transcripts; output organized in a transcripts/ folder.
  • Strong focus on Cantonese/Chinese content and privacy best practices.
  • Usage limited by Gladia’s 10-hour/month free tier; instructions provided for API key management and troubleshooting.
Metadata
Slug gladia-youtube-transcribe
Version 1.0.4
License
All-time Installs 2
Active Installs 2
Total Versions 5
Frequently Asked Questions

What is Gladia YouTube Transcription (Free)?

Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa... It is an AI Agent Skill for Claude Code / OpenClaw, with 352 downloads so far.

How do I install Gladia YouTube Transcription (Free)?

Run "/install gladia-youtube-transcribe" in the OpenClaw or Claude Code chat to install it in one step. Before first use, set the GLADIA_API_KEY environment variable as described under Setup.

Is Gladia YouTube Transcription (Free) free?

Yes, the skill itself is completely free (open-source) to download, install, and use. Transcription runs on Gladia's free tier of 10 hours per month; usage beyond that quota is billed by Gladia.

Which platforms does Gladia YouTube Transcription (Free) support?

Gladia YouTube Transcription (Free) is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Gladia YouTube Transcription (Free)?

It is built and maintained by kanfred (@kanfred); the current version is v1.0.4.
