/install burmese-audio-understanding
Burmese Audio Understanding Skill
This skill allows you to transcribe Burmese audio (voice notes, speech) directly into Burmese text using your own Google Gemini API key. It uses the official Google GenAI SDK for secure and reliable file handling.
Required Environment Variables
GEMINI_API_KEY: Required. Set your Google Gemini API key to allow the skill to access transcription services.
Usage
Ensure GEMINI_API_KEY is set in your environment, then run:
node scripts/transcribe-direct.js /path/to/my-audio.ogg
Features
- Official SDK: Uses the official
@google/genaiSDK. - Improved Security: No shell commands (ffmpeg/child_process) used; file processing is handled via SDK file upload directly to Gemini.
- Model: Uses
gemini-3.1-flash-previewfor high-quality audio transcription.
Security Notes
- This skill sends audio data to Google Gemini API for transcription.
- No data is stored locally after processing.
- Requires a valid GEMINI_API_KEY with minimal permissions.
Prerequisites
- Dependencies must be installed:
npm install @google/genai.
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install burmese-audio-understanding - After installation, invoke the skill by name or use
/burmese-audio-understanding - Provide required inputs per the skill's parameter spec and get structured output
What is Burmese Audio Understanding?
High-accuracy Burmese audio transcription using Gemini 3.1 Flash Preview. It is an AI Agent Skill for Claude Code / OpenClaw, with 182 downloads so far.
How do I install Burmese Audio Understanding?
Run "/install burmese-audio-understanding" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Burmese Audio Understanding free?
Yes, Burmese Audio Understanding is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Burmese Audio Understanding support?
Burmese Audio Understanding is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Burmese Audio Understanding?
It is built and maintained by thelapyae (@thelapyae); the current version is v1.2.2.