transcription

Name: transcription
Author: djismgaming

By djismgaming · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 482 · Current installs: 1 · Versions: 2

Install in OpenClaw

/install transcription

Description

Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r...

Safe usage recommendations

This skill will run included Python scripts and invoke ffmpeg, then upload whatever audio/video you provide to the hardcoded endpoint http://192.168.0.11:8080/v1. Before installing or using it:
1) Confirm that 192.168.0.11 is a trusted, intended Whisper service (or modify the scripts to point to a trusted endpoint or to read the endpoint from a config/env var).
2) Ensure ffmpeg is installed and the Python 'requests' package is available (the skill metadata does not declare these).
3) Do not send sensitive audio to the skill until you verify the receiving host and its retention/privacy policies.
4) If you need to use an official OpenAI-hosted API, update the endpoint/auth appropriately rather than relying on the hardcoded address.
5) For extra caution, run the scripts on a sandbox machine and test with non-sensitive files first.
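One way to act on the first recommendation is to let the endpoint come from the environment, falling back to the address this listing reports. The following is a minimal sketch, not the skill's actual code: the variable name WHISPER_API_BASE and the function resolve_endpoint are hypothetical.

```python
import os
from urllib.parse import urlparse

# Default taken from this listing; the variable name below is hypothetical
# and is not defined by the skill itself.
DEFAULT_ENDPOINT = "http://192.168.0.11:8080/v1"
ENDPOINT_ENV_VAR = "WHISPER_API_BASE"


def resolve_endpoint(env=None):
    """Return the base URL to use, preferring an environment override."""
    env = os.environ if env is None else env
    url = env.get(ENDPOINT_ENV_VAR, DEFAULT_ENDPOINT).rstrip("/")
    parsed = urlparse(url)
    # Reject anything that is not a plain http(s) URL with a host.
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        raise ValueError(f"Invalid endpoint URL: {url!r}")
    return url
```

Passing a mapping instead of reading os.environ directly keeps the function easy to exercise without touching the real environment.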

Functional analysis

Type: OpenClaw Skill
Name: transcription
Version: 1.0.1

The transcription skill bundle is designed to transcribe audio and video files using a local OpenAI Whisper API endpoint (192.168.0.11). It utilizes ffmpeg for audio extraction from video files and standard Python libraries (requests, subprocess) for processing. The code logic is transparent, aligns with the stated purpose in SKILL.md, and contains no evidence of malicious intent, data exfiltration, or prompt injection. A minor unreachable code block in scripts/transcribe_audio.py appears to be a harmless copy-paste error.

Capability assessment

ℹ Purpose & Capability

The name and description say 'OpenAI Whisper API' and the code calls a Whisper-compatible endpoint, so the two are coherent. Minor mismatch: SKILL.md and the scripts point to a hardcoded local endpoint (http://192.168.0.11:8080/v1) rather than the public OpenAI cloud API. This is plausible (self-hosted Whisper) but should be made explicit to users.
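To check whether an endpoint such as the one above sits on a private network rather than the public internet, a small helper built on Python's standard ipaddress module is enough. This is an illustrative sketch; the function name is_private_endpoint is hypothetical and is not part of the skill.

```python
import ipaddress
from urllib.parse import urlparse


def is_private_endpoint(url):
    """Return True if the URL's host is a literal IP that ipaddress marks private."""
    host = urlparse(url).hostname
    if host is None:
        return False
    try:
        return ipaddress.ip_address(host).is_private
    except ValueError:
        # The host is a DNS name, not a literal IP; this sketch does not resolve it.
        return False
```

For http://192.168.0.11:8080/v1 this returns True, which matches the reading that the skill targets a self-hosted service on the author's own network.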

⚠ Instruction Scope

Runtime instructions and scripts will upload user-supplied audio/video files to a fixed HTTP endpoint (192.168.0.11:8080) and call ffmpeg locally. That means any file you provide will be transmitted to that host; ensure that host is trusted and network access is intended. The instructions do not provide an option to override the endpoint via an environment variable or configuration.
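The local ffmpeg step described above can be sketched as follows. The listing does not show the skill's actual ffmpeg arguments, so the mono/16 kHz settings and both function names here are assumptions chosen for illustration.

```python
import subprocess


def build_ffmpeg_command(src, dst):
    """Build an ffmpeg argument list that writes src's audio track to dst."""
    return [
        "ffmpeg",
        "-y",            # overwrite dst if it already exists
        "-i", src,       # input media file
        "-vn",           # drop any video stream
        "-ac", "1",      # downmix to mono (assumed setting)
        "-ar", "16000",  # resample to 16 kHz (assumed setting)
        dst,
    ]


def extract_audio(src, dst):
    """Run ffmpeg; raises CalledProcessError on a non-zero exit status."""
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
    return dst
```

Passing the arguments as a list rather than a shell string avoids shell interpretation of user-supplied file names.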

ℹ Install Mechanism

There is no install spec (instruction-only), which limits disk writes, but the skill ships Python scripts that require external runtime dependencies. The SKILL metadata does not declare required binaries or Python deps even though the scripts call ffmpeg and import the requests library.
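Because these dependencies are undeclared, a quick preflight check before running the scripts can save a confusing failure. A minimal sketch using only the standard library; the function name missing_dependencies is hypothetical.

```python
import importlib.util
import shutil


def missing_dependencies(binaries=("ffmpeg",), modules=("requests",)):
    """Return a list of required binaries and modules that cannot be found."""
    missing = []
    for name in binaries:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(name) is None:
            missing.append(f"binary:{name}")
    for name in modules:
        # find_spec returns None when a top-level module is not importable.
        if importlib.util.find_spec(name) is None:
            missing.append(f"module:{name}")
    return missing


if __name__ == "__main__":
    problems = missing_dependencies()
    print("All dependencies found." if not problems else f"Missing: {problems}")
```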

ℹ Credentials

The skill requests no credentials or environment variables (good), but it hardcodes an HTTP endpoint and model name. Because no auth is declared, the endpoint is assumed to be unauthenticated; ensure this is correct for your environment. The skill also fails to declare its ffmpeg and Python requests dependencies.
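If the scripts were adapted to an endpoint that does require authentication, the key could be read from the environment rather than hardcoded. This sketch assumes a Bearer-token scheme, as used by the OpenAI-hosted API; the variable name WHISPER_API_KEY and the function build_auth_headers are hypothetical.

```python
import os

# Hypothetical variable name; the skill itself declares no credentials.
API_KEY_ENV_VAR = "WHISPER_API_KEY"


def build_auth_headers(env=None):
    """Return an Authorization header if a key is set, else an empty dict."""
    env = os.environ if env is None else env
    key = env.get(API_KEY_ENV_VAR, "").strip()
    return {"Authorization": f"Bearer {key}"} if key else {}
```

Returning an empty dict when no key is set keeps the unauthenticated local setup working unchanged.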

✓ Persistence & Privilege

The 'always' flag is false, and the skill doesn't request persistent platform privileges or modify other skills. It runs only when invoked and does not declare any elevated host privileges.

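How to use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install transcription
After installation, invoke the skill by name or trigger it with /transcription
Provide the required inputs according to the skill's parameter description to get structured output.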

Version history

v1.0.1

- Added a new script: scripts/transcribe.sh.
- Updated documentation to include manual script usage instructions for direct command-line transcription.
- Removed language selection and auto-detect language features from feature list.
- Simplified usage examples to focus on running scripts via command line.

v1.0.0

Transcription skill initial release:
- Transcribe audio and video files using a local OpenAI Whisper API.
- Supports a wide range of audio (mp3, wav, ogg, etc.) and video (mp4, mov, mkv, etc.) formats.
- Automatic language detection or specify language for transcription.
- Extract timestamps, choose output formats (text, JSON, SRT, VTT).
- Batch processing: send multiple files for simultaneous transcription.
- Automatically extracts audio from video files before transcription.

Metadata

Slug transcription

Version 1.0.1

License MIT-0

Total installs 1

Current installs 1

Version count 2

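FAQ

What is transcription?

Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 482 cumulative downloads to date.

How do I install transcription?

Run the command "/install transcription" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration, but the scripts require ffmpeg and the Python requests package, which the skill metadata does not declare.

Is transcription free?

Yes. transcription is completely free and released under the MIT-0 license; it can be freely downloaded, installed, and used.

Which platforms does transcription support?

transcription is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed transcription?

It is developed and maintained by djismgaming (@djismgaming); the current version is v1.0.1.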