← Back to Skills Marketplace

video-transcript-pro

Name: video-transcript-pro
Author: artminding

by Yi,Li （李祎） · GitHub ↗ · v2.3.0 · MIT-0

cross-platform ⚠ suspicious

160

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install video-transcript-pro

Description

自动转录视频音频，智能修正润色并生成逐字稿及知乎、微信、小红书多平台发布稿，支持用户个性化定制。

Usage Guidance

The skill appears to do what it claims (local faster-whisper transcription + automated polishing and multi-platform formatting). Before installing, consider: 1) Privacy — SKILL.md says it will perform automatic web searches to add background info and will do so without asking; if your transcripts contain sensitive data that could be sent to external search endpoints, disable browsing/web tools or avoid using this skill for sensitive content. 2) Local requirements — it expects Python and faster-whisper/ctranslate2; models and GPU libraries may be downloaded/used locally. 3) Persistence — it saves user preferences to MEMORY.md in the working directory. 4) Behavior control — the skill auto-continues after a 3-minute timeout; if you want manual approval for network lookups or publication, change that workflow or monitor the session. If you require higher assurance, review the two Python files and run the scripts in an isolated/test environment with non-sensitive sample files first.

Capability Analysis

Type: OpenClaw Skill Name: video-transcript-pro Version: 2.3.0 The skill provides a legitimate workflow for video transcription and content optimization but is classified as suspicious due to high-risk autonomous behaviors and broad capabilities. Specifically, SKILL.md instructs the agent to automatically proceed with execution after a 3-minute timeout without user confirmation, which could lead to unintended resource consumption or actions. Additionally, the skill utilizes risky capabilities such as shell execution (transcribe_simple.py and transcribe_segmented.py) and network access (web_search/web_fetch), which, while aligned with the stated purpose, increase the overall attack surface. The Python scripts themselves appear functional and lack intentional malware, but the lack of input validation on command-line arguments and the autonomous instructions in the markdown files represent a vulnerability-prone design.

Capability Assessment

✓ Purpose & Capability

Name/description (video/audio transcription, polishing, multi-platform outputs) match the included files: two Python transcription scripts using faster-whisper/faster_whisper and a detailed SKILL.md. Required binaries/credentials/configs are none — proportionate for a local transcription skill that expects Python and Whisper-related packages.

ℹ Instruction Scope

SKILL.md stays on-task (transcribe → auto-correct → polish → format for platforms). It instructs the agent to perform automatic web searches ('知识补充（联网搜索）') without asking the user and to record user preferences in MEMORY.md. That behavior is plausible for improving outputs but means transcript content may be sent to external search/fetch tools (depending on the agent's browsing/web tools). Also the skill auto-continues after 3 minutes if the user doesn't reply.

✓ Install Mechanism

No install spec is provided (instruction-only + Python scripts). The code depends on faster-whisper/ctranslate2/Whisper-style models which are expected for this purpose; model downloads and native GPU libraries are normal for Whisper usage. There are no downloads from untrusted URLs in the package manifest.

✓ Credentials

The skill declares no required environment variables or credentials. The scripts access local files and GPU libraries (DLL paths) which is appropriate. One operational note: SKILL.md expects to use web_search/web_fetch but declares no API keys — the agent will rely on whatever browsing/web tools are available, which could cause network transmission of transcript text if enabled.

ℹ Persistence & Privilege

always:false (no forced global presence). The skill writes/reads a local MEMORY.md to remember user preferences (explicit in SKILL.md), which is reasonable but is persistent local state. The combination of auto-continue (3-minute timeout) and Automatic web searches means the skill can act autonomously on user content unless the agent's web/tools are restricted.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install video-transcript-pro
After installation, invoke the skill by name or use /video-transcript-pro
Provide required inputs per the skill's parameter spec and get structured output

Version History

v2.3.0

v2.3: 完善多平台发布稿字数对照表，增强金句提取规则，添加配图提示词规范

v2.2.0

优化版/知乎/微信/小红书字数要求全部增加50%，支持多平台输出

Metadata

Slug video-transcript-pro

Version 2.3.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is video-transcript-pro?

自动转录视频音频，智能修正润色并生成逐字稿及知乎、微信、小红书多平台发布稿，支持用户个性化定制。 It is an AI Agent Skill for Claude Code / OpenClaw, with 160 downloads so far.

How do I install video-transcript-pro?

Run "/install video-transcript-pro" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is video-transcript-pro free?

Yes, video-transcript-pro is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does video-transcript-pro support?

video-transcript-pro is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created video-transcript-pro?

It is built and maintained by Yi,Li （李祎） (@artminding); the current version is v2.3.0.

More Skills