Web Video Transcribe DOCX

Name: Web Video Transcribe DOCX
Author: c-narcissus

Author c-narcissus · GitHub · v1.0.2 · MIT-0

cross-platform ✓ Security check passed

Install in OpenClaw

/install web-video-transcribe-docx

Description

Offline-first workflow for turning Chinese web page video or audio into text and Word deliverables. Use when Codex needs to (1) extract playable media stream...

Security recommendations

This skill is coherent with its purpose, but it will: (1) download a large ASR model from GitHub into a user cache the first time you run it, (2) pip-install Python packages if you run scripts/bootstrap_env.py, and (3) use Playwright and a local Chrome/Edge to extract media from pages when needed. These actions require network access and write files to your user cache and to whatever output directory you choose. Before you install or run it: review the bootstrap step, run initial tests on non-sensitive, public pages, consider using an isolated environment (virtualenv or container), and inspect the small agents/openai.yaml file if you want to confirm there are no hidden external endpoints or API keys. The skill explicitly avoids capturing cookies or tokens and applies a safe-path check when extracting the model archive.

Functional analysis

Type: OpenClaw Skill
Name: web-video-transcribe-docx
Version: 1.0.2

The skill provides a legitimate and well-documented workflow for extracting media from web pages, performing offline transcription using SenseVoice (via sherpa-onnx), and generating DOCX files. The scripts (e.g., `extract_web_media.py`, `transcribe_sensevoice.py`) align with the stated purpose. Security-wise, the code includes a path traversal check in `pipeline_common.py` during model extraction, and `SKILL.md` explicitly instructs the agent not to exfiltrate sensitive data like cookies or credentials. All external network calls are directed to user-provided URLs or the official GitHub repository for the ASR models.
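
The path traversal check mentioned above might look roughly like the following. This is a minimal sketch, not the code in `pipeline_common.py`: the function names `is_within_directory` and `safe_extract` are hypothetical, and the sketch only validates member names (it does not inspect symlink targets). Recent Python versions also offer extraction filters on `tarfile.extractall`, which may be preferable where available.

```python
import os
import tarfile


def is_within_directory(base_dir: str, target_path: str) -> bool:
    """Return True if target_path resolves to a location inside base_dir."""
    base = os.path.realpath(base_dir)
    target = os.path.realpath(target_path)
    return os.path.commonpath([base, target]) == base


def safe_extract(archive_path: str, dest_dir: str) -> None:
    """Extract a tar archive, refusing members that would escape dest_dir.

    Hypothetical helper for illustration; the skill's real check may differ.
    """
    with tarfile.open(archive_path) as tar:
        # Validate every member before writing anything to disk.
        for member in tar.getmembers():
            member_path = os.path.join(dest_dir, member.name)
            if not is_within_directory(dest_dir, member_path):
                raise ValueError(f"Unsafe path in archive: {member.name}")
        tar.extractall(dest_dir)
```

Validating all members before calling `extractall` means a malicious archive is rejected as a whole rather than partially unpacked.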

Capability assessment

✓ Purpose & Capability

The skill's name and description (extract web media, download streams, run local SenseVoice ASR, produce TXT/DOCX) align with the included scripts. Declared runtime requirement (python) matches the Python scripts. Required actions such as browser automation, media downloading, ffmpeg usage, and model download are expected for this functionality.

✓ Instruction Scope

SKILL.md and the scripts confine behavior to extracting media URLs from pages, downloading media, running local ASR, and producing DOCX. The runtime instructions explicitly state not to request/store cookies or tokens and not to bypass DRM or logins. Extractors capture request headers but sanitize them (only keeping Referer/Origin), and pipelines only operate on user-supplied or page-extracted URLs.
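
The header sanitization described above can be sketched as an allowlist filter. This is an illustrative assumption about how such a filter could work; `ALLOWED_HEADERS` and `sanitize_headers` are hypothetical names, not identifiers taken from the skill's scripts.

```python
# Headers a downloader may replay; everything else (Cookie, Authorization, ...) is dropped.
ALLOWED_HEADERS = {"referer", "origin"}


def sanitize_headers(headers: dict[str, str]) -> dict[str, str]:
    """Keep only Referer/Origin, matching header names case-insensitively."""
    return {
        name: value
        for name, value in headers.items()
        if name.lower() in ALLOWED_HEADERS
    }


captured = {
    "Referer": "https://example.com/watch",
    "Cookie": "session=secret",
    "Authorization": "Bearer token",
    "origin": "https://example.com",
}
print(sanitize_headers(captured))
```

An allowlist is the safer design here: any header not explicitly permitted is discarded, so new credential-bearing headers cannot leak through by default.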

ℹ Install Mechanism

No marketplace install spec is present (instruction-only), but the bootstrap script will pip-install several Python packages and can run Playwright's browser installer if invoked. The SenseVoice model is downloaded from a GitHub releases URL and extracted with a safe-path check; these behaviors are appropriate for the task but do involve network downloads and installing Python packages on first run, which is expected but worth noting.

✓ Credentials

The skill requests no environment variables or external credentials. It does look for a local Chrome/Edge executable and writes cache/model files to a per-user cache directory. The only remote endpoint used for code operation is a GitHub releases URL to download the ASR model (appropriate).
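
Locating a local browser and a per-user cache directory, as described above, could be done along these lines. The listing does not show the skill's actual lookup logic, so the candidate executable names, the cache locations, and the function names below are all assumptions chosen for illustration.

```python
import os
import shutil
import sys
from pathlib import Path
from typing import Optional, Sequence

# Hypothetical executable names; the skill's real lookup list is not shown in this listing.
BROWSER_CANDIDATES = ("google-chrome", "chrome", "chromium", "msedge", "microsoft-edge")


def find_browser(candidates: Sequence[str] = BROWSER_CANDIDATES) -> Optional[str]:
    """Return the first candidate executable found on PATH, or None."""
    for name in candidates:
        path = shutil.which(name)
        if path:
            return path
    return None


def user_cache_dir(app_name: str) -> Path:
    """Return a conventional per-user cache directory for app_name (not created)."""
    if sys.platform == "win32":
        base = Path(os.environ.get("LOCALAPPDATA", str(Path.home() / "AppData" / "Local")))
    elif sys.platform == "darwin":
        base = Path.home() / "Library" / "Caches"
    else:
        base = Path(os.environ.get("XDG_CACHE_HOME", str(Path.home() / ".cache")))
    return base / app_name
```

Note that a PATH lookup alone will miss browsers installed in default application folders on Windows and macOS, so a real implementation would likely check those locations as well.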

✓ Persistence & Privilege

The skill is not 'always: true' and does not claim to modify other skills or global agent settings. It writes files to its own cache and output directories and installs packages only when the bootstrap script is run; this level of presence is appropriate for the described offline transcription workflow.

How to use

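Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install web-video-transcribe-docx
Once installation completes, invoke the Skill by name or trigger it with /web-video-transcribe-docx
Provide the required inputs as described in the Skill's parameter documentation to get structured output.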

Version history

v1.0.2

Initial release of web-video-transcribe-docx.
- Provides an offline-first workflow for converting Chinese web page video/audio into text and Word documents.
- Supports extraction of media streams (video/audio) from web pages, including MP4, M3U8, MPD, and split audio streams.
- Includes pipelines and scripts for media extraction, download (with custom headers), offline transcription using SenseVoice ASR, and rendering to TXT/DOCX.
- Special handling for Toutiao pages alongside generic web sources.
- Raw and refined transcripts are managed separately for accuracy and auditability.
- Stays within ethical and legal boundaries (no DRM bypass, no credential handling, no unauthorized downloads).
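
To illustrate how the stream types named in the release notes (MP4, M3U8, MPD) can be told apart, here is a small sketch that classifies a media URL by its path extension. It is an assumption for illustration only: `MEDIA_KINDS` and `classify_media_url` are hypothetical names, and a real extractor would probably also consult response content types, since many streams are served from URLs without a telling extension.

```python
import posixpath
from typing import Optional
from urllib.parse import urlparse

# Extension-to-kind mapping (hypothetical labels).
MEDIA_KINDS = {
    ".mp4": "mp4",    # progressive MP4 file
    ".m3u8": "hls",   # HLS playlist
    ".mpd": "dash",   # MPEG-DASH manifest
}


def classify_media_url(url: str) -> Optional[str]:
    """Classify a URL by the extension of its path, ignoring query and fragment."""
    path = urlparse(url).path
    ext = posixpath.splitext(path)[1].lower()
    return MEDIA_KINDS.get(ext)


for candidate in (
    "https://example.com/v/index.m3u8?token=abc",
    "https://example.com/a/VIDEO.MP4",
    "https://example.com/page.html",
):
    print(candidate, "->", classify_media_url(candidate))
```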

Metadata

Slug web-video-transcribe-docx

Version 1.0.2

License MIT-0

Total installs 0

Current installs 0

Versions released 1

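FAQ

What is Web Video Transcribe DOCX?

Offline-first workflow for turning Chinese web page video or audio into text and Word deliverables. Use when Codex needs to (1) extract playable media stream... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 79 total downloads to date.

How do I install Web Video Transcribe DOCX?

Run the command "/install web-video-transcribe-docx" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration; the first run downloads the ASR model, and scripts/bootstrap_env.py pip-installs Python packages if you run it.

Is Web Video Transcribe DOCX free?

Yes. Web Video Transcribe DOCX is completely free and released under the MIT-0 license; you can download, install, and use it freely.

Which platforms does Web Video Transcribe DOCX support?

Web Video Transcribe DOCX is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Web Video Transcribe DOCX?

It is developed and maintained by c-narcissus (@c-narcissus); the current version is v1.0.2.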